Fix R3 model for OOF Policy Optimization