Merge "Fix R3 model for OOF Policy Optimization"