
Sunday Dec 03, 2023
Episode 11.21: Solving mathematical problems using Process (PRM) and Outcome Reward (ORM) Models.
A footnote to the OpenAI saga involving trying to improve the AIs’ mathematical reasoning.
Sunday Dec 03, 2023
A footnote to the OpenAI saga involving trying to improve the AIs’ mathematical reasoning.
No comments yet. Be the first to say something!