News

Presentation: Fine Tuning the Enterprise: Reinforcement Learning in Practice

  • Wenjie Zi, Will Hang (InfoQ.com)
  • published date: 2026-07-03 09:22:00 UTC

The speakers discuss Agent RFT, OpenAI’s platform for fine-tuning reasoning models via real-time tool interactions and custom reward signals. They explain how reinforcement learning solves complex credit assignment challenges within the context window. They s…

Transcript

Will Hang: I'm Will.

Wenjie Zi: I'm Wenjie.

Will Hang: We're on the fine-tuning team at OpenAI. We're excited to talk to you today about Agent RFT, the most powerful way to enhance the …