Almanac
technique

reinforcement fine-tuning

techniqueactivereinforcement-fine-tuning-f86bb8aa·2 events·first seen 28d ago

Aliases: reinforcement fine-tuning

Co-occurring entities

More like this (12)

Recent events (2)

7Openai Blog·28d ago·source ↗

OpenAI Introduces AgentKit, Expanded Evals, and Reinforcement Fine-Tuning for Agents

OpenAI has released a suite of developer tools aimed at accelerating agent development from prototype to production. The release includes AgentKit (a new agent-building framework), expanded evaluation capabilities, and reinforcement fine-tuning (RFT) specifically designed for agentic use cases. These tools represent OpenAI's continued push to provide end-to-end infrastructure for building and deploying AI agents at scale.

5Openai Blog·28d ago·source ↗

Doppel's AI Defense System Uses GPT-5 and Reinforcement Fine-Tuning to Counter Deepfake Attacks

Doppel, a digital risk protection company, has deployed GPT-5 combined with reinforcement fine-tuning to detect and stop deepfake and impersonation attacks. The system reportedly cuts analyst workloads by 80% and reduces incident response times from hours to minutes. This represents a production deployment of GPT-5 in a cybersecurity context, showcasing enterprise use of frontier models for threat detection.