Entity · technique

RA-RFT

techniqueactivera-rft-2c3c210b·1 events·first seen Jun 12, 2026

Aliases: RA-RFT

Co-occurring entities

Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning GRPO Qwen3-4B AIME 2025 Qwen3-1.7B

More like this (12)

APS-RAG AuRA ALER-TI ASRD FACTR 2 FunASR G-RRM MADA-RL VQA-RAD ARPA-H C-RASP RMM

Recent events (1)

6arXiv · cs.AI·Jun 12, 2026·source ↗

RA-RFT: Retrieval-Augmented Reinforcement Fine-Tuning teaches LLMs to reason by analogy

Researchers propose Retrieval-Augmented Reinforcement Fine-Tuning (RA-RFT), a post-training framework that trains a retriever to rank contexts by expected reasoning benefit rather than semantic similarity, then fine-tunes a policy model via reinforcement learning using retrieved analogous demonstrations. The key insight is that reasoning-relevant retrieval surfaces complementary solution strategies rather than superficially similar problems. On mathematical reasoning benchmarks, RA-RFT improves AIME 2025 average@32 accuracy by 7.1 and 2.8 points over GRPO for Qwen3-1.7B and Qwen3-4B respectively, suggesting reasoning-aware retrieval is orthogonal to reward design and training curriculum improvements.

Evaluation and Benchmarking Alignment and RLHF RA-RFT Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning GRPO +3 more