Entity · paper

DeepRubric

paper · active · deeprubric-998f055e · 1 event · first seen Jun 16, 2026

Aliases: DeepRubric, DeepRubric-8B

More like this (12)

Rubric Reward · RubricsTree · Rubrics on Trial · Rubric-based Feedback Evaluation · QUBRIC · rubric-based rewards · Preference-Aware Rubric Learning · RuBench · When Rubrics Change: Cross-Rubric Generalization for Critical Thinking Essay Scoring · OCR-Robust · rubric-based reward shaping · Rubric-Conditioned Self-Distillation

Recent events (1)

arXiv · cs.CL · Jun 16, 2026

DeepRubric: Evidence-tree rubric supervision cuts RL training cost for deep research agents by 13x

DeepRubric is a data construction framework that improves reinforcement learning efficiency for deep research agents by reversing the typical rubric-generation process: rather than inferring evaluation criteria from a query, it first builds an evidence tree of verifiable sub-questions, then synthesizes aligned query-rubric pairs from that tree. The authors construct 9K training examples and train DeepRubric-8B using rubric-based GRPO, achieving performance comparable to prior open-source state-of-the-art deep research models on three benchmarks while using roughly 13x fewer RL GPU-hours. The work addresses a key bottleneck in RL-based training of long-form research agents: unreliable reward signals from incomplete rubrics.
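The idea of deriving a rubric from an evidence tree and then scoring rollouts with it can be illustrated with a toy sketch. This is not the authors' implementation: the `EvidenceNode` schema, the substring-based `rubric_reward` (a stand-in for whatever judge the paper uses), and the example data are all hypothetical. Only the group-relative standardization of rewards follows the standard GRPO advantage formulation.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from statistics import mean, pstdev


@dataclass
class EvidenceNode:
    """One verifiable sub-question and its answer (hypothetical schema)."""
    question: str
    answer: str
    children: list[EvidenceNode] = field(default_factory=list)


def rubric_from_tree(root: EvidenceNode) -> list[str]:
    """Collect one rubric item per node, in pre-order: the answer a report should state."""
    items = [root.answer]
    for child in root.children:
        items.extend(rubric_from_tree(child))
    return items


def rubric_reward(report: str, rubric: list[str]) -> float:
    """Fraction of rubric items present in the report.

    Assumption: a case-insensitive substring check replaces a real judge.
    """
    if not rubric:
        return 0.0
    text = report.lower()
    hits = sum(1 for item in rubric if item.lower() in text)
    return hits / len(rubric)


def group_relative_advantages(rewards: list[float], eps: float = 1e-6) -> list[float]:
    """GRPO-style advantages: standardize rewards within one group of rollouts."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Toy evidence tree with placeholder content.
tree = EvidenceNode(
    "Who proposed the method?", "Alice",
    [EvidenceNode("In which year?", "1998"),
     EvidenceNode("In which city?", "Oslo")],
)
rubric = rubric_from_tree(tree)

# Three rollouts for the same query, scored against the same rubric.
reports = [
    "Alice proposed the method in 1998 while working in Oslo.",
    "The method was proposed by Alice.",
    "The origin of the method is unknown.",
]
rewards = [rubric_reward(r, rubric) for r in reports]
advantages = group_relative_advantages(rewards)
```

Because the rubric is read off the tree rather than guessed from the query, every rubric item in this sketch is backed by a sub-question with a known answer, which is the property the summary credits for more reliable reward signals.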

Evaluation and Benchmarking · Agent and Tool Ecosystem · DeepRubric · GRPO