paper
EDIT: Evidence-Diagnosed Intervention Training for Rule-Faithful LLM Grading
paperactiveprovisional
edit-evidence-diagnosed-intervention-training-for-rule-faithful-llm-grading-cacd610a·1 events·first seen 12d agoAliases: EDIT: Evidence-Diagnosed Intervention Training for Rule-Faithful LLM Grading
Co-occurring entities
More like this (12)
Evidence-Diagnosed Intervention TrainingMeasuring Epistemic Resilience of LLMs Under Misleading Medical ContextRevising Context, Shifting Simulated Stance: Auditing LLM-Based Stance Simulation in Online DiscussionsLLM-judged explanation scoreLLM PretrainingLLM-judge scoringLLM-based content classificationTraining LLMs to Enforce Multi-Level Instruction Hierarchies via Gravity-Weighted Direct Preference OptimizationDual-Evidence Gradient PurificationOn The Effectiveness-Fluency Trade-Off In LLM Conditioning: A Systematic StudyBeyond Third-Person Audits: Situated Interaction Auditing for User-Centered LLM Bias ResearchBe My Tutor: On-Policy Co-Distillation for Mutual LLM Improvement via Peer Feedback
Recent events (1)
EDIT framework trains more rubric-faithful LLM graders via internal-state diagnostics
Researchers introduce Evidence-Diagnosed Intervention Training (EDIT), a two-phase framework for improving LLM-based rubric grading. The first phase (EDIT-SFT) identifies problematic reasoning steps using posterior belief signals and input-grounding scores, then revises only those steps with rubric checklists; the second phase (EDIT-RL) uses belief-guided reward shaping to penalize harmful belief drifts during RL. Experiments on two real-world multi-subject grading benchmarks show consistent improvements over SFT and RL baselines on both in-domain and out-of-domain splits.