Entity · benchmark

HMMT25

benchmarkactivehmmt25-a1278bf5·1 events·first seen May 28, 2026

Aliases: HMMT25

Co-occurring entities

OPSD AIME24 SGSD GRPO AIME25 Skill-Conditioned Gated Self-Distillation (SGSD)Qwen3-1.7B

More like this (12)

HMMT 2025 HMMT 2026 BM25 HM3D AIME25 CC12M M1-TTS mT5 ETTm2 TTT-E2E MetaWorld MT50 Hy3-A21B

Recent events (1)

6arXiv · cs.AI·May 28, 2026·source ↗

Skill-Conditioned Gated Self-Distillation (SGSD) for LLM Reasoning

SGSD is a new on-policy self-distillation method for LLM reasoning that replaces trusted privileged information (e.g., reference answers) with an experience-derived skill bank of skill-mistake pairs. It constructs a multi-teacher pool, validates each teacher's contribution via a verifier, and applies a gated objective to distill informative disagreements while suppressing noisy signals. On Qwen3-1.7B, SGSD outperforms GRPO by 6.2% and answer-conditioned OPSD by 1.7% on average across AIME24, AIME25, and HMMT25. The method relaxes the assumption of trusted privileged information, making self-distillation more practical under weaker supervision.

Frontier Model Releases Evaluation and Benchmarking OPSD AIME24 SGSD +7 more