technique
symbolic verifier outputs
techniqueactiveprovisional
symbolic-verifier-outputs-57e7b26e·1 events·first seen 20d agoAliases: symbolic verifier outputs
Co-occurring entities
More like this (12)
symbolic meta-verificationreinforcement learning from verifier feedbackProver-Verifier GamesSelf-Trained Verification (STV)execution-based verificationmultimodal meta-verificationselective-verification layertoken-wise self-certaintyOmniVerifier-M1VIA-SD: Verification via Intra-Model Routing for Speculative DecodingGSM-Symbolicsymbolic attention heads
Recent events (1)
OmniVerifier-M1: Multimodal Meta-Verifier with Explicit Structured Recalibration
OmniVerifier-M1 is a generalist visual verifier trained using symbolic meta-verification rationales (e.g., bounding boxes) and decoupled reinforcement learning objectives for binary judgment versus meta-verification. The paper finds that symbolic verifier outputs outperform textual explanations as rationales, enabling rule-based RL rewards without auxiliary judge models, and that decoupling RL objectives substantially improves performance over joint optimization. The system further enables M1-TTS, a verifier-driven agentic generation pipeline supporting dynamic region-level self-correction in multimodal outputs.