benchmark

Multi-Round Coreference Resolution

benchmarkactiveprovisionalmulti-round-coreference-resolution-f1c0305a·1 events·first seen 47h ago

Aliases: Multi-Round Coreference Resolution

Co-occurring entities

More like this (12)

Multilingual Coreference Resolution Shared Task CORE (Contrastive Reflection)Reeve Foundation Multilingual Corpus Accuracy and Satisfaction in Multi-Turn LLM Dialogues for NFR Assessment Long-context Reasoning Benchmarks Multi-hop Question Answering Context-Driven Incremental Compression for Multi-Turn Dialogue Generation Context-Driven Incremental Compression for Multi-Turn Dialogue Generation Multi-Task Bayesian In-Context Learning When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models multi-round event injection Uncertainty-Aware Hybrid Retrieval for Long-Document RAG

Recent events (1)

5arXiv · cs.CL·47h ago·source ↗

Randomized YaRN improves LLM length generalization for long-context reasoning

Researchers propose Randomized YaRN, a training method that combines YaRN-based positional extrapolation with randomized positional encodings and a length curriculum to improve LLM generalization to long contexts. Models trained on sequences under 8K tokens show consistent reasoning improvements on context lengths from 16K to 128K on BABILong and MRCR benchmarks. The key insight is that exposing models to out-of-distribution positional representations during short-context training enables better generalization at far longer inference-time lengths.

Long Context Evolution Evaluation and Benchmarking BABILong Multi-Round Coreference Resolution YaRN +1 more