Almanac
benchmark

HMMT 2026

benchmarkactiveprovisionalhmmt-2026-1dfd9299·1 events·first seen 13d ago

Aliases: HMMT 2026

Co-occurring entities

More like this (12)

Recent events (1)

6arXiv · cs.AI·13d ago·source ↗

StreamMA: Streaming communication in multi-agent reasoning reduces latency and improves accuracy

Researchers introduce StreamMA, a multi-agent reasoning system that streams individual reasoning steps to downstream agents as they are generated, rather than waiting for a complete chain. This pipelining approach reduces end-to-end latency and also improves accuracy by shielding downstream agents from error-prone late reasoning steps. Evaluated across eight benchmarks, two frontier LLMs (Claude Opus 4.6 and GPT-5.4), and three topologies, StreamMA outperforms serial and single-agent baselines by an average of 7.3 percentage points. The paper also identifies a 'step-level scaling law' — a new scaling dimension orthogonal to agent-count scaling.