Almanac
benchmark

Claw-Eval

benchmarkactiveprovisionalclaw-eval-2d3d5793·1 events·first seen 26h ago

Aliases: Claw-Eval

Co-occurring entities

More like this (12)

Recent events (1)

6arXiv · cs.AI·26h ago·source ↗

TokenPilot: Dual-granularity context management cuts LLM agent inference costs by up to 87%

TokenPilot is a cache-efficient context management framework for LLM agents that addresses the trade-off between token sparsity and prompt cache continuity. It combines Ingestion-Aware Compaction (global prefix stabilization) with Lifecycle-Aware Eviction (local segment offloading) to reduce inference costs by 56–87% across benchmarks while maintaining competitive task performance. The system is evaluated on PinchBench and Claw-Eval and has been integrated into the open-source LightMem2 library.