Almanac
product

Agentic CLEAR

productactiveagentic-clear-db2d0548·1 events·first seen 25d ago

Aliases: Agentic CLEAR

Co-occurring entities

More like this (12)

Recent events (1)

6arXiv · cs.CL·25d ago·source ↗

Agentic CLEAR: Automating Multi-Level Evaluation of LLM Agents

Agentic CLEAR is an automatic evaluation framework for LLM-based agentic systems that analyzes behavior at three granularity levels: system, trace, and node. Unlike existing tools that rely on static error taxonomies or focus only on observability, it dynamically generates textual insights and integrates above the observability layer with an accessible UI. Experiments across four benchmarks and seven agentic settings demonstrate strong alignment with human-annotated errors and predictive accuracy for task success rates.