Entity · paper

The Measurement Gap in the Automation of EU Law: Benchmarking Doctrinal Legal Reasoning under the EU AI Act

paperactivethe-measurement-gap-in-the-automation-of-eu-law-benchmarking-doctrinal-legal-reasoning-under-the-eu-ai-act-71de2b65·1 events·first seen Jun 17, 2026

Aliases: The Measurement Gap in the Automation of EU Law: Benchmarking Doctrinal Legal Reasoning under the EU AI Act

Co-occurring entities

EU AI Act

More like this (12)

EU AI Act EU Code of Practice on AI content transparency Clinician-Level Agreement Without Clinical Caution: LLM Evaluator Limits in Medical AI Benchmarking EU General-Purpose AI Code of Practice Towards Agentic AI Governance: A Preliminary Assessment Flaws in the LLM Automation Narrative Measuring the Gap Between Human and LLM Research Ideas Measuring the Gap Between Human and LLM Research Ideas Legal Services AI Open Source AI Gap Map OpAI-Bench Can LLMs Judge Better Than They Generate? Evaluating Task Asymmetry, Mechanistic Interpretability and Transferability for In-Context QA

Recent events (1)

5arXiv · cs.CL·Jun 17, 2026·source ↗

Benchmark gap paper: EU AI Act requires doctrinal legal reasoning evals that don't yet exist

A new arXiv preprint identifies a critical measurement gap in legal AI evaluation: existing benchmarks test paralegal and ancillary tasks rather than doctrinal legal reasoning, which is the interpretive core of legal work. The authors argue this gap is not merely methodological but legally significant, because the EU AI Act's 'appropriate accuracy' requirement for high-risk AI in the judicial domain cannot be operationalized without a doctrinal-reasoning benchmark. The paper proposes a benchmark framework aimed at filling this gap under EU AI Act compliance requirements.

Evaluation and Benchmarking Regulatory Developments The Measurement Gap in the Automation of EU Law: Benchmarking Doctrinal Legal Reasoning under the EU AI Act EU AI Act