MaDI-Bench

benchmark · active · provisional · madi-bench-d8e5b238 · 1 event · first seen 15h ago

Aliases: MaDI-Bench

Recent events (1)

arXiv · cs.CL · 15h ago

MaDI-Bench: First end-to-end benchmark for relational table data integration

Researchers introduce MaDI-Bench (Mannheim Data Integration Benchmark), the first benchmark covering the full data integration pipeline for relational tables, including schema matching, value normalization, entity matching, and conflict resolution. Prior benchmarks evaluated these steps in isolation or omitted stages, limiting research on holistic integration methods. The benchmark provides base tasks across multiple domains and a mechanism for generating task variants to prevent saturation, and it is validated against human-engineered, best-of-breed, and LLM-based pipelines. All artifacts are publicly available.
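
As a rough illustration of how the four stages named above fit together, the following toy Python sketch chains them on two tiny, made-up tables. It is not the MaDI-Bench API or any pipeline from the paper: the table contents, column names, and function names are all hypothetical, and each stage uses a deliberately naive rule (a hand-written column map, trimming and lowercasing, exact name equality, majority vote).

```python
# Toy illustration only: NOT the MaDI-Bench API or its reference pipelines.
# All data, column names, and helpers below are hypothetical.
from collections import Counter

# Two made-up source tables that partly describe the same entities.
source_a = [
    {"name": "Acme Corp ", "city": "Mannheim", "founded": "1999"},
    {"name": "Globex", "city": "Berlin", "founded": "2005"},
]
source_b = [
    {"company": "ACME CORP", "location": "mannheim", "year": "1999"},
    {"company": "Initech", "location": "Hamburg", "year": "2010"},
]

# 1. Schema matching: map source B's column names onto source A's schema.
#    Here the mapping is written by hand; a real matcher would infer it.
SCHEMA_MAP_B = {"company": "name", "location": "city", "year": "founded"}


def match_schema(rows, mapping):
    return [{mapping.get(col, col): val for col, val in row.items()} for row in rows]


# 2. Value normalization: trim whitespace and unify casing.
def normalize(rows):
    return [{col: val.strip().lower() for col, val in row.items()} for row in rows]


# 3. Entity matching: group rows whose normalized names are identical.
def match_entities(rows):
    clusters = {}
    for row in rows:
        clusters.setdefault(row["name"], []).append(row)
    return list(clusters.values())


# 4. Conflict resolution: per column, keep the most frequent value in a cluster.
def resolve(cluster):
    columns = cluster[0].keys()
    return {col: Counter(r[col] for r in cluster).most_common(1)[0][0] for col in columns}


def integrate(a, b):
    rows = normalize(a) + normalize(match_schema(b, SCHEMA_MAP_B))
    return [resolve(cluster) for cluster in match_entities(rows)]


integrated = integrate(source_a, source_b)
for record in integrated:
    print(record)
```

Because every stage feeds the next, an error made early (for example, a wrong column mapping) propagates into entity matching and conflict resolution, which is the kind of interaction that evaluating each step in isolation cannot capture.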