Almanac
benchmark

Alyah

benchmarkactivealyah-3e92db87·1 events·first seen 28d ago

Aliases: Alyah

Co-occurring entities

More like this (12)

Recent events (1)

4Hugging Face Blog·28d ago·source ↗

Alyah: Benchmark for Evaluating Emirati Dialect Capabilities in Arabic LLMs

TII UAE introduces Alyah, a benchmark designed to evaluate large language models on Emirati Arabic dialect understanding and generation. The work addresses a gap in Arabic NLP evaluation, where most benchmarks focus on Modern Standard Arabic and neglect regional dialects. The benchmark aims to provide robust assessment of LLM capabilities specific to Emirati linguistic and cultural context.