Almanac
benchmark

Arabic Instruction Following Eval (IFEval)

benchmarkactivearabic-instruction-following-eval-ifeval--462550da·1 events·first seen 28d ago

Aliases: Arabic Instruction Following Eval (IFEval)

Co-occurring entities

More like this (12)

Recent events (1)

4Hugging Face Blog·28d ago·source ↗

Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More

Hugging Face introduces new Arabic-language evaluation infrastructure, including an Arabic Instruction Following benchmark and updates to the AraGen leaderboard. The post covers evaluation methodology for Arabic LLM capabilities, expanding the ecosystem of non-English benchmarks. This is part of a broader effort to track model performance on Arabic language tasks beyond standard English-centric evaluations.