benchmark
Arabic Instruction Following Eval (IFEval)
benchmarkactive
arabic-instruction-following-eval-ifeval--462550da·1 events·first seen 28d agoAliases: Arabic Instruction Following Eval (IFEval)
Co-occurring entities
More like this (12)
Recent events (1)
Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More
Hugging Face introduces new Arabic-language evaluation infrastructure, including an Arabic Instruction Following benchmark and updates to the AraGen leaderboard. The post covers evaluation methodology for Arabic LLM capabilities, expanding the ecosystem of non-English benchmarks. This is part of a broader effort to track model performance on Arabic language tasks beyond standard English-centric evaluations.