FilBench
benchmark · active
filbench-3783e2d5 · 1 event · first seen 28d ago
Aliases: FilBench
Recent events (1)
FilBench: Benchmarking LLM Capabilities in Filipino Language
FilBench is a new benchmark for evaluating how well large language models understand and generate Filipino. It targets a language that has historically been underrepresented in NLP evaluation suites, and it covers both comprehension and generation tasks. The work addresses a gap in multilingual LLM evaluation coverage, particularly for Southeast Asian languages.