
LifeSciBench

benchmark·active·provisional·lifescibench-920ee5a0·1 event·first seen 3d ago

Aliases: LifeSciBench

Recent events (1)

6·OpenAI Blog·3d ago

OpenAI introduces LifeSciBench, a life sciences AI evaluation benchmark

OpenAI has released LifeSciBench, a benchmark designed to evaluate AI systems on real-world life sciences research tasks and decisions. The benchmark is described as expert-authored and expert-reviewed, targeting domain-specific evaluation in biology and related fields. It is positioned as addressing a gap in specialized scientific benchmarking for AI systems.