Measuring AI's capability to accelerate biological research
OpenAI introduces a real-world evaluation framework designed to measure how AI systems can accelerate biological research in wet lab settings. The work uses GPT-5 to optimize a molecular cloning protocol as a concrete demonstration case. The framework explicitly addresses both the potential benefits and biosecurity risks of AI-assisted experimentation, positioning this as a dual-use capability assessment.
Related guides (3)
Related events (8)
Preparing for future AI risks in biology
OpenAI has published a post outlining its proactive approach to assessing and mitigating biosecurity risks from advanced AI systems capable of biological applications. The piece describes capability evaluations and safeguards designed to prevent misuse of AI in biology and medicine. This reflects OpenAI's ongoing effort to get ahead of dual-use risks before capabilities reach dangerous thresholds.
Early experiments in accelerating science with GPT-5
OpenAI has published initial research cases demonstrating GPT-5's application to scientific discovery across mathematics, physics, biology, and computer science. The examples highlight human-AI collaboration in generating mathematical proofs and uncovering novel insights. This represents OpenAI's first public documentation of GPT-5's scientific research capabilities beyond general benchmarks.
Measuring progress toward AGI: A cognitive framework
DeepMind is introducing a cognitive framework designed to measure progress toward AGI, providing structured criteria for assessing how close AI systems are to general intelligence. Alongside the framework, they are launching a Kaggle hackathon to crowdsource the development of relevant evaluations. The announcement signals a formal effort by a Tier 1 lab to operationalize AGI progress measurement, which has historically been contested and informal.
Medical Research with GPT-5
OpenAI published a blog post describing how GPT-5 is being used for medical research applications. The post appears to be an announcement or case study highlighting GPT-5's capabilities in a healthcare/research context. Specific details about methods, benchmarks, or outcomes are not provided in the available text.
OpenAI introduces LifeSciBench, a life sciences AI evaluation benchmark
OpenAI has released LifeSciBench, a benchmark designed to evaluate AI systems on real-world life science research tasks and decisions. The benchmark is described as expert-authored and expert-reviewed, targeting domain-specific evaluation in biology and related fields. This addresses a gap in specialized scientific benchmarking for AI systems.
Building an Early Warning System for LLM-Aided Biological Threat Creation
OpenAI published a blueprint for evaluating whether LLMs can meaningfully assist in biological threat creation. In a controlled study with biology experts and students, GPT-4 was found to provide at most mild uplift in biological threat creation accuracy. The results are inconclusive but are framed as a starting point for ongoing safety research and community deliberation on biosecurity risks from AI.
OpenAI and Retro Biosciences Deploy GPT-4b micro for Protein Engineering in Longevity Research
OpenAI collaborated with Retro Biosciences to apply a specialized model called GPT-4b micro to protein engineering tasks relevant to stem cell therapy and longevity research. The work represents a concrete application of a fine-tuned or specialized variant of GPT-4 to life sciences, specifically improving protein design effectiveness. This is a notable example of frontier AI models being deployed in wet-lab-adjacent scientific research contexts.
GPT-5 lowers the cost of cell-free protein synthesis
An autonomous laboratory system integrating OpenAI's GPT-5 with Ginkgo Bioworks' cloud automation platform achieved a 40% reduction in cell-free protein synthesis costs. The system operates via closed-loop experimentation, where the AI model iteratively designs, executes, and refines biological experiments without human intervention. This represents a concrete application of frontier LLMs to wet-lab automation and cost optimization in synthetic biology.


