7Google DeepMind Blog·1mo ago

Measuring progress toward AGI: A cognitive framework

DeepMind is introducing a cognitive framework designed to measure progress toward AGI, providing structured criteria for assessing how close AI systems are to general intelligence. Alongside the framework, they are launching a Kaggle hackathon to crowdsource the development of relevant evaluations. The announcement signals a formal effort by a Tier 1 lab to operationalize AGI progress measurement, which has historically been contested and informal.

Frontier Model Releases Evaluation and Benchmarking AI Safety Research Kaggle DeepMind AGI cognitive framework

Related guides (3)

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

AI Safety ResearchTopic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints

Read asBeginner In-depth

Evaluation and BenchmarkingTopic guide

Evaluation and Benchmarking: How We Measure AI — and Why It Keeps Getting Harder

Read asBeginner In-depth

Related events (8)

6Google Deepmind Blog·1mo ago·source ↗

Rethinking how we measure AI intelligence

DeepMind has announced Game Arena, a new open-source evaluation platform designed for rigorous head-to-head comparison of frontier AI models. The platform uses environments with clear winning conditions to assess model capabilities. This represents DeepMind's contribution to addressing ongoing concerns about the adequacy of existing AI benchmarks.

Frontier Model Releases Evaluation and Benchmarking Game Arena DeepMind

5Google Deepmind Blog·1mo ago·source ↗

Taking a Responsible Path to AGI

DeepMind published a blog post outlining its approach to AGI development, emphasizing technical safety, proactive risk assessment, and collaboration with the broader AI community. The post signals DeepMind's public positioning on responsible AGI development practices. It appears to be a high-level strategic communication rather than a technical disclosure or specific capability announcement.

Frontier Model Releases AI Safety Research AGI Google DeepMind

8Openai Blog·1mo ago·source ↗

Measuring AI's capability to accelerate biological research

OpenAI introduces a real-world evaluation framework designed to measure how AI systems can accelerate biological research in wet lab settings. The work uses GPT-5 to optimize a molecular cloning protocol as a concrete demonstration case. The framework explicitly addresses both the potential benefits and biosecurity risks of AI-assisted experimentation, positioning this as a dual-use capability assessment.

Frontier Model Releases Evaluation and Benchmarking wet lab biological research evaluation framework OpenAI molecular cloning +3 more

4Ai Snake Oil·1mo ago·source ↗

AGI is not a milestone

This commentary argues that AGI should not be understood as a discrete capability threshold that triggers sudden societal or economic impacts. The piece challenges the milestone framing common in AI discourse, suggesting that AI impacts are and will continue to be gradual and diffuse rather than punctuated. It positions itself against narratives from major labs that treat AGI as a definable, imminent event.

Frontier Model Releases Evaluation and Benchmarking AGI (Artificial General Intelligence)Normal Tech

6Google Deepmind Blog·1mo ago·source ↗

DeepMind Publishes Framework for Evaluating Cybersecurity Threats of Advanced AI

DeepMind has released a framework designed to help cybersecurity experts assess and prioritize defenses against potential threats posed by advanced AI systems. The framework aims to systematically identify which defensive measures are necessary given AI's expanding capabilities in offensive cyber operations. This represents DeepMind's structured approach to evaluating AI-enabled cyber risks before they materialize at scale.

Evaluation and Benchmarking AI Safety Research DeepMind AI Cybersecurity Threat Evaluation Framework

5Openai Blog·1mo ago·source ↗

Planning for AGI and Beyond

OpenAI published a strategic document outlining its mission and approach to developing artificial general intelligence that benefits all of humanity. The post articulates OpenAI's long-term planning philosophy around AGI safety, deployment, and governance. It represents a high-level policy and values statement from the leading frontier AI lab rather than a technical announcement.

AI Safety Research Regulatory Developments AGI (Artificial General Intelligence)OpenAI

4Openai Blog·1mo ago·source ↗

Security on the path to AGI

OpenAI published a post outlining its approach to security as the organization advances toward AGI. The piece describes how security measures are being built directly into infrastructure and models proactively. The content is high-level and framing-oriented, with limited technical specifics visible in the excerpt.

Training Infrastructure AI Safety Research AGI OpenAI

5Interconnects·6d ago·source ↗

Welcome to the AGI era of AI governance

A commentary piece from Interconnects argues that AI governance has entered an 'AGI era,' framing this as a one-way transition that the field was unprepared for. The piece appears to analyze the governance and policy implications of AI systems reaching or approaching AGI-level capabilities. The framing suggests a significant shift in how AI oversight and regulation must be approached.

AI Safety Research Regulatory Developments Interconnects