Entity · product

AlphaEvolve

product · active · alphaevolve-f661eb99 · 8 events · first seen May 17, 2026

Aliases: AlphaEvolve

More like this (12)

AlphaGenome AlphaTensor MetaEvolve AlphaFold AlphaStar OpenEvolve OpenEvolve XAlpha AlphaEarth Foundations EvoStruct AlphaFold3 AlphaFold2

Recent events (8)

6 · arXiv · cs.CL · 5d ago

MetaEvolve framework trains LLMs to self-evolve via meta-skill reinforcement learning

Researchers introduce MetaEvolve, a framework that cultivates 'meta-skills' such as self-reflection and multi-round refinement in LLMs through a data synthesis pipeline and evolution-aware reinforcement learning. Grounded in coding tasks where program execution provides continuous reward signals, the system trains on evolution trajectories containing programs, fitness scores, and attempt histories. MetaEvolve outperforms the strongest baseline by 10.01% on in-distribution coding benchmarks and 24.12% on out-of-distribution tasks, with a 46.9% relative improvement on open-ended algorithm optimization problems outside the training domain. The work is motivated by AlphaEvolve-style test-time scaling and aims to produce generalizable meta-skills transferable to domains with sparse training signals.

Evaluation and Benchmarking · Agent and Tool Ecosystem · Teaching LLMs to Self-Evolve: Cultivating Core Meta-Skills with Reinforcement Learning · AlphaEvolve · MetaEvolve · +1 more
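The evolution trajectories mentioned in the MetaEvolve summary (programs, fitness scores, and attempt histories) could be represented roughly as below. This is a minimal sketch, not the paper's data format: the class names, field names, and the `relative_improvement` helper are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Attempt:
    program: str    # source code of one candidate (hypothetical field)
    fitness: float  # score obtained by executing the candidate


@dataclass
class Trajectory:
    task: str
    attempts: list = field(default_factory=list)  # attempt history, oldest first

    def record(self, program, fitness):
        """Append one attempt to the history."""
        self.attempts.append(Attempt(program, fitness))

    def best(self):
        """Return the highest-fitness attempt seen so far."""
        return max(self.attempts, key=lambda a: a.fitness)


def relative_improvement(new, baseline):
    """Relative gain of `new` over `baseline`, e.g. 0.469 for a 46.9% gain."""
    return (new - baseline) / baseline


trajectory = Trajectory(task="toy-task")
trajectory.record("v1", 0.4)
trajectory.record("v2", 0.7)
trajectory.record("v3", 0.6)
```

Here `trajectory.best()` returns the second attempt, which is the kind of signal a refinement step could condition on when deciding what to try next.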

6 · arXiv · cs.CL · Jun 5, 2026

MLEvolve: Self-evolving multi-agent framework for automated ML algorithm discovery

MLEvolve is a new LLM-based multi-agent framework for end-to-end machine learning algorithm discovery that addresses limitations of existing MLE agents, including information isolation and memoryless search. The system introduces Progressive MCGS (a graph-extended tree search) and Retrospective Memory for experience accumulation, and it decouples strategic planning from code generation. Evaluated on MLE-bench, it achieves state-of-the-art medal and valid-submission rates within a 12-hour budget, and also outperforms AlphaEvolve on mathematical algorithm optimization tasks.

Evaluation and Benchmarking · Agent and Tool Ecosystem · MLEvolve · MLE-bench · Progressive MCGS · +3 more

7 · The Batch · Jun 3, 2026

Google's Aletheia agent uses Gemini 3 Deep Think to generate novel solutions to unsolved Erdős problems

Google researchers introduced Aletheia, an agentic workflow using Gemini 3 Deep Think that generates, verifies, and revises solutions to previously unsolved mathematical problems. Applied to Erdős problems, Aletheia produced 13 correct solutions out of 200 evaluated, with 4 being genuinely novel contributions not found in existing literature. The announcement also reveals Gemini 3 Deep Think's benchmark performance: 48.4% on HLE, 84.6% on ARC-AGI-2, and 93.8% on GPQA Diamond. The system demonstrates both the promise and current limitations of AI-assisted mathematical research, with a 6.5% correct-under-intended-interpretation rate on a hard problem set.

Frontier Model Releases · Evaluation and Benchmarking · Gemini 3.5 Pro · Gemini Deep Think · Tony Feng · +9 more

5 · The Batch · Jun 1, 2026

Persona Generators: Evolutionary LLM Method for Diverse Synthetic Human Personas

Google researchers Davide Paglieri, Logan Cross, and colleagues propose Persona Generators, a system that uses the AlphaEvolve evolutionary algorithm to generate code that produces 25 diverse persona prompts covering a broad range of attitudes and opinions. The method iteratively optimizes persona prompt diversity using six metrics, outperforming Nemotron Personas (82% vs 76% coverage of possible responses) and a Concordia memory-based baseline (46%). The system uses Gemini 2.5 Pro for questionnaire generation and Gemma 3-27B-IT for persona simulation via the Concordia agent library. The approach reframes persona generation as a coverage optimization problem rather than a data-matching one, enabling more representative synthetic user populations for product research.

Evaluation and Benchmarking · Agent and Tool Ecosystem · Gemma 2 9B · Persona Generators · Davide Paglieri · +6 more

5 · The Batch · Jun 1, 2026

Researchers at UT-Austin and Google Model Human Decision-Making in Rock-Paper-Scissors

Researchers from UT-Austin and Google used AlphaEvolve, an evolutionary code-optimization method, to synthesize interpretable Python programs that predict move-by-move decisions of LLMs and humans playing rock-paper-scissors against bots. They found that Gemini 2.5 Pro, Gemini 2.5 Flash, and GPT-4.1 share similar sequential-pattern-tracking strategies that are more systematic than typical human play, while GPT-OSS 120B and humans relied on simpler opponent-move-frequency heuristics. The study demonstrates that code synthesis from behavioral data can serve as an interpretability tool for LLM decision-making, revealing that LLMs do not simply mimic human strategies.

Evaluation and Benchmarking · AI Safety Research · Google · Gemini-2.5-Flash-Lite · AlphaEvolve · +6 more
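The two strategy families named in the rock-paper-scissors summary can be illustrated with a minimal Python sketch. These are not the programs synthesized in the study: the function names, the first-order (previous-move) pattern context, the default opening move, and the tie-breaking order are all assumptions made for illustration.

```python
from collections import Counter, defaultdict

MOVES = ("R", "P", "S")
BEATS = {"R": "P", "P": "S", "S": "R"}  # BEATS[x] is the move that beats x


def frequency_policy(opponent_history):
    """Opponent-move-frequency heuristic: counter the opponent's most common move."""
    if not opponent_history:
        return "R"  # arbitrary opening move (assumption)
    counts = Counter(opponent_history)
    # Ties are broken by the fixed order in MOVES (assumption).
    likely = max(MOVES, key=lambda m: counts[m])
    return BEATS[likely]


def pattern_policy(opponent_history):
    """Sequential-pattern tracking: predict the opponent's next move from what
    followed their previous move in the past, then counter that prediction."""
    if len(opponent_history) < 2:
        return frequency_policy(opponent_history)
    transitions = defaultdict(Counter)
    for prev, nxt in zip(opponent_history, opponent_history[1:]):
        transitions[prev][nxt] += 1
    follow = transitions[opponent_history[-1]]
    if not follow:
        # The last move has never been followed by anything yet; fall back.
        return frequency_policy(opponent_history)
    likely = max(MOVES, key=lambda m: follow[m])
    return BEATS[likely]
```

Against a bot cycling R, P, S, the frequency heuristic only sees near-uniform counts, while the pattern tracker learns that P is followed by S and answers with R. That difference is the sense in which sequential tracking is "more systematic".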

7 · arXiv · cs.AI · May 22, 2026

Vector Policy Optimization: Training for Diversity Improves Test-Time Search

Vector Policy Optimization (VPO) is a new RL post-training algorithm for LLMs that replaces the scalar reward paradigm with vector-valued rewards, explicitly training models to produce diverse solution sets that specialize across different reward trade-offs. VPO is designed as a near-drop-in replacement for the GRPO advantage estimator and targets inference-scaling search procedures like AlphaEvolve. Across four tasks, VPO matches or outperforms scalar RL baselines on pass@k and best@k metrics, with advantages growing as search budget increases, and unlocks evolutionary search problems that GRPO-trained models cannot solve. The paper argues that diversity-optimized post-training may need to become the default as inference-time search becomes standard.

Evaluation and Benchmarking · Inference Economics · GRPO · pass@k · AlphaEvolve · +4 more
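For context on the quantities named in the VPO summary, the sketch below shows the scalar baseline side only: a group-relative advantage in the style of GRPO, the standard unbiased pass@k estimator, and a simple best@k. VPO's vector-valued estimator is not described in the summary and is not reproduced here. The use of the population standard deviation, the `eps` term, and the fixed-order definition of `best_at_k` are assumptions.

```python
import math
import statistics


def grpo_advantages(rewards, eps=1e-8):
    """Group-relative advantages: standardize scalar rewards within one group
    of samples drawn for the same prompt."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)  # population std (assumption)
    return [(r - mean) / (std + eps) for r in rewards]


def pass_at_k(n, c, k):
    """Unbiased pass@k: probability that at least one of k samples, drawn
    without replacement from n samples of which c are correct, is correct."""
    if n - c < k:
        return 1.0  # fewer than k incorrect samples, so any draw contains a correct one
    return 1.0 - math.comb(n - c, k) / math.comb(n, k)


def best_at_k(scores, k):
    """Best score among the first k samples (fixed sample order, an assumption)."""
    return max(scores[:k])


advantages = grpo_advantages([1.0, 0.0, 1.0, 0.0])
```

Because the scalar advantage rewards every sample along a single axis, nothing in it encourages the samples in a group to differ from one another, which is the gap the summary says vector-valued rewards are meant to address.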

8 · Google DeepMind Blog · May 19, 2026

AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms

DeepMind has announced AlphaEvolve, a coding agent powered by Gemini that autonomously evolves algorithms for mathematical and practical computing applications. The system combines large language model creativity with automated evaluators to iteratively improve algorithmic solutions. It represents a significant step in AI-driven algorithm discovery, extending DeepMind's prior work in this space (e.g., AlphaTensor, FunSearch). The announcement comes from DeepMind's official blog, indicating a substantive capability release rather than a research preview.

Frontier Model Releases · Evaluation and Benchmarking · AlphaEvolve · Google DeepMind · AlphaTensor · +3 more
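The propose-evaluate-iterate pattern described in the announcement summary can be sketched as a generic evolutionary loop. This is a toy illustration, not AlphaEvolve's implementation: `propose` is a random perturbation standing in for the LLM that rewrites a candidate, `evaluate` stands in for the automated evaluator, and the tournament selection, population size, and toy objective are all assumptions.

```python
import random


def evolve(seed, propose, evaluate, generations=200, population_size=8, rng=None):
    """Generic propose-evaluate-select loop.

    propose(parent, rng) stands in for the model that rewrites a candidate;
    evaluate(candidate) stands in for the automated evaluator (higher is better).
    Returns the best (score, candidate) pair found.
    """
    rng = rng or random.Random(0)
    population = [(evaluate(seed), seed)]
    for _ in range(generations):
        # Pick a parent with a small tournament, biased toward better candidates.
        contenders = rng.sample(population, k=min(2, len(population)))
        _, parent = max(contenders, key=lambda pair: pair[0])
        child = propose(parent, rng)
        population.append((evaluate(child), child))
        # Keep only the top-scoring candidates for the next generation.
        population.sort(key=lambda pair: pair[0], reverse=True)
        del population[population_size:]
    return population[0]


# Toy problem (hypothetical): find x that maximizes -(x - 3)^2.
def toy_evaluate(x):
    return -(x - 3.0) ** 2


def toy_propose(x, rng):
    return x + rng.gauss(0.0, 0.5)


best_score, best_x = evolve(0.0, toy_propose, toy_evaluate)
```

Because the top candidates are always retained, the best score never decreases from one generation to the next; the quality of the result then depends mostly on how good the proposals are, which is where a language model replaces the random perturbation used here.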

7 · Google DeepMind Blog · May 17, 2026

AlphaEvolve: How our Gemini-powered coding agent is scaling impact across fields

DeepMind published a blog post detailing the real-world impact of AlphaEvolve, a Gemini-powered coding agent designed to discover and optimize algorithms. The post covers applications spanning business operations, infrastructure, and scientific research. AlphaEvolve represents a deployment of LLM-driven evolutionary algorithm search at scale across multiple domains.

Frontier Model Releases · Inference Economics · AlphaEvolve · Google DeepMind · Gemini · +1 more