Entity · company

Andon Labs

companyactiveandon-labs-e3e7dea4·2 events·first seen Jun 1, 2026

Aliases: Andon Labs

Co-occurring entities

Claude Mythos Anthropic Axel Backlund Claude Haiku 4.5 Lukas Petersson VendingBench Latent Space DeepLearning.AI Chubb Limited Luna Claude Design Travelers Group Goldman Sachs OpenAI Morgan Stanley Andrew Ng Berkshire Hathaway GPT-Rosalind

More like this (12)

Isomorphic Labs Anthropic Labs Anduril Industries AI21 Labs AIA Labs Annapurna Labs Kyutai Labs Pangram Labs Analytics-Everywhere-Lab Lambda Labs AWS Labs Solayer Labs

Recent events (2)

5Latent Space·Jun 4, 2026·source ↗

Andon Labs on building frontier evals: VendingBench and evaluating Claude models

Latent Space interviews Lukas Petersson and Axel Backlund of Andon Labs, the creators of VendingBench, about their approach to building real-world AI evaluations. The conversation covers their experience evaluating Claude models across the capability spectrum from Haiku to Mythos, and their methodology for constructing durable frontier evals. The episode is notable for touching on a speculative or unreleased Claude model tier called 'Mythos.'

Frontier Model Releases Evaluation and Benchmarking Claude Mythos Axel Backlund Claude Haiku 4.5 +5 more

5The Batch·Jun 1, 2026·source ↗

Insurance Companies Carve Out AI Risk Exceptions; GPT-Rosalind, Claude Design, and Agentic Retail Deployments Highlighted

Major insurers including Berkshire Hathaway units, Travelers Group, and Chubb are excluding or restricting AI-related liability coverage, signaling growing concern over hard-to-model AI-driven claims. OpenAI introduced GPT-Rosalind, a domain-specific LLM fine-tuned for life sciences workflows, while Anthropic launched Claude Design for visual asset generation targeting non-designers. Additional items cover an AI-run San Francisco retail store exposing agentic system limitations, Wall Street banks cutting junior roles via AI deployment, and Anthropic's continued engagement with the Trump administration despite prior Pentagon restrictions.

Frontier Model Releases Inference Economics DeepLearning.AI Claude Mythos Chubb Limited +15 more