Entity · company

Cognition

company · active · cognition-0fd097c4 · 11 events · first seen May 20, 2026

Aliases: Cognition

More like this (12)

memory · Executable Operational Cognition · Cognizant · Notion · Deep Think · Before You Think: System 0, AI-Mediated Cognition and Cognitive Colonization · Thinking-with-Images · AGI cognitive framework · cognitive-graph · Basque Center on Cognition, Brain, and Language · meta-cognitive configurator · Cerebras

Recent events (11)

6 · The Batch · Jul 16, 2026

Data Points: PrismML fits 27B model on iPhone; Cognition SWE-1.7, Nvidia Audex, Anthropic language-value study

A newsletter digest covers four notable AI developments: PrismML (a Caltech/Khosla spinout) compressed Alibaba's Qwen 27B model to under 4 GB via ternary/binary quantization for on-device iPhone inference; Cognition released SWE-1.7 (trained on Kimi K2.7), jumping from 9.4% to 42.3% on FrontierCode 1.1 Main with novel RL and infrastructure techniques; Nvidia introduced Audex, a 30B unified audio-text transformer trained on 157B audio tokens; and Anthropic published research showing Claude's expressed values shift measurably by language across 309,815 conversations. Each item represents a distinct technical development across on-device inference, coding agents, multimodal models, and model behavior analysis.
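The "27B model under 4 GB" figure in the PrismML item can be sanity-checked with simple arithmetic. The sketch below is illustrative only and is not PrismML's method: it assumes exactly 27e9 weights, 1 GB = 10^9 bytes, ideal bit-packing, and no overhead for scales, embeddings, or activations. The function names are hypothetical.

```python
import math

def model_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Storage for the weights alone, in GB (10^9 bytes), assuming ideal packing."""
    return n_params * bits_per_weight / 8 / 1e9

def bit_budget(n_params: float, size_gb: float) -> float:
    """Average bits per weight that fit in a given size budget."""
    return size_gb * 1e9 * 8 / n_params

N_PARAMS = 27e9  # assumption: "27B" taken as exactly 27 billion weights

binary_gb = model_size_gb(N_PARAMS, 1.0)            # 1 bit per weight
ternary_gb = model_size_gb(N_PARAMS, math.log2(3))  # about 1.58 bits per weight
budget_bits = bit_budget(N_PARAMS, 4.0)             # what "under 4 GB" allows

print(f"binary:  {binary_gb:.3f} GB")
print(f"ternary: {ternary_gb:.3f} GB")
print(f"4 GB budget: {budget_bits:.3f} bits/weight")
```

Under these assumptions a purely ternary encoding would need about 5.3 GB, while a purely binary one would need about 3.4 GB, so a sub-4 GB result implies an average below roughly 1.19 bits per weight. That is consistent with the summary's mention of a ternary/binary mix, though the actual layout is not described in the source.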

Inference Economics · Agent and Tool Ecosystem · Kimi K2 · Claude Sonnet · Claude Opus 4.6 · +18 more

6 · Hacker News · Jul 8, 2026

Cognition releases SWE-1.7, claiming near-GPT-5.5 and near-Opus intelligence

Cognition has released SWE-1.7, a new version of its software-engineering model, and claims performance approaching GPT-5.5 and Claude Opus 4.8 on coding tasks. The announcement is hosted on Cognition's blog and drew 228 points and 118 comments on Hacker News. It is a notable capability claim from a specialized coding AI lab competing with frontier general-purpose models.

Frontier Model Releases · Agent and Tool Ecosystem · Cognition · SWE-1.7 · OpenAI · +3 more

7 · The Batch · Jul 2, 2026

U.S. lifts export controls on Claude Fable 5 and Mythos 5; Anthropic launches Claude Sonnet 5 and Claude Science platform

The Trump administration lifted export restrictions on Anthropic's Claude Fable 5 and Claude Mythos 5 after Anthropic committed to stronger safeguards, resolving a dispute over jailbreak vulnerabilities. Separately, Anthropic launched Claude Sonnet 5, a mid-tier agentic model priced at $2/$10 per million tokens through August 2026, and Claude Science, a unified research workbench for life sciences integrating PubMed, Jupyter, and HPC cluster access. The newsletter also covers Google's Nano Banana 2 Lite image model and Gemini Omni Flash video model, and Cognition's Devin Fusion multi-model routing system claiming 35% cost reduction versus GPT-5.5 and Opus 4.8.

Frontier Model Releases · Inference Economics · Dario Amodei · Claude Sonnet 3.5 · Claude Mythos · +20 more

7 · The Batch · Jun 10, 2026

The Batch: Claude Mythos 5 / Fable 5 debut, Apple AFM 3, Google Live Translate, OpenAI IPO filing, FrontierCode benchmark

Anthropic launched Claude Fable 5 (a safety-guardrailed model) and Claude Mythos 5 (same underlying model with safeguards removed, for vetted cyberdefense/infrastructure users via Project Glasswing with US government collaboration), both priced at $10/$50 per million tokens. Apple released five new Apple Foundation Models (AFM 3) spanning on-device and cloud tiers, built with Google and Nvidia infrastructure. Additional headlines cover Google's Gemini 3.5 Live Translate (70+ languages, real-time), OpenAI's confidential SEC IPO filing, a NotebookLM upgrade to Gemini 3.5, and Cognition's FrontierCode benchmark for code-quality evaluation where Claude Opus 4.8 leads at 34.3%.

Frontier Model Releases · Evaluation and Benchmarking · Claude Mythos · Claude Opus 4.6 · Google · +19 more

8 · Hacker News · Jun 9, 2026

Anthropic releases Claude Fable 5

Anthropic has released Claude Fable 5, a new model in the Claude family, announced via their official news channel. The Hacker News discussion generated substantial engagement with 1,468 points and 1,156 comments, indicating significant community interest. No detailed capability claims or benchmark results are available from this item alone.

Frontier Model Releases · AI Safety Research · Claude Mythos · IMC · Claude Opus 4.6 · +11 more

9 · Anthropic News · Jun 3, 2026

Anthropic introduces computer use capability, upgraded Claude 3.5 Sonnet, and Claude 3.5 Haiku

Anthropic announced three major developments: an upgraded Claude 3.5 Sonnet with significant coding improvements (SWE-bench Verified rising from 33.4% to 49.0%, surpassing all publicly available models including reasoning models), a new Claude 3.5 Haiku that matches Claude 3 Opus performance at Haiku-tier speed, and a public beta of 'computer use' — a capability allowing Claude to control computers by viewing screens, moving cursors, clicking, and typing. Computer use is available via the Anthropic API, Amazon Bedrock, and Google Cloud Vertex AI, with early adopters including Replit, The Browser Company, and Cognition. Both safety institutes (US AISI and UK AISI) conducted pre-deployment testing, and the model was assessed as remaining within ASL-2 under Anthropic's Responsible Scaling Policy.

Frontier Model Releases · Evaluation and Benchmarking · OpenAI o1-preview · Amazon Bedrock · Claude 3.5 Sonnet · +15 more

9 · Anthropic News · Jun 1, 2026

Anthropic Introduces Claude Opus 4 and Sonnet 4 with Leading Coding Benchmarks and Agent Capabilities

Anthropic has released Claude Opus 4 and Claude Sonnet 4, positioning Opus 4 as the world's best coding model with 72.5% on SWE-bench and 43.2% on Terminal-bench, and Sonnet 4 at 72.7% on SWE-bench. Both models are hybrid (near-instant + extended thinking), support extended thinking with tool use in beta, parallel tool execution, and improved memory via local file access. Alongside the models, Anthropic is launching Claude Code as generally available with GitHub Actions, VS Code, and JetBrains integrations, plus four new API capabilities: code execution tool, MCP connector, Files API, and one-hour prompt caching. Pricing is unchanged from prior Opus and Sonnet tiers ($15/$75 and $3/$15 per million tokens respectively), with availability on Anthropic API, Amazon Bedrock, and Google Cloud Vertex AI.
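The per-million-token prices quoted above translate into per-request cost as follows. This is a minimal sketch, assuming the "$15/$75" and "$3/$15" pairs mean input/output prices per million tokens (the usual convention, but not spelled out in this entry); the token counts and all names are hypothetical, and prompt-caching or batch discounts are ignored.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Pricing:
    """USD per one million tokens."""
    input_per_m: float
    output_per_m: float

    def cost(self, input_tokens: int, output_tokens: int) -> float:
        # Multiply before dividing to keep floating-point error small.
        return (input_tokens * self.input_per_m
                + output_tokens * self.output_per_m) / 1e6

# Prices as quoted in the summary; the input/output reading is an assumption.
OPUS_4 = Pricing(input_per_m=15.0, output_per_m=75.0)
SONNET_4 = Pricing(input_per_m=3.0, output_per_m=15.0)

# Hypothetical agentic coding request: large context in, short patch out.
opus_cost = OPUS_4.cost(input_tokens=100_000, output_tokens=10_000)
sonnet_cost = SONNET_4.cost(input_tokens=100_000, output_tokens=10_000)

print(f"Opus 4:   ${opus_cost:.2f}")
print(f"Sonnet 4: ${sonnet_cost:.2f}")
```

For this illustrative request the two tiers differ by a factor of five, since both the input and output prices scale by the same ratio.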

Long Context Evolution · Frontier Model Releases · Claude Sonnet 4 · Amazon Bedrock · Claude Opus 4.6 · +21 more

9 · Anthropic News · Jun 1, 2026

Claude 3.7 Sonnet and Claude Code: Anthropic's First Hybrid Reasoning Model and Agentic Coding Tool

Anthropic has released Claude 3.7 Sonnet, described as their most capable model to date and the first hybrid reasoning model on the market, capable of operating in both standard and extended thinking modes within a single unified model. The model achieves state-of-the-art results on SWE-bench Verified and TAU-bench, with particular strength in coding and front-end web development. Alongside the model, Anthropic is launching Claude Code in limited research preview, a command-line agentic coding tool that can read/edit files, run tests, and push to GitHub. Pricing remains unchanged at $3/M input and $15/M output tokens, with availability across Claude.ai plans, Amazon Bedrock, and Google Cloud Vertex AI.

Frontier Model Releases · Evaluation and Benchmarking · Canva · Amazon Bedrock · GitHub · +14 more

5 · Latent Space · May 28, 2026

The Age of Async Agents — Cognition's Walden Yan & OpenInspect's Cole Murray

A Latent Space podcast episode featuring Cognition's Walden Yan and OpenInspect's Cole Murray discussing the current state of autonomous software engineering agents. Topics include Devin's reported 80% commit rate, spec-to-PR workflows, full VM environments for agents, agent memory, and the emerging pattern of product managers shipping code directly. The conversation covers practical deployment patterns and tooling for async agentic coding workflows.

Frontier Model Releases · Enterprise Deployment Patterns · Devin · Cole Murray · Cognition · +4 more

7 · Latent Space · May 28, 2026

Cognition raises $1B in $26B Series D

Cognition, the AI coding agent company behind Devin, has raised $1B in a Series D round at a $26B valuation. The round signals continued investor conviction in autonomous coding agents as a large and growing market. The Latent Space newsletter frames coding as an 'uncapped TAM market,' reflecting broader industry sentiment around AI-driven software development.

Enterprise Deployment Patterns · Agent and Tool Ecosystem · Devin · Cognition · Latent Space

4 · OpenAI Blog · May 20, 2026

Coding with OpenAI o1

OpenAI published a brief feature in which Scott Wu, CEO of Cognition (maker of the Devin AI software engineer), describes how o1 approaches coding decisions in a more human-like, reasoning-oriented manner. The piece is a short promotional commentary tied to the o1 model launch, highlighting o1's potential impact on AI-assisted software development. No new technical benchmarks or capability details are disclosed.

Frontier Model Releases · Agent and Tool Ecosystem · Scott Wu · Devin · Cognition · +1 more