8·OpenAI Blog·1mo ago

OpenAI Announces Computer-Using Agent (CUA)

OpenAI has announced a Computer-Using Agent (CUA) capable of interacting with graphical user interfaces across web browsers and desktop applications. The system combines GPT-4o's vision capabilities with reinforcement learning to navigate and operate software as a human would. This represents OpenAI's entry into the agentic computer-control space, competing with similar efforts from Anthropic (Computer Use) and others. The announcement signals a significant step toward general-purpose AI agents that can autonomously complete multi-step tasks on computers.

Frontier Model Releases Enterprise Deployment Patterns Agent and Tool Ecosystem Multimodal Progress GPT-4o Computer-Using Agent OpenAI Claude Computer Use Anthropic

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word

Frontier Model Releases·Topic guide

Frontier Model Releases: The Race From Language to Action

Anthropic

Anthropic: The AI Safety Company at the Center of the Frontier

Multimodal Progress·Topic guide

Multimodal Progress: How AI Learned to See, Hear, and Act

Related events (8)

8·OpenAI Blog·1mo ago

Introducing ChatGPT Agent

OpenAI has launched ChatGPT agent, a new capability that combines reasoning with tool use to autonomously complete multi-step tasks such as research, bookings, and presentation creation. The agent operates under user guidance, integrating thinking and acting in a unified workflow. This represents OpenAI's move to bring agentic capabilities directly into the ChatGPT product for general consumers.

Frontier Model Releases Enterprise Deployment Patterns ChatGPT ChatGPT agent OpenAI +1 more

4·Hugging Face Blog·1mo ago

CUGA on Hugging Face: Democratizing Configurable AI Agents

IBM Research has released CUGA (Configurable Universal Generative Agent) on Hugging Face, positioning it as a framework for building configurable AI agents. The announcement appears on the Hugging Face blog as a tier-2 commentary piece from IBM Research. Details on architecture, benchmarks, and specific capabilities are not available from the body text provided.

Enterprise Deployment Patterns Agent and Tool Ecosystem IBM Research Hugging Face CUGA

8·OpenAI Blog·1mo ago

Introducing Operator

OpenAI has announced Operator, a new AI agent product capable of taking actions on the web on behalf of users. The announcement comes from OpenAI's official blog, signaling a major step toward autonomous web-based task execution. Operator represents OpenAI's entry into the agentic AI product space, where models can browse, interact with, and complete tasks across websites without direct user intervention.

Frontier Model Releases Enterprise Deployment Patterns OpenAI Operator +1 more

8·OpenAI Blog·1mo ago

ChatGPT Agent System Card

OpenAI has published a system card for its ChatGPT agent, an agentic model that integrates research, browser automation, and code execution tools into a unified system. The release is accompanied by safety documentation under OpenAI's Preparedness Framework. The system card details the safeguards and evaluations applied to the agent prior to deployment. This represents OpenAI's formal safety disclosure for a production agentic product.

Frontier Model Releases Evaluation and Benchmarking ChatGPT agent Preparedness Framework OpenAI +3 more

6·The Batch·8d ago

Andrew Ng introduces OpenCoworker, an open-source desktop AI agent harness

Andrew Ng and collaborators Rohit Prasad and Devika Verma have released OpenCoworker, a free open-source desktop agent built by extending the aisuite library to support agent harnesses. The tool allows users to connect frontier LLMs (OpenAI, Anthropic, Google) or local models via Ollama to desktop tasks including file access, messaging, and workflow automation, with privacy as a design priority. Ng frames this as a response to data-retention concerns with commercial desktop agents, citing Anthropic's Fable release as a recent example of policy opacity. The post also provides a concise overview of the current desktop agent landscape and the shift toward LLM-driven agentic loops.

Open Weights Progress Agent and Tool Ecosystem Ollama DeepLearning.AI aisuite +7 more

5·OpenAI Blog·1mo ago

OpenAI Releases Universe: A Platform for Training AI Across Games, Websites, and Applications

OpenAI released Universe, a software platform for measuring and training an AI's general intelligence across a broad range of environments, including games, websites, and other applications. The platform aimed to expose AI agents to the world's supply of software as training and evaluation environments. It represented an early effort to develop general-purpose AI agents capable of operating across diverse real-world interfaces.

Evaluation and Benchmarking Agent and Tool Ecosystem Universe OpenAI

4·GitHub Trending·1mo ago

Agent-S: Open Agentic Framework for Human-Like Computer Use

Agent-S is an open-source Python framework by Simular AI designed to enable AI agents to interact with computers in a human-like manner. The project has accumulated 11,388 GitHub stars with modest daily growth of 29 stars. It represents an entry in the growing space of computer-use agent frameworks targeting GUI and desktop automation tasks.

Open Weights Progress Agent and Tool Ecosystem Agent-S Simular AI

7·OpenAI Blog·1mo ago

OpenAI Introduces Deep Research Agent

OpenAI has launched 'deep research,' an agentic capability that uses reasoning to synthesize large volumes of online information and complete multi-step research tasks autonomously. The feature is initially available to ChatGPT Pro users, with rollout to Plus and Team tiers to follow. It represents a step toward practical autonomous research agents built on OpenAI's reasoning model infrastructure.

Frontier Model Releases Enterprise Deployment Patterns ChatGPT Deep Research ChatGPT Plus OpenAI +2 more