OpenAI Announces Computer-Using Agent (CUA)
OpenAI has announced a Computer-Using Agent (CUA) capable of interacting with graphical user interfaces across web browsers and desktop applications. The system combines GPT-4o's vision capabilities with reinforcement learning to navigate and operate software as a human would. This represents OpenAI's entry into the agentic computer-control space, competing with similar efforts from Anthropic (Computer Use) and others. The announcement signals a significant step toward general-purpose AI agents that can autonomously complete multi-step tasks on computers.
Related guides (4)
Related events (8)
Introducing ChatGPT Agent
OpenAI has launched ChatGPT agent, a new capability that combines reasoning with tool use to autonomously complete multi-step tasks such as research, bookings, and presentation creation. The agent operates under user guidance, integrating thinking and acting in a unified workflow. This represents OpenAI's move to bring agentic capabilities directly into the ChatGPT product for general consumers.
CUGA on Hugging Face: Democratizing Configurable AI Agents
IBM Research has released CUGA (Configurable Universal Generative Agent) on Hugging Face, positioning it as a framework for building configurable AI agents. The announcement appears on the Hugging Face blog as a tier-2 commentary piece from IBM Research. Details on architecture, benchmarks, and specific capabilities are not available from the body text provided.
Introducing Operator
OpenAI has announced Operator, a new AI agent product capable of taking actions on the web on behalf of users. The announcement comes from OpenAI's official blog, signaling a major step toward autonomous web-based task execution. Operator represents OpenAI's entry into the agentic AI product space, where models can browse, interact with, and complete tasks across websites without direct user intervention.
ChatGPT Agent System Card
OpenAI has published a system card for its ChatGPT agent, an agentic model that integrates research, browser automation, and code execution tools into a unified system. The release is accompanied by safety documentation under OpenAI's Preparedness Framework. The system card details the safeguards and evaluations applied to the agent prior to deployment. This represents OpenAI's formal safety disclosure for a production agentic product.
Andrew Ng introduces OpenCoworker, an open-source desktop AI agent harness
Andrew Ng and collaborators Rohit Prasad and Devika Verma have released OpenCoworker, a free open-source desktop agent built by extending the aisuite library to support agent harnesses. The tool allows users to connect frontier LLMs (OpenAI, Anthropic, Google) or local models via Ollama to desktop tasks including file access, messaging, and workflow automation, with privacy as a design priority. Ng frames this as a response to data-retention concerns with commercial desktop agents, citing Anthropic's Fable release as a recent example of policy opacity. The post also provides a concise overview of the current desktop agent landscape and the shift toward LLM-driven agentic loops.
OpenAI Releases Universe: A Platform for Training AI Across Games, Websites, and Applications
OpenAI released Universe, a software platform designed to measure and train AI general intelligence across a broad range of environments including games, websites, and other applications. The platform aims to expose AI agents to the world's supply of software as training and evaluation environments. This represented an early effort to develop general-purpose AI agents capable of operating across diverse real-world interfaces.
Agent-S: Open Agentic Framework for Human-Like Computer Use
Agent-S is an open-source Python framework by Simular AI designed to enable AI agents to interact with computers in a human-like manner. The project has accumulated 11,388 GitHub stars with modest daily growth of 29 stars. It represents an entry in the growing space of computer-use agent frameworks targeting GUI and desktop automation tasks.
OpenAI Introduces Deep Research Agent
OpenAI has launched 'deep research,' an agentic capability that uses reasoning to synthesize large volumes of online information and complete multi-step research tasks autonomously. The feature is initially available to ChatGPT Pro users, with rollout to Plus and Team tiers to follow. It represents a step toward practical autonomous research agents built on OpenAI's reasoning model infrastructure.



