Almanac
← Events
7OpenAI Blog·1mo ago

How we built OWL, the new architecture behind our ChatGPT-based browser, Atlas

OpenAI published a technical deep dive into OWL, the architecture underpinning ChatGPT Atlas, a new Chromium-based browser. The post covers how OWL decouples Chromium internals to enable fast startup, rich UI rendering, and agentic browsing capabilities integrated with ChatGPT. This represents OpenAI's move into the browser layer as an agentic interface, extending ChatGPT's reach into web navigation and task execution.

Related guides (3)

Related events (8)

6Openai Blog·1mo ago·source ↗

Continuously hardening ChatGPT Atlas against prompt injection

OpenAI is applying automated red teaming trained with reinforcement learning to harden ChatGPT Atlas, its browser agent, against prompt injection attacks. The approach creates a proactive discover-and-patch loop to identify novel exploits before they can be weaponized. This work is framed as part of broader efforts to secure increasingly agentic AI systems against adversarial manipulation of external content.

5Openai Blog·1mo ago·source ↗

OpenAI Introduces Canvas: A New Writing and Coding Interface for ChatGPT

OpenAI announced Canvas, a new collaborative interface for ChatGPT designed to support writing and coding workflows. Canvas provides a side-by-side editing environment where users can work with ChatGPT on longer-form documents and code projects. The feature represents a shift from purely conversational interaction toward a more structured, document-centric collaboration model.

8Openai Blog·1mo ago·source ↗

Introducing ChatGPT Agent

OpenAI has launched ChatGPT agent, a new capability that combines reasoning with tool use to autonomously complete multi-step tasks such as research, bookings, and presentation creation. The agent operates under user guidance, integrating thinking and acting in a unified workflow. This represents OpenAI's move to bring agentic capabilities directly into the ChatGPT product for general consumers.

6Openai Blog·1mo ago·source ↗

WebGPT: Improving the factual accuracy of language models through web browsing

OpenAI fine-tuned GPT-3 to answer open-ended questions more accurately by giving it access to a text-based web browser. The system, called WebGPT, uses reinforcement learning from human feedback to learn to search the web, read pages, and cite sources. This work represents an early demonstration of retrieval-augmented generation and tool-use in large language models.

9Openai Blog·1mo ago·source ↗

OpenAI Spring Update: GPT-4o Announced, Expanded Free ChatGPT Capabilities

OpenAI announced GPT-4o, a new flagship model, alongside an expansion of capabilities available to free-tier ChatGPT users. GPT-4o represents a new omnimodal architecture capable of handling text, audio, and vision in a unified model. The announcement was made via a live demo event and marks a significant shift in OpenAI's product and model strategy.

7Openai Blog·1mo ago·source ↗

Introducing ChatGPT Search

OpenAI has launched ChatGPT Search, a feature that provides fast, timely answers with links to relevant web sources directly within ChatGPT. This integrates real-time web retrieval into the ChatGPT interface, moving the product closer to a search engine replacement. The announcement comes from OpenAI's official blog, indicating a significant product expansion.

7Openai Blog·1mo ago·source ↗

Introducing ChatGPT and Whisper APIs

OpenAI announced the release of dedicated APIs for ChatGPT (gpt-3.5-turbo) and Whisper, enabling developers to integrate conversational AI and speech-to-text capabilities into their applications. The ChatGPT API offered significant cost reductions compared to existing GPT-3.5 endpoints. This marked a major step in OpenAI's platform strategy, opening programmatic access to its most widely used consumer models.

3Github Trending·5d ago·source ↗

smol-ai/GodMode: multi-model AI chat browser aggregating ChatGPT, Claude, Bard, and others

GodMode is an open-source TypeScript desktop app that provides unified browser-based access to multiple AI chat interfaces including ChatGPT, Claude, Bard, Bing, and Llama 2. The project has accumulated 5,536 GitHub stars with modest recent momentum (+14 today). It functions as a thin wrapper enabling side-by-side or rapid switching between frontier chat products.