Step 9 of 9 in Alignment and RLHF: from first principles to frontier techniquesFinish path →

Guide · Beginner

OpenAI: The Lab That Made AI a Household Word

OpenAIBeginneractive·v5 · live·generated 6d ago

Part of these paths

Agent and Tool Ecosystem · Step 3 of 9
AI for the curious newcomer · Step 2 of 7
AI Safety Research · Step 4 of 6
Alignment and RLHF · Step 9 of 9
Enterprise Deployment Patterns · Step 1 of 12
Evaluation and Benchmarking · Step 1 of 10
Frontier Model Releases · Step 1 of 10
Inference Economics · Step 6 of 9
Long Context Evolution · Step 7 of 10
Multimodal Progress · Step 3 of 7
Open weights vs. the closed frontier · Step 5 of 7
Regulatory Developments · Step 1 of 9
Training Infrastructure · Step 6 of 8

TL;DROpenAI started as a nonprofit research lab with a mission to build artificial general intelligence safely, and grew into the company that put AI in front of hundreds of millions of people through ChatGPT. It has since become one of the most heavily funded technology companies in history, racing to push the frontier of what AI can do — from writing and coding to solving decades-old math problems — while navigating fierce competition, government contracts, and questions about safety and governance.

Key takeaways

ChatGPT's November 2022 launch was a cultural turning point that made large language models accessible to everyday users for the first time.
OpenAI raised $110 billion at a $730 billion valuation in early 2026, with backers including SoftBank, NVIDIA, and Amazon — one of the largest private funding rounds in history.
Its GPT-5 family (GPT-5, GPT-5.2, GPT-5.4, GPT-5.5) represents a rapid succession of frontier model releases spanning mid-2025 to mid-2026.
An OpenAI model disproved an 80-year-old conjecture in discrete geometry, and GPT-5.2 proposed a novel result in theoretical physics — both verified by researchers.
OpenAI signed a formal contract with the U.S. Department of War for classified AI deployments, while rival Anthropic was banned from the same contracts over safety restrictions.
OpenAI released two open-weight models (gpt-oss-120b and gpt-oss-20b) under Apache 2.0, a significant shift for a company that had kept its frontier models proprietary.

What OpenAI is

OpenAI is an AI research company and product developer best known for creating ChatGPT and the GPT family of large language models. Founded with the goal of building artificial general intelligence (AGI) — AI that can perform any intellectual task a human can — it has become the most publicly visible organization in the modern AI boom. Its products are used by hundreds of millions of people, and its research has shaped how the entire industry thinks about building and scaling AI systems.

Why it matters

Before ChatGPT, powerful AI models existed mostly inside research labs and developer APIs. OpenAI's November 2022 launch of ChatGPT changed that: it gave anyone with a browser a way to have a real conversation with a capable AI, and the world noticed. Within months, AI had moved from a niche technical topic to front-page news. That moment didn't just grow OpenAI's business — it accelerated investment, competition, and public debate across the entire field.

The research foundation

OpenAI's influence starts with its research. A few papers in particular set the direction for the whole industry:

GPT-1 (2018) showed that pre-training a model on lots of text, then fine-tuning it for specific tasks, was a powerful and general approach.
Scaling Laws (2020) gave labs a mathematical roadmap: model performance improves predictably as you add more compute, data, and parameters. This paper justified the race to build ever-larger models.
GPT-3 (2020) put that roadmap into practice with a 175-billion-parameter model that could perform a huge range of tasks with just a few examples — no task-specific training required.
CLIP (2021) extended the same ideas to images, teaching a model to understand pictures using natural language descriptions.

These weren't just academic contributions. They became the blueprint that OpenAI — and its competitors — followed to build the products people use today.

The model family: from GPT-4 to GPT-5.5

OpenAI's model releases have accelerated sharply. GPT-4 (March 2023) was the first model to accept both images and text as input and to perform at a human level on a range of professional benchmarks. GPT-4o (May 2024) went further, handling audio, vision, and text in a single unified model in real time.

Then came a new idea: instead of just training models to be smarter, what if you let them think longer before answering? That's the core of the o1 series (September 2024), which uses a technique called chain-of-thought reasoning — the model works through a problem step by step, like showing its work — to tackle hard math, science, and coding problems. OpenAI called this "inference-time compute," a new axis of improvement beyond just making the model bigger.

The GPT-5 family, released from August 2025 onward, represents the current frontier: GPT-5, GPT-5.2, GPT-5.4, and GPT-5.5 have shipped in rapid succession, each claiming improvements in reasoning, coding, and speed. GPT-5.4 introduced a 1-million-token context window — meaning it can read and reason over roughly a book's worth of text in a single session — along with the ability to control a computer directly (called "computer use"). Smaller, faster variants like GPT-5.4 mini and nano target cost-sensitive applications and automated pipelines.

Doing new science

One of the most striking recent developments is OpenAI's models contributing to genuine scientific discovery — not just summarizing known facts, but producing new ones.

GPT-5.2 proposed a novel formula for a gluon amplitude in theoretical physics, which was subsequently proved and verified by researchers.
An OpenAI model disproved the Erdős planar unit distance conjecture, an 80-year-old open problem in discrete geometry — reportedly at a compute cost under $1,000.

These results suggest that frontier AI is beginning to function as a research tool capable of original work, not just a very fast search engine.

The business: funding, infrastructure, and partnerships

OpenAI has raised capital at a scale that would have seemed implausible a few years ago. In February 2026 alone, it announced a $110 billion investment round at a $730 billion valuation, with SoftBank, NVIDIA, and Amazon among the backers. A separate earlier round brought in $122 billion more. These funds are earmarked for compute infrastructure and model development.

The Stargate Project, announced in January 2025, is a joint venture targeting up to $500 billion in AI infrastructure investment in the United States — one of the largest such commitments in history.

On the cloud side, Microsoft has been OpenAI's primary partner since a $1 billion investment in 2019 that made Azure OpenAI's exclusive cloud provider. That relationship has since expanded enormously, but OpenAI has also struck a major deal with Amazon Web Services — a stateful runtime environment for AI agents on AWS, backed by a $15 billion Amazon investment and a $100 billion compute commitment — while keeping Microsoft's exclusive rights to host stateless API calls intact.

Government, defense, and the safety debate

OpenAI signed a formal contract with the U.S. Department of War in early 2026, allowing its models to be used "for all lawful purposes" in classified environments, with negotiated safety guardrails. This put it in direct contrast with Anthropic, which was formally designated a national security supply-chain risk after refusing to remove restrictions on its models for autonomous weapons and mass domestic surveillance.

The episode is a live example of the tension every frontier AI lab faces: how much to restrict what your models can do, and who gets to decide.

Open weights and the competitive landscape

For most of its history, OpenAI kept its most capable models proprietary. In August 2025, it released gpt-oss-120b and gpt-oss-20b under the Apache 2.0 license — open-weight models that anyone can download, run, and build on. This was a notable strategic shift, likely in response to strong open-source competition (including DeepSeek's R1, which claimed performance parity with OpenAI's o1 at a fraction of the cost).

Where it's heading

The pattern across these events is one of simultaneous expansion on every front: more capable models released faster, more infrastructure to train and serve them, more partnerships to distribute them, and more specialized versions (like GPT-Rosalind for life sciences) targeting specific industries. The open-weights move suggests OpenAI is also competing for developer mindshare, not just enterprise contracts. The core tension — building the most powerful AI in the world while managing what it can be used for — shows no sign of resolving soon.

OpenAI's model lineage: from GPT-1 to GPT-5.5

Timeline

FAQ

What is OpenAI, in plain terms?

OpenAI is the company that makes ChatGPT and the GPT family of AI models. It started as a research lab focused on building AI safely and has grown into one of the most valuable technology companies in the world.

What is ChatGPT?

ChatGPT is OpenAI's conversational AI product — a chat interface powered by its GPT models that can answer questions, write text, debug code, and more. Its November 2022 launch brought AI to a mass audience.

How does OpenAI make money?

OpenAI sells access to its models through the ChatGPT product (free and paid tiers) and through its API, which developers and enterprises use to build their own AI-powered applications.

What is the Stargate Project?

Stargate is a major infrastructure initiative OpenAI announced in January 2025, targeting up to $500 billion in AI compute and data center investment in the United States, in partnership with SoftBank and others.

Does OpenAI have open-source models?

Yes — in August 2025, OpenAI released gpt-oss-120b and gpt-oss-20b under the Apache 2.0 license, a notable shift for a company that had historically kept its frontier models proprietary.

How does OpenAI compare to Anthropic?

Both are frontier AI labs, but they've taken different stances on government and military use: OpenAI signed a contract with the U.S. Department of War for classified AI deployments, while Anthropic was banned from those contracts after refusing to remove safety restrictions on its models.

Stay current

Call Me Almanac pairs the week's AI news with guides like this one — Midweek & Sunday.

Versions

v5live6d ago
v4superseded11d ago
v3rejected12d ago
v2superseded17d ago
v1superseded17d ago

Related guides (4)

OpenAI

OpenAI: From Research Lab to Frontier AI Infrastructure Company

Read asIn-depth

Google DeepMind

Google DeepMind: The Lab Behind Gemini, AlphaFold, and Frontier AI

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Mistral AI

Mistral AI: Europe's Open-Weight Frontier Lab

Read asIn-depth

More on OpenAI (6)

4One Useful Thing·1mo ago·source ↗

Sign of the Future: GPT-5.5 Commentary

A tier-2 commentary piece from One Useful Thing discusses GPT-5.5 as a notable step in the AI capability curve. The piece frames the release as a signal of future AI development trajectories. As a commentary source, it likely offers analysis of what GPT-5.5's capabilities imply rather than primary technical reporting.

Frontier Model Releases One Useful Thing OpenAI GPT-5.5

7Openai Blog·1mo ago·source ↗

Databricks brings GPT-5.5 to enterprise agent workflows

Databricks is integrating GPT-5.5 into its enterprise agent workflows following the model's state-of-the-art performance on the OfficeQA Pro benchmark. The partnership represents a deployment of OpenAI's latest model within a major data and AI platform. This signals continued enterprise adoption of frontier models for agentic use cases.

Frontier Model Releases Evaluation and Benchmarking Databricks OpenAI OfficeQA Pro +3 more

6Berkeley Ai Research (Bair) Blog·1mo ago·source ↗

Adaptive Parallel Reasoning: The Next Paradigm in Efficient Inference Scaling

A BAIR blog post surveys recent progress in parallel reasoning for LLMs, covering methods from simple self-consistency and Best-of-N sampling through structured search (Tree of Thoughts, MCTS) to newer adaptive approaches including ParaThinker, GroupThink, and Hogwild! Inference. The core motivation is that sequential reasoning scales linearly with exploration depth, causing latency, context-rot, and compute inefficiency. Adaptive parallel reasoning aims to let models themselves decide when and how to decompose tasks into concurrent threads, rather than imposing fixed parallel structure externally. The post frames this as an emerging inference-time scaling paradigm with implications for agentic and complex reasoning workloads.

Long Context Evolution Frontier Model Releases ParaThinker Berkeley AI Research (BAIR)DeepSeek V4 +11 more

4Don'T Worry About The Vase·1mo ago·source ↗

Cyber Lack of Security and AI Governance

Zvi Mowshowitz's commentary addresses the intersection of AI capabilities and cybersecurity, framing recent developments around GPT-5.5 and a 'Mythos Moment' as catalysts for both internet security patching efforts and emerging AI regulatory frameworks. The piece situates cybersecurity as the underreported background story of current AI progress. It appears to analyze governance and safety implications of frontier model releases in the context of cyber vulnerabilities.

Frontier Model Releases AI Safety Research Mythos Moment OpenAI Zvi Mowshowitz +2 more

4Openai Blog·1mo ago·source ↗

Sea Limited's CPO on Deploying OpenAI Codex Across Engineering Teams

Sea Limited's Chief Product Officer David Chen discusses the company's decision to deploy OpenAI Codex across its engineering teams to accelerate AI-native software development in Asia. The piece frames Codex as a tool for agentic software development workflows. This is a customer perspective piece published on OpenAI's blog, highlighting enterprise adoption of Codex in a major Southeast Asian technology conglomerate.

Enterprise Deployment Patterns Agent and Tool Ecosystem Sea Limited David Chen OpenAI +1 more

5Latent Space·1mo ago·source ↗

AINews: Codex Rises, Claude Meters Programmatic Usage

A Latent Space AINews digest covering trends in major coding agents, with focus on OpenAI Codex's resurgence and Anthropic's introduction of usage metering for programmatic Claude access. The piece tracks the evolving competitive landscape among AI coding tools. As a tier-2 commentary source, it synthesizes recent developments rather than breaking new ground.

Frontier Model Releases Inference Economics Claude OpenAI OpenAI Codex +3 more