Almanac
Guide · Beginner

Amazon Web Services: The Cloud Backbone of the AI Era

Amazon Web ServicesBeginneractive·v1 · live·generated 6d ago

Part of these paths

TL;DRAmazon Web Services has become the central infrastructure layer for the AI industry's biggest bets — hosting frontier models from Anthropic and OpenAI, supplying custom silicon for training at gigawatt scale, and serving enterprises from regulated government agencies to global corporations. Its role has shifted from generic cloud provider to a strategic partner that co-develops the hardware and software that the next generation of AI runs on.

Key takeaways

  • Anthropic committed over $100 billion to AWS over ten years, securing up to 5 gigawatts of compute on Trainium2 through Trainium4 chips — with Amazon investing up to $25 billion total in Anthropic.
  • OpenAI signed a separate $38 billion multi-year AWS deal, making AWS the exclusive third-party cloud for OpenAI Frontier and bringing GPT models, Codex, and Managed Agents to Amazon Bedrock.
  • Claude models are approved for FedRAMP High and DoD Impact Level 4/5 workloads via AWS GovCloud, enabling use by U.S. federal agencies and defense organizations.
  • Iranian drone strikes damaged at least three AWS data centers in Bahrain and the UAE in early 2026 — the first known targeting of commercial cloud infrastructure during active conflict.
  • Hugging Face's model hub, inference containers, and embedding tools are deeply integrated with AWS SageMaker and Bedrock, extending open-source AI access to enterprise teams.
  • Amazon Bedrock serves as the managed delivery layer for multiple competing AI ecosystems simultaneously — Anthropic, OpenAI, and Hugging Face all distribute through it.

What AWS is — and why it matters for AI

Amazon Web Services is Amazon's cloud computing division: it rents computing power, storage, and software services to businesses over the internet. For most of its history, AWS was best known as the place companies host websites and run databases. Today, it has become something more specific and more consequential — the primary infrastructure layer on which the AI industry's largest bets are being built.

If you've used an AI assistant, there's a reasonable chance the model behind it was trained on AWS hardware, is served through AWS data centers, or reached you through an AWS-managed service called Amazon Bedrock.

The two services you need to know

Amazon Bedrock is AWS's AI model marketplace and managed hosting service. Think of it as a single storefront where a company can access AI models from multiple providers — Anthropic's Claude, OpenAI's GPT models, Hugging Face's open-source models — without buying servers or managing infrastructure. Bedrock also handles the compliance paperwork that regulated industries require, which is a big deal for hospitals, banks, and government agencies.

Amazon SageMaker is AWS's older, more hands-on platform for teams that want to build and fine-tune their own AI models. Hugging Face's tools and model library are deeply integrated here, making it a popular choice for data science teams working with open-source AI.

The Anthropic relationship: AWS as primary AI partner

The most significant AI relationship AWS has built is with Anthropic, the company behind the Claude family of models. What started as a $4 billion investment in late 2023 has grown into one of the largest commercial agreements in tech history: a commitment of over $100 billion to AWS technologies over ten years, securing up to 5 gigawatts of computing capacity for training and running Claude models.

To put that in perspective: a gigawatt is roughly the output of a large nuclear power plant. Anthropic is reserving up to five of those, running on Amazon's custom Trainium chips (purpose-built for AI training), to power the next generation of Claude models.

Amazon has also invested up to $25 billion in Anthropic directly, making it one of Anthropic's largest backers. In return, AWS is Anthropic's primary cloud and training partner — meaning the models that power Claude are predominantly trained on Amazon hardware.

Claude is also available to U.S. government customers through AWS GovCloud, with approval for FedRAMP High and Department of Defense Impact Level 4 and 5 workloads. That means federal agencies handling sensitive but unclassified information — and defense organizations — can use Claude through the same AWS infrastructure they already trust.

OpenAI joins the platform too

AWS isn't exclusive to Anthropic. In a separate deal worth $38 billion over multiple years, OpenAI made AWS the exclusive third-party cloud provider for its Frontier platform. OpenAI's GPT models, Codex, and Managed Agents are now available through Amazon Bedrock, and AWS is building a special "stateful runtime environment" for AI agents — a system that lets AI agents remember what they've done across long, multi-step tasks, rather than starting fresh each time.

This means AWS is now simultaneously hosting the two leading AI model families — Claude and GPT — on the same platform, competing for the same enterprise customers.

Hugging Face: open-source AI on AWS

For teams that prefer open-source models over proprietary ones, AWS has partnered with Hugging Face since 2021. Hugging Face's vast library of open-weight models is accessible through SageMaker and the Bedrock Marketplace. AWS's custom Inferentia2 chips — designed for cost-efficient AI inference (running models, as opposed to training them) — are supported by Hugging Face's deployment tools, giving enterprises a cheaper alternative to GPU-based serving.

When the infrastructure became a target

In early 2026, the strategic importance of AWS infrastructure became starkly visible. Iranian drone strikes damaged at least three AWS data centers in Bahrain and the UAE, disrupting cloud services across the region. The attacks coincided with revelations that AI models running on AWS infrastructure — including Anthropic's Claude — were being used in U.S. military targeting operations. It was the first known instance of commercial cloud infrastructure being physically attacked during active armed conflict, underscoring how deeply AI and cloud computing have become entangled with geopolitical risk.

Where AWS fits in the bigger picture

AWS is not an AI lab — it doesn't build frontier models itself. Its role is to be the ground on which those models are trained, hosted, and delivered to the world. As AI models have grown larger and more power-hungry, that role has become harder to replicate and more strategically valuable. The multi-billion-dollar deals flowing to AWS from Anthropic, OpenAI, and others reflect a simple reality: building and running frontier AI at scale requires infrastructure that only a handful of organizations on Earth can provide.

AWS as the AI infrastructure hub

Major AI Lab Relationships with AWS

AI LabDeal SizeWhat AWS ProvidesKey Product
Anthropic$100B+ over 10 years; up to $25B investmentTrainium2–4 training compute, Bedrock hosting, GovCloudClaude on Amazon Bedrock
OpenAI$38B multi-yearBedrock stateful agent runtime, exclusive third-party cloud for FrontierGPT models, Codex, Managed Agents on Bedrock
Hugging FaceStrategic partnership (2021–present)SageMaker, Inferentia2, Bedrock MarketplaceOpen-weight models via Bedrock and SageMaker

Deal figures and roles drawn from the events bundle; unknown cells render —.

Timeline

  1. Hugging Face and Amazon partner to integrate Transformers into SageMaker

  2. Hugging Face and AWS announce broader strategic partnership

  3. Claude 3 models made available to U.S. Intelligence Community via AWS GovCloud

  4. Amazon invests $8B total in Anthropic; AWS named primary cloud and training partner

  5. AWS and OpenAI announce $38B multi-year strategic partnership

  6. Iranian drones strike AWS data centers in Bahrain and UAE during active conflict

  7. Anthropic and Amazon expand to $100B+ deal for up to 5 GW of Trainium compute

Related topics

AnthropicClaudeAmazon BedrockAmazon SageMakerOpenAIHugging FaceNVIDIAGoogle CloudAWS Inferentia2

FAQ

What is Amazon Bedrock?

Bedrock is AWS's managed service for accessing and deploying AI models — think of it as a single storefront where enterprises can run models from Anthropic, OpenAI, Hugging Face, and others without managing their own servers.

Why do AI companies partner with AWS instead of just building their own data centers?

Training frontier AI models requires enormous amounts of specialized hardware, power, and networking that take years to build — AWS already has it, plus the enterprise relationships and compliance certifications (like FedRAMP) that make selling to governments and large companies much faster.

What are Trainium chips?

Trainium are custom AI accelerator chips designed by Amazon specifically for training large AI models; Anthropic's deal with AWS involves using Trainium2 through Trainium4 chips for training Claude at gigawatt-scale compute.

Can the U.S. government use AI models through AWS?

Yes — Claude models are approved for FedRAMP High and DoD Impact Level 4/5 workloads via AWS GovCloud, and OpenAI's models are also available on AWS, giving federal agencies access to frontier AI through familiar, compliant procurement channels.

Was AWS ever physically attacked?

Yes — in early 2026, Iranian drone strikes damaged at least three AWS data centers in Bahrain and the UAE, disrupting cloud services in the region in what was the first known targeting of commercial cloud infrastructure during active conflict.

Stay current

Call Me Almanac pairs the week's AI news with guides like this one — Midweek & Sunday.

Versions

  • v1live6d ago

Related guides (4)

More on Amazon Web Services (6)

9Anthropic News·1mo ago·source ↗

Anthropic and Amazon Expand Collaboration for Up to 5 Gigawatts of New Compute

Anthropic has signed a major expanded agreement with Amazon committing over $100 billion to AWS technologies over ten years, securing up to 5GW of compute capacity for training and deploying Claude across Trainium2 through Trainium4 chips. Amazon is investing an additional $5 billion in Anthropic today, with up to $20 billion more possible in the future, building on $8 billion previously invested. The deal includes nearly 1GW of Trainium2 and Trainium3 capacity coming online by end of 2026, expanded inference in Asia and Europe, and the full Claude Platform becoming available directly within AWS. Anthropic disclosed its run-rate revenue has surpassed $30 billion, up from approximately $9 billion at end of 2025.

8Openai Blog·1mo ago·source ↗

OpenAI and Amazon Announce Strategic Partnership on AWS

OpenAI and Amazon have announced a strategic partnership that will bring OpenAI's Frontier platform to Amazon Web Services. The deal expands AI infrastructure capabilities, enables custom model development, and supports enterprise AI agent deployments. This represents a significant cloud distribution and infrastructure alignment between two major players in the AI ecosystem.

8Openai Blog·1mo ago·source ↗

AWS and OpenAI Announce $38B Multi-Year Strategic Partnership

OpenAI and Amazon Web Services have announced a multi-year strategic partnership valued at $38 billion. AWS will supply infrastructure and compute capacity to support OpenAI's next-generation model training and deployment workloads. The deal represents a major cloud infrastructure commitment for OpenAI alongside its existing Microsoft Azure relationship.

8The Batch·18d ago·source ↗

OpenAI and Amazon Partner to Build Stateful Runtime Environment for AI Agents on AWS

OpenAI and Amazon Web Services announced a partnership to build a stateful runtime environment for AI agents, designed to manage agent working states including memories, tool connections, and user permissions, running on Amazon Bedrock. The deal includes a $15 billion Amazon investment in OpenAI (with up to $35 billion more contingent on conditions), a $100 billion expansion of compute commitments using Amazon Trainium chips over 8 years, and makes AWS the exclusive third-party cloud provider for OpenAI Frontier. The arrangement exploits a legal distinction between stateful runtime environments and stateless APIs, allowing OpenAI to work with AWS while Microsoft retains exclusive rights to host OpenAI's stateless API calls. This marks a significant loosening of OpenAI's exclusive cloud relationship with Microsoft, mirroring a parallel diversification trend with Anthropic across cloud providers.

8Anthropic News·16d ago·source ↗

Anthropic and AWS expand partnership with $4B investment and Trainium hardware collaboration

Anthropic announced an expanded partnership with Amazon Web Services, including a new $4 billion investment that brings Amazon's total stake to $8 billion, while establishing AWS as Anthropic's primary cloud and training partner. The collaboration includes deep hardware-software co-development on AWS Trainium accelerators, with Anthropic engineers writing low-level kernels and contributing to the AWS Neuron software stack to optimize model training from the silicon up. Claude on Amazon Bedrock is described as core infrastructure for tens of thousands of enterprises, with named deployments at Pfizer, Intuit, Perplexity, and the European Parliament. The deal also extends Claude's availability to AWS GovCloud and classified cloud regions for government customers.

4Hugging Face Blog·1mo ago·source ↗

Building Blocks for Foundation Model Training and Inference on AWS

This Hugging Face blog post, published in partnership with Amazon, outlines the infrastructure components available on AWS for training and serving foundation models. It covers the key building blocks including compute, storage, networking, and managed services relevant to large-scale AI workloads. The post serves as a technical overview of AWS's positioning in the foundation model infrastructure space.