OpenAI unveils first custom AI chip, manufactured by Broadcom
OpenAI has announced its first custom silicon chip, built in partnership with Broadcom. This marks a significant strategic move for OpenAI to reduce dependence on Nvidia and control its own inference and training infrastructure. Custom chip development is a major capital and engineering commitment that signals OpenAI's long-term infrastructure ambitions.
Related guides (3)
Related events (8)
OpenAI and Broadcom Announce Strategic Collaboration to Deploy 10 GW of OpenAI-Designed AI Accelerators
OpenAI and Broadcom have announced a multi-year strategic partnership targeting deployment of 10 gigawatts of OpenAI-designed AI accelerators by 2029. The collaboration involves co-developing next-generation AI accelerator systems and Ethernet networking solutions aimed at scalable, energy-efficient AI infrastructure. This represents OpenAI's continued push into custom silicon, reducing dependence on third-party chip suppliers like NVIDIA.
OpenAI and Broadcom unveil Jalapeño, a custom LLM inference chip
OpenAI and Broadcom have jointly announced Jalapeño, a custom silicon chip designed specifically for LLM inference workloads. The chip targets improvements in performance, efficiency, and scale across AI systems. This marks OpenAI's entry into custom inference silicon, reducing dependence on third-party GPU suppliers for inference at scale.
OpenAI partners with Cerebras for 750MW of high-speed AI compute
OpenAI has announced a partnership with Cerebras Systems to add 750MW of AI compute capacity. The collaboration is aimed at reducing inference latency and improving response speeds for ChatGPT and other real-time AI workloads. Cerebras is known for its wafer-scale chip architecture optimized for fast inference.
OpenAI and Foxconn Collaborate on U.S. AI Infrastructure Hardware Manufacturing
OpenAI and Foxconn have announced a partnership to design and manufacture next-generation AI infrastructure hardware in the United States. The collaboration targets multiple generations of data-center systems and aims to build key components domestically to strengthen U.S. AI supply chains. The announcement aligns with broader efforts to onshore AI hardware production amid geopolitical and supply chain pressures.
The Batch: Jalapeño inference chip, Fugu multi-agent system, Claude Tag, Robin bio-agent, and Getty-OpenAI deal
OpenAI and Broadcom announced Jalapeño, OpenAI's first custom inference chip, designed in nine months with AI-assisted design and showing better performance-per-watt than current accelerators; engineering samples are already running GPT-5.3-Codex-Spark with datacenter deployment planned by end of 2026. Sakana AI released Fugu, a multi-agent routing system that scored 73.7% on SWE-Bench Pro, outperforming Claude Opus 4.8 and GPT-5.5 while remaining below the inaccessible Fable 5. Additional items cover Anthropic's Claude Tag Slack integration for async team collaboration, Seedance 2.5 video model improvements, the Robin autonomous biology research agent that identified a novel drug candidate, and a Getty Images licensing partnership with OpenAI.
OpenAI and NVIDIA Announce Strategic Partnership to Deploy 10 Gigawatts of AI Datacenters
OpenAI and NVIDIA have announced a strategic partnership targeting deployment of 10 gigawatts of AI datacenter capacity powered by NVIDIA systems. The first phase of the buildout is scheduled to launch in 2026. This represents a major infrastructure commitment between two of the most prominent organizations in AI compute and model development.
AMD and OpenAI Announce Strategic Partnership to Deploy 6 Gigawatts of AMD GPUs
AMD and OpenAI have entered a multi-year strategic partnership to deploy 6 gigawatts of AMD Instinct GPUs for OpenAI's AI infrastructure, with 1 gigawatt planned for 2026. The deal represents a significant diversification of OpenAI's compute supply beyond its existing NVIDIA dependency. This is one of the largest publicly announced GPU deployment commitments in the industry.
Data Points: OpenAI and Microsoft sever their exclusive relationship
This edition of The Batch covers several major AI industry developments: OpenAI has revised its partnership with Microsoft, ending exclusivity while retaining Microsoft as primary cloud partner through 2032 and gaining freedom to deploy on AWS and Google Cloud. DeepSeek released V4 model weights featuring 1M-token context and Huawei Ascend chip optimization, though it trails leading open and closed models on aggregate benchmarks. Google and Amazon are deepening investments in Anthropic with up to $40B and $25B respectively in funding-for-compute deals, and an agentic AI system autonomously designed a functional RISC-V CPU from a 219-word spec in 12 hours.


