8Hacker News (AI-filtered, score >= 200)·5d ago

OpenAI unveils first custom AI chip, manufactured by Broadcom

OpenAI has announced its first custom silicon chip, built in partnership with Broadcom. This marks a significant strategic move for OpenAI to reduce dependence on Nvidia and control its own inference and training infrastructure. Custom chip development is a major capital and engineering commitment that signals OpenAI's long-term infrastructure ambitions.

Training Infrastructure Inference Economics Broadcom OpenAI

Related guides (3)

Training InfrastructureTopic guide

Training Infrastructure: The Compute Arms Race Powering Modern AI

Read asBeginner In-depth

OpenAI

OpenAI: The Lab That Made AI a Household Word

Read asBeginner In-depth

Inference EconomicsTopic guide

Inference Economics: The Cost of Running AI in Production

Read asBeginner In-depth

Related events (8)

8Openai Blog·1mo ago·source ↗

OpenAI and Broadcom Announce Strategic Collaboration to Deploy 10 GW of OpenAI-Designed AI Accelerators

OpenAI and Broadcom have announced a multi-year strategic partnership targeting deployment of 10 gigawatts of OpenAI-designed AI accelerators by 2029. The collaboration involves co-developing next-generation AI accelerator systems and Ethernet networking solutions aimed at scalable, energy-efficient AI infrastructure. This represents OpenAI's continued push into custom silicon, reducing dependence on third-party chip suppliers like NVIDIA.

Training Infrastructure Inference Economics Broadcom NVIDIA OpenAI +2 more

8Openai Blog·5d ago·source ↗

OpenAI and Broadcom unveil Jalapeño, a custom LLM inference chip

OpenAI and Broadcom have jointly announced Jalapeño, a custom silicon chip designed specifically for LLM inference workloads. The chip targets improvements in performance, efficiency, and scale across AI systems. This marks OpenAI's entry into custom inference silicon, reducing dependence on third-party GPU suppliers for inference at scale.

Training Infrastructure Inference Economics Broadcom OpenAI Jalapeño

7Openai Blog·1mo ago·source ↗

OpenAI partners with Cerebras for 750MW of high-speed AI compute

OpenAI has announced a partnership with Cerebras Systems to add 750MW of AI compute capacity. The collaboration is aimed at reducing inference latency and improving response speeds for ChatGPT and other real-time AI workloads. Cerebras is known for its wafer-scale chip architecture optimized for fast inference.

Training Infrastructure Frontier Model Releases ChatGPT Cerebras Systems OpenAI +2 more

6Openai Blog·1mo ago·source ↗

OpenAI and Foxconn Collaborate on U.S. AI Infrastructure Hardware Manufacturing

OpenAI and Foxconn have announced a partnership to design and manufacture next-generation AI infrastructure hardware in the United States. The collaboration targets multiple generations of data-center systems and aims to build key components domestically to strengthen U.S. AI supply chains. The announcement aligns with broader efforts to onshore AI hardware production amid geopolitical and supply chain pressures.

Training Infrastructure Inference Economics Foxconn OpenAI +1 more

7The Batch·4d ago·source ↗

The Batch: Jalapeño inference chip, Fugu multi-agent system, Claude Tag, Robin bio-agent, and Getty-OpenAI deal

OpenAI and Broadcom announced Jalapeño, OpenAI's first custom inference chip, designed in nine months with AI-assisted design and showing better performance-per-watt than current accelerators; engineering samples are already running GPT-5.3-Codex-Spark with datacenter deployment planned by end of 2026. Sakana AI released Fugu, a multi-agent routing system that scored 73.7% on SWE-Bench Pro, outperforming Claude Opus 4.8 and GPT-5.5 while remaining below the inaccessible Fable 5. Additional items cover Anthropic's Claude Tag Slack integration for async team collaboration, Seedance 2.5 video model improvements, the Robin autonomous biology research agent that identified a novel drug candidate, and a Getty Images licensing partnership with OpenAI.

Training Infrastructure Frontier Model Releases Seedance 2.0 Fugu Microsoft +17 more

8Openai Blog·1mo ago·source ↗

OpenAI and NVIDIA Announce Strategic Partnership to Deploy 10 Gigawatts of AI Datacenters

OpenAI and NVIDIA have announced a strategic partnership targeting deployment of 10 gigawatts of AI datacenter capacity powered by NVIDIA systems. The first phase of the buildout is scheduled to launch in 2026. This represents a major infrastructure commitment between two of the most prominent organizations in AI compute and model development.

Training Infrastructure Frontier Model Releases NVIDIA OpenAI +1 more

8Openai Blog·1mo ago·source ↗

AMD and OpenAI Announce Strategic Partnership to Deploy 6 Gigawatts of AMD GPUs

AMD and OpenAI have entered a multi-year strategic partnership to deploy 6 gigawatts of AMD Instinct GPUs for OpenAI's AI infrastructure, with 1 gigawatt planned for 2026. The deal represents a significant diversification of OpenAI's compute supply beyond its existing NVIDIA dependency. This is one of the largest publicly announced GPU deployment commitments in the industry.

Training Infrastructure Frontier Model Releases AMD Instinct NVIDIA OpenAI +2 more

7The Batch·28d ago·source ↗

Data Points: OpenAI and Microsoft sever their exclusive relationship

This edition of The Batch covers several major AI industry developments: OpenAI has revised its partnership with Microsoft, ending exclusivity while retaining Microsoft as primary cloud partner through 2032 and gaining freedom to deploy on AWS and Google Cloud. DeepSeek released V4 model weights featuring 1M-token context and Huawei Ascend chip optimization, though it trails leading open and closed models on aggregate benchmarks. Google and Amazon are deepening investments in Anthropic with up to $40B and $25B respectively in funding-for-compute deals, and an agentic AI system autonomously designed a functional RISC-V CPU from a 219-word spec in 12 hours.

Training Infrastructure Frontier Model Releases Google Cloud Google TPU knowledge distillation +25 more