Custom AI Inference Chips: Why OpenAI, Google, and Meta Are Building Beyond NVIDIA

Custom AI Inference Chips: Why OpenAI, Google, and Meta Are Building Beyond NVIDIA

Introduction On June 24, 2026, OpenAI announced Jalapeño — its first custom LLM inference accelerator, built with Broadcom. OpenAI is not the first to do this. Google has run TPUs since 2016. Meta ships MTIA chips. Amazon built Trainium and Inferentia. Every major AI lab is now building hardware it owns rather than buying it … Read more