LLM Infrastructure

Custom AI Inference Chips: Why OpenAI, Google, and Meta Are Building Beyond NVIDIA

June 26, 2026 by Sohel Patel

Introduction On June 24, 2026, OpenAI announced Jalapeño — its first custom LLM inference accelerator, built with Broadcom. OpenAI is not the first to do this. Google has run TPUs since 2016. Meta ships MTIA chips. Amazon built Trainium and Inferentia. Every major AI lab is now building hardware it owns rather than buying it … Read more