OpenAI's First Custom Silicon: Meet Jalapeno, Built With Broadcom

OpenAI has unveiled its first custom inference chip, codenamed Jalapeno, developed in partnership with Broadcom. The move signals OpenAI's push to reduce dependence on Nvidia and take greater control of its AI infrastructure stack.

OpenAI has officially entered the custom silicon race with Jalapeno, its first proprietary inference chip built in collaboration with Broadcom. The announcement marks a significant strategic shift for the company, which has historically relied almost exclusively on Nvidia GPUs to power its models.

What Is Jalapeno?

Jalapeno is an inference-optimized ASIC (Application-Specific Integrated Circuit), meaning it is designed specifically to run trained AI models — not to train them from scratch. This distinction matters: inference chips can be architected for maximum throughput and energy efficiency on a known workload, rather than the flexibility required during training.

Key details about the chip:

Designed for large-scale inference workloads, targeting OpenAI's production API infrastructure
Co-developed with Broadcom, one of the leading custom ASIC partners in the industry (also working with Google on its TPU line)
Intended to be deployed inside OpenAI's own data centers, rather than sold commercially
Complements, rather than replaces, existing Nvidia hardware in the near term

Why This Matters

Building custom silicon is expensive and technically demanding, but the economics become compelling at OpenAI's scale. The company reportedly serves hundreds of millions of users across ChatGPT and its API — meaning even marginal improvements in inference cost-per-token translate into enormous savings.

"This is about building the infrastructure for the next decade of AI," an OpenAI spokesperson noted in the announcement.

The move also reduces supply chain vulnerability. OpenAI's dependence on Nvidia has been a well-documented bottleneck — GPU shortages have constrained growth for AI companies across the board. A proprietary chip gives OpenAI more leverage and predictability.

The Broadcom Partnership

Broadcom has established itself as the go-to ASIC partner for hyperscalers looking to build custom AI accelerators. Its existing relationships with Google (TPUs) and Meta (MTIA) give it deep expertise in co-designing chips for ML inference at scale.

The partnership structure likely involves:

OpenAI defining the architecture requirements and workload targets
Broadcom handling the physical chip design and manufacturing coordination
Production through a leading-edge fab — likely TSMC

What Comes Next

Jalapeno appears to be a first-generation effort — a foundation that OpenAI can iterate on as it accumulates real-world deployment data. Training chips, which are far more complex to design, are not part of this announcement.

With Microsoft, Google, Amazon, and Meta all running custom silicon programs, OpenAI's entry into the space was widely anticipated. The question now is how quickly Jalapeno can reach meaningful deployment scale — and whether future generations will challenge Nvidia's dominance more directly.