OpenAI has officially entered the custom silicon race with Jalapeno, its first proprietary inference chip built in collaboration with Broadcom. The announcement marks a significant strategic shift for the company, which has historically relied almost exclusively on Nvidia GPUs to power its models.
What Is Jalapeno?
Jalapeno is an inference-optimized ASIC (Application-Specific Integrated Circuit), meaning it is designed specifically to run trained AI models — not to train them from scratch. This distinction matters: inference chips can be architected for maximum throughput and energy efficiency on a known workload, rather than the flexibility required during training.
Key details about the chip:
- Designed for large-scale inference workloads, targeting OpenAI's production API infrastructure
- Co-developed with Broadcom, one of the leading custom ASIC partners in the industry (also working with Google on its TPU line)
- Intended to be deployed inside OpenAI's own data centers, rather than sold commercially
- Complements, rather than replaces, existing Nvidia hardware in the near term
Why This Matters
Building custom silicon is expensive and technically demanding, but the economics become compelling at OpenAI's scale. The company reportedly serves hundreds of millions of users across ChatGPT and its API — meaning even marginal improvements in inference cost-per-token translate into enormous savings.
"This is about building the infrastructure for the next decade of AI," an OpenAI spokesperson noted in the announcement.
The move also reduces supply chain vulnerability. OpenAI's dependence on Nvidia has been a well-documented bottleneck — GPU shortages have constrained growth for AI companies across the board. A proprietary chip gives OpenAI more leverage and predictability.
The Broadcom Partnership
Broadcom has established itself as the go-to ASIC partner for hyperscalers looking to build custom AI accelerators. Its existing relationships with Google (TPUs) and Meta (MTIA) give it deep expertise in co-designing chips for ML inference at scale.
The partnership structure likely involves:
- OpenAI defining the architecture requirements and workload targets
- Broadcom handling the physical chip design and manufacturing coordination
- Production through a leading-edge fab — likely TSMC
What Comes Next
Jalapeno appears to be a first-generation effort — a foundation that OpenAI can iterate on as it accumulates real-world deployment data. Training chips, which are far more complex to design, are not part of this announcement.
With Microsoft, Google, Amazon, and Meta all running custom silicon programs, OpenAI's entry into the space was widely anticipated. The question now is how quickly Jalapeno can reach meaningful deployment scale — and whether future generations will challenge Nvidia's dominance more directly.



