OpenAI Jalapeño: The First Custom AI Chip Built in Nine Months Using Its Own Models to Design Itself

Started by HardyBoy13, Yesterday at 05:59 PM

Previous topic - Next topic

0 Members and 1 Guest are viewing this topic.

Topic: OpenAI Jalapeño: The First Custom AI Chip Built in Nine Months Using Its Own Models to Design Itself   Views(Read 86 times)

HardyBoy13

OpenAI and Broadcom unveiled Jalapeño on June 24, OpenAI's first custom Application-Specific Integrated Circuit designed exclusively for large language model inference. The chip was developed in nine months, a timeline OpenAI describes as the fastest ASIC development cycle in advanced semiconductor history for a chip at this performance level. OpenAI's own AI models assisted in the design process, creating a recursive self-improvement loop: the same systems that people query through ChatGPT helped design the hardware that will soon run them. The chip is built on TSMC's 3-nanometre process node and targets approximately 50 percent lower inference cost per token compared to Nvidia GPU alternatives, though these figures are self-reported and pre-production.

Jalapeño targets inference specifically rather than training. Training is the one-time, weeks-long process of teaching a model. Inference is what happens billions of times every day as ChatGPT, Codex and the API respond to user queries. OpenAI's highest ongoing costs live in inference and every percentage improvement in inference efficiency compounds across the enormous query volume. Greg Brockman described the strategic logic directly: by designing more of the stack themselves, OpenAI can serve more intelligence with greater efficiency. The deployment plan starts small, with engineering samples confirmed running workloads at production target frequency and power, and scales toward a 10-gigawatt infrastructure commitment through 2029 with Microsoft guaranteeing 40 percent of initial chip output.

Broadcom CEO Hock Tan described compute demand from his company's six hyperscale customers as simply insatiable, saying it extends not just through 2026 and 2027 but with even elevated demand visible in 2028. For OpenAI specifically, Jalapeño arrives ahead of an anticipated IPO and offers investors the first credible signal that the company has a path toward profitability: if inference costs fall meaningfully at scale, the economics of the entire operation shift.