OpenAI and Broadcom Unveil Jalapeño, a Custom AI Inference Chip
OpenAI and Broadcom jointly announced Jalapeño on June 24, 2026, marking OpenAI's first custom Application-Specific Integrated Circuit (ASIC) designed exclusively for large language model inference. The chip pairs a reticle-sized compute chiplet with High-Bandwidth Memory, placing storage physically close to processing units to reduce the data-movement delays that typically bottleneck token generation. Unlike general-purpose GPUs, which handle training, graphics, and diverse workloads, Jalapeño is hardwired solely for inference, trading versatility for greater energy efficiency. Early testing indicates substantially improved performance-per-watt compared to GPUs, though final figures are still being measured. The design went from conception to tape-out in approximately nine months, which OpenAI describes as one of the fastest chip development cycles it has undertaken.
This is an AI-generated summary. ShortSingh links to the original source for the complete article.
Discussion (0)
Log in to join the discussion and vote.
Log in