OpenAI and Broadcom announce a chip designed for LLM inference at scale
OpenAI unveiled its first custom silicon, co-designed with Broadcom and aimed squarely at running inference for its own models rather than training. It's an attempt to wean off Nvidia and own more of the cost curve as serving demand explodes.