Google in Talks With Marvell to Build New AI Chips for Inference

The Information

Google is in talks with Marvell Technology to develop two new chips aimed at running AI models efficiently, according to two people with direct knowledge of the discussions. One is a memory processing unit designed to work alongside Google’s tensor processing unit (TPU). The other is a new TPU built specifically for inference, that is, for running AI models. The moves underscore surging demand for inference chips, which run the AI models that power commercial products such as autonomous agents. At its GTC conference in March, Nvidia released a chip designed to improve the efficiency of inference workloads.