Tech titan Google claims that the supercomputers it uses to train its AI models are faster and "greener" (i.e., more power-efficient) than comparable systems built on chips from multinational tech firm Nvidia. The claims come after the Alphabet unit released new details about its supercomputers in a scientific paper.
Titled "TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings," the paper describes the fourth generation of Google's Tensor Processing Unit (TPU), the company's custom chip. According to the company, these chips are used for over 90% of its AI model training – feeding models data to make them effective at tasks such as generating images or responding to queries with human-like text. The fourth-generation TPU is also Google's fifth domain-specific architecture (DSA) and its third supercomputer for such ML models.
Google released the scientific paper on Tuesday, highlighting how it connected more than 4,000 TPUs to create a supercomputer. The company also developed custom optical switches to help connect the individual machines. AI models such as the company's PaLM model are then split across thousands of chips, which must work together for weeks or more to train.
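To make the idea of splitting a model across many chips concrete, here is a toy sketch. It is purely illustrative and not Google's actual code: the function name, the chip count, and the flat parameter list are all hypothetical stand-ins for what a real system (operating over thousands of TPUs) would do with sharded tensors.

```python
# Illustrative sketch only: evenly splitting a model's parameters into one
# shard per accelerator chip, the basic idea behind training a large model
# across thousands of TPUs. All names and sizes here are hypothetical.

def shard_parameters(params, num_chips):
    """Split a flat list of parameters into one contiguous shard per chip."""
    shard_size, remainder = divmod(len(params), num_chips)
    shards, start = [], 0
    for chip in range(num_chips):
        # The first `remainder` chips take one extra parameter each,
        # so every parameter is assigned to exactly one chip.
        end = start + shard_size + (1 if chip < remainder else 0)
        shards.append(params[start:end])
        start = end
    return shards

# A toy "model" of 10,000 parameters spread across 8 chips:
model = list(range(10_000))
shards = shard_parameters(model, 8)
print([len(s) for s in shards])  # each chip holds 1,250 parameters
```

In a real system each shard would live in a chip's high-bandwidth memory, and the interconnect (optical switches, in TPU v4's case) carries the activations and gradients the shards must exchange every training step.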
A spokesperson for Nvidia declined to comment on the matter.
"Circuit switching makes it easy to route around failed components," Google Fellow Norm Jouppi and Google Distinguished Engineer David Patterson wrote about the system. "This flexibility even allows us to change the topology of the supercomputer interconnect to accelerate the performance of an ML (machine learning) model."
For comparably sized systems, Google's supercomputer is up to 1.7 times faster and 1.9 times more power-efficient than a system based on Nvidia's A100 chip, which was on the market at the same time as the fourth-generation TPU. The company refrains from making a direct comparison between the TPU v4 and Nvidia's current flagship H100 chip, since the H100 arrived on the market after Google's chip and is made using newer technology.
Google added that its supercomputer is four times larger at 4,096 chips, while also using 1.3 to 1.9 times less power than the Nvidia A100. Inside the energy-optimized warehouse-scale computers of Google Cloud, TPU v4s also use nearly three times less energy and produce nearly 20 times less CO2e than contemporary DSAs in a typical on-premises data center.