pwshub.com

IBM to make Intel’s Gaudi 3 AI chip available in its public cloud

IBM Corp. today announced plans to make Intel Corp.’s Gaudi 3 artificial intelligence processor available in its public cloud platform.

Gaudi 3 is positioned as an alternative to Nvidia Corp.’s bestselling H100 graphics processing unit. The H100 was the chipmaker’s flagship AI accelerator before the March debut of its newest GPU, the Blackwell B200, which is significantly faster. Intel says that Gaudi 3 can perform inference with up to 2.3 times the power-efficiency of the H100, while some large language models take less time to train.

Gaudi 3 is the third iteration of an AI processor series that Intel obtained through a 2019 startup acquisition. The chip is based on Taiwan Semiconductor Manufacturing Co. Ltd.’s five-nanometer node. Gaudi 3’s processing power is provided by two sets of onboard computing modules, dubbed MMEs and TPCs, that are each optimized for a different set of tasks.

The chip’s MME modules are designed to perform matrix multiplications. Those are mathematical calculations that are carried out on collections of numbers organized in rows and columns, similarly to the fields in a spreadsheet. Artificial intelligence models use such calculations to turn input data into decisions. 

Certain AI models, such as those used for object recognition tasks, carry out most of their processing with matrix multiplications. More advanced forms of AI such as large language models also use other types of calculations. Gaudi 3’s TPC modules, the second type of computing circuit the chip includes, are optimized for those other calculations. 

TPCs are based on a so-called very long instruction word architecture. That’s a type of chip design optimized to carry out multiple computations in parallel. Because performing calculations side-by-side is faster than completing them one after another, Gaudi 3’s TPCs help speed up AI models’ performance. 

Gaudi 3 includes 64 TPCs, nearly three times as many as its predecessor. There are also four times as many MMEs, the compute modules optimized for matrix multiplications. The chip’s logic circuits are supported by a 120-gigabyte memory pool that offers higher clock speeds than the RAM in Intel’s previous-generation AI processor.

The company says that the upgrades introduced in Gaudi 3 boost its top speed to 1,835 TFLOPS, or trillion computations per second, when it processes BF16 data. That’s a data format commonly used by AI models to store information. 

The Gaudi 3’s increased performance is not its only selling point. There’s also an onboard Ethernet module that can be used to link together the Gaudi 3 processors in an AI server, as well as connect several such servers to one another. Intel has doubled the bandwidth of the individual Ethernet networking links in the chip to 200 gigabits per second.

IBM plans to make the Gaudi 3 available early next year in IBM Cloud Virtual Servers for VPC. Those are the compute instances the company provides in its public cloud platform. IBM will also add support for Gaudi 3 to its watsonx product suite, which includes software tools that companies can use to build AI models, deploy them in production and perform related tasks.

Source: siliconangle.com

Related stories
2 weeks ago - All eyes were on Nvidia’s earnings report this week as a proxy for the artificial intelligence economy, and even for the graphics chip giant, it was too much to live up to. Nvidia earnings disappointed, but really, how could they not?...
2 weeks ago - The rapid advancement in generative AI and graphics processing unit computing technologies has ushered in a new era of innovation and competition, but it’s also brought regulatory scrutiny, most recently for Nvidia AI chips. Nvidia Corp.,...
1 month ago - Amid a glut of funding for artificial intelligence companies, there’s understandably increasing concern among investors this past week, apparent in disappointment in the earnings results of a number of technology companies, whether all...
1 day ago - What does the a new gaming console contract mean for AMD investors? Uncover the potential financial impact and future benefits -- and what it all means for arch-rival Intel.
1 week ago - It’s no surprise that entrepreneurs with a pedigree like Ilya Sutskever’s can raise a billion dollars, as the OpenAI co-founder did this week for his startup, SSI. And he wasn’t alone, as Nvidia and others also invested in two other...
Other stories
56 minutes ago - (Bloomberg) -- Skechers U.S.A. Inc. shares delivered their worst daily performance since February after the footwear company’s chief financial officer told an industry conference that China sales will be under pressure the rest of the...
1 hour ago - The Fed's cutting cycle in 1995 sparked an economic boom, with the stock market more than doubling in value by the end of the decade.
1 hour ago - There's nothing like a potentially massive government contract to win the hearts of both investors and analysts.
3 hours ago - Shares of Truth Social’s parent company fell Thursday, extending the latest round of declines for Trump Media & Technology Group.
3 hours ago - European Union officials are taking new steps to ensure that Apple Inc. complies with the bloc’s DMA tech industry regulation. The European Commission, the EU’s executive arm, announced the initiative today. The DMA is a piece of...