pwshub.com

Bugs, performance issues hinder Huawei’s AI chips

Ascend's difficult ascent —

Export controls bar import of Nvidia chips, but homegrown alternative is struggling.

Bugs, performance issues hinder Huawei’s AI chips

Barcroft Media | Getty Images

China’s efforts to match US computing power in artificial intelligence are being hampered by bug-ridden software, with customers of leading AI chipmaker Huawei complaining about performance issues and the difficulty of switching from Nvidia products.

The Chinese technology giant has emerged as the frontrunner in the race to develop a domestic alternative to industry leader Nvidia, after Washington further tightened export controls on high-performance silicon last October.

Its Ascend series has become an increasingly popular option for Chinese AI groups to run inference, a process that applications such as OpenAI’s ChatGPT use to generate responses to queries.

But multiple industry insiders, including an AI engineer at a partner company, said the chips still lagged far behind Nvidia’s for the initial training of models. They blamed stability issues, slower inter-chip connectivity, and inferior software developed by Huawei called Cann.

Nvidia’s software platform, Cuda, is renowned as the company’s “secret sauce” for being easy for developers to use and capable of vastly accelerating data processing. Huawei is one of many companies trying to break Nvidia’s stranglehold on AI chips by creating alternative software.

Huawei’s own employees are among those complaining about Cann. One researcher, who declined to be named, said it made the Ascend product “difficult and unstable to use” and work on testing it was being hampered.

“When random errors occur, it is very difficult to find out where it comes from due to poor documentation. You need talented developers to read the source code to see what the issue is, which slows everything down. The coding is imperfect,” they said.

Another Chinese engineer briefed on Baidu’s use of the Huawei processors said the chips crashed frequently, complicating AI development work.

The Huawei researcher said crashes happened because it was difficult to use the hardware. “It is easy to get bad results because people don’t know much about the hardware itself,” they said.

To tackle the problem, Huawei has been sending engineers to help customers on site with transferring training code previously written on Cuda into Cann, according to multiple people familiar with the matter. Baidu, iFlytek, and Tencent are among the tech companies that have received teams of engineers, these people said.

Huawei declined to comment. Baidu, iFlytek, and Tencent did not respond to requests for comment.

A former Baidu employee said: “Huawei excels at customer service, so of course they have engineers on site at their big customers, helping them to use their chips.”

Huawei can leverage a huge workforce to accelerate the shift. According to the company, more than 50 percent of its 207,000 employees work in research and development, including the engineers dispatched to install technology for customers.

“Huawei’s advantage over Nvidia is it can work closely with its customers,” said technology analyst Tilly Zhang at consultancy Gavekal. “Unlike Nvidia, it has a large team of engineers to help solve clients’ problems and get them to transition to their hardware.”

Huawei has also set up an online portal for developers to give feedback on how its software can be improved.

After the US tightened export controls in October, Huawei raised the price of the Ascend 910B, its chip used for training, by 20 to 30 percent, according to people familiar with the matter.

Huawei’s customers have also expressed concern about supply constraints for the Ascend chip, likely due to manufacturing difficulties, with Chinese companies prevented from buying state-of-the-art chipmaking machinery from the Dutch company ASML.

Huawei has seen strong demand for its AI chips. It reported a 34 percent increase in first-half revenues on Thursday, without providing a breakdown of sales for its different businesses.

More than 50 foundational models have “been trained and iterated” on the Ascend chip, Huawei executive director Zhang Ping’an said at the World Artificial Intelligence Conference in Shanghai in July.

iFlytek has said its large language model has been trained exclusively on Huawei chips after Huawei sent a group of engineers to its headquarters in Hefei, eastern China, last year to integrate the technology.

© 2024 The Financial Times Ltd. All rights reserved. Not to be redistributed, copied, or modified in any way.

Source: arstechnica.com

Related stories
5 days ago - No love for months-long wait to fix this, either Security researchers have revealed a litany of failures in the Feeld dating app that could be abused to access all manner of private user data, including the most sensitive images not...
1 month ago - Now that's a TRACTOR pull request To accelerate the transition to memory safe programming languages, the US Defense Advanced Research Projects Agency (DARPA) is driving the development of TRACTOR, a programmatic code conversion vehicle.…
2 weeks ago - iOS 18 is nearing release along with new iPhone 16 models this month, so is it safe to install the latest public beta on your current iPhone?
1 week ago - The third iOS 18.1 developer beta brings Apple Intelligence, as well as several bug fixes.
1 month ago - Apple, in a surprise move, has released the first iOS 18.1 developer beta, which contains some of the new Apple Intelligence features for the iPhone.
Other stories
32 minutes ago - Experts at the Netherlands Institute for Radio Astronomy (ASTRON) claim that second-generation, or "V2," Mini Starlink satellites emit interference that is a staggering 32 times stronger than that from previous models. Director Jessica...
32 minutes ago - The PKfail incident shocked the computer industry, exposing a deeply hidden flaw within the core of modern firmware infrastructure. The researchers who uncovered the issue have returned with new data, offering a more realistic assessment...
32 minutes ago - Nighttime anxiety can really mess up your ability to sleep at night. Here's what you can do about it right now.
32 minutes ago - With spectacular visuals and incredible combat, I cannot wait for Veilguard to launch on Oct. 31.
32 minutes ago - Finding the perfect pair of glasses is difficult, but here's how to do so while considering your face shape, skin tone, lifestyle and personality.