Microsoft's Maia 100 looks to bring customers a cost-effective AI acceleration solution

Why it matters: Nvidia holds an estimated 75 to 90 percent of the AI chip market. Despite its market dominance, rivals have continued developing hardware and accelerators to chip away at the company's AI empire. Microsoft drew the interest of AI professionals and enthusiasts after outlining its design for a custom accelerator.

Microsoft introduced its first AI accelerator, Maia 100, at this year's Hot Chips conference. Its architecture pairs the chip with custom server boards, racks, and software to deliver cost-effective performance for AI workloads. Redmond designed the custom accelerator to run OpenAI models in Azure data center environments.

The chips are built on TSMC's 5nm process node and are provisioned as 500W parts but can support up to a 700W TDP. Microsoft says the design delivers high overall performance while efficiently managing the power draw of its targeted workloads. The accelerator also features 64GB of HBM2E, a step down from the Nvidia H100's 80GB and the B200's 192GB of HBM3E.

According to Microsoft's Hot Chips presentation and a recent blog post, the Maia 100 SoC architecture features a high-speed tensor unit (16xRx16) that offers rapid processing for training and inferencing while supporting a wide range of data types, including low-precision types such as Microsoft's MX format. It also has a loosely coupled superscalar vector processor built on a custom ISA that supports data types including FP32 and BF16, a Direct Memory Access engine that supports different tensor sharding schemes, and hardware semaphores that enable asynchronous programming.
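
For readers unfamiliar with BF16, one of the data types named above, the following is a minimal sketch of how it relates to FP32: a BF16 value keeps the sign bit, all 8 exponent bits, and the top 7 mantissa bits of the FP32 encoding. The function names are hypothetical and chosen for illustration, the code runs on an ordinary CPU in plain Python, and it says nothing about how Maia 100 performs the conversion internally. NaN handling is deliberately left out.

```python
import struct


def fp32_to_bf16_bits(x: float) -> int:
    """Return the 16-bit BF16 pattern for x (round-to-nearest-even).

    Illustrative helper, not part of any SDK. NaN inputs are not handled.
    """
    # Reinterpret the FP32 value as a 32-bit unsigned integer.
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    # Round to nearest, ties to even, before dropping the low 16 bits.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    return ((bits + rounding_bias) >> 16) & 0xFFFF


def bf16_bits_to_fp32(b: int) -> float:
    """Expand a 16-bit BF16 pattern back to an FP32 value."""
    # BF16 is the upper half of an FP32 word; the lower 16 bits become zero.
    (x,) = struct.unpack("<f", struct.pack("<I", (b & 0xFFFF) << 16))
    return x


if __name__ == "__main__":
    for value in (1.0, -2.0, 3.14159265):
        b = fp32_to_bf16_bits(value)
        print(f"{value!r} -> 0x{b:04X} -> {bf16_bits_to_fp32(b)!r}")
```

Because BF16 keeps the full FP32 exponent, it covers the same numeric range with less precision, which is why the round trip for 3.14159265 comes back as 3.140625.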

The Maia 100 AI accelerator also comes with the Maia SDK. The kit includes tools that let AI developers quickly port models previously written in PyTorch and Triton. The SDK includes framework integration, developer tools, two programming models, and compilers. It also has optimized compute and communication kernels and the Maia Host/Device Runtime, a hardware abstraction layer that handles memory allocation, kernel launches, scheduling, and device management.

Microsoft posted additional information on the SDK, Maia's backend network protocol, and optimization in its Inside Maia 100 blog post. It makes a good read for developers and AI enthusiasts.

Source: techspot.com
