pwshub.com

Startup FuriosaAI debuts RNGD chip for LLM and multimodal AI inference

FuriosaAI Inc., a semiconductor startup that’s laser-focused on artificial intelligence, has unveiled a new accelerator chip it says is geared for large language models and multimodal AI.

Its new chip is called RNGD, pronounced “Renegade,” and it was unveiled at the Hot Chips 2024 conference in Stanford University today. It’s sampling to early access customers now, with broader availability slated for next year.

According to Furiosa, the RNGD chip is an extremely efficient data center accelerator that’s designed to support high-performance LLMs and multimodal model inference. The company is positioning it as an alternative to Nvidia Corp.’s graphics processing units.

RNGD is based on a Tensor Contraction Processor or TCP architecture, which the company says provides the perfect balance between efficiency, programmability and performance. It boasts some formidable specifications, with a Thermal Design Power of 150-watts, compared to more than 1,000 watts for some of the leading GPUs on the market today.

Furiosa also claims extremely high performance, with the chip packing 48 gigabytes of high-bandwidth memory. That makes it possible to run open-source LLMs such as Meta Platforms Inc.’s Llama 3.1 8B efficiently on a single card.

The RNGD chip was built on Taiwan Semiconductor Manufacturing Co.’s five-nanometer process and boasts a frequency of 1 gigahertz and 1.5 megabytes of memory bandwidth, with 256 megabytes of on-chip standard random-access memory and a PCIe Gen5 x16 interconnect that supports up to 64-gigabits-per-second throughput.

Programmability is enabled by a “robust compiler” that’s co-designed to be optimized for TCP-based chips, treating entire AI models as a single-fused operation. This means that the RNGD chips can be customized to run almost any LLM or multimodal AI workload, the company said.

What all of these numbers mean is that the Furiosa RNGD chip (pictured, adjacent) is extremely capable when it comes to running some of the best-known LLMs. Indeed, the startup claims some impressive results on industry standard benchmarks with models such as OpenAI’s GPT-J 6B, where it was able to process 15.13 queries per second.

Furiosa has a decent pedigree. It was founded in 2017 by three hardware and software engineers who previously worked for chipmaking giants such as Advanced Micro Devices Inc., Qualcomm Inc. and Samsung Electronics Co. Ltd.

Since its founding, the company has focused on a strategy of rapid iteration and product delivery. Its first-generation chip, known as Warboy, is a high-performance data center accelerator specifically designed for computer vision workloads that compares well with some of Nvidia’s older GPU designs in in the ResNet-50 image classification and SSD – MobileNetV1 object detection benchmarks.

Furiosa co-founder and Chief Executive June Paik revealed RNGD is the result of years of innovation by the startup. “RNGD is a sustainable and accessible AI computing solution that meets the industry’s real-world needs for inference,” he said. “With our hardware now running LLMs at full speed, we’re entering an exciting phase of continuous advancement.”

Featured image: SiliconANGLE/Microsoft Designer

Source: siliconangle.com

Related stories
2 weeks ago - All eyes were on Nvidia’s earnings report this week as a proxy for the artificial intelligence economy, and even for the graphics chip giant, it was too much to live up to. Nvidia earnings disappointed, but really, how could they not?...
1 month ago - Artificial intelligence-powered “scam detection” startup Scamnetic Inc. said it’s making its flagship platform available to software providers today. Customers will be able to integrate its platform into their software via an application...
1 month ago - (Bloomberg) -- Everyone involved agrees on the basic facts about March 8, 2022 — a venture capitalist and a startup co-founder spent a big night on the sidelines of a Miami conference. There, agreement ends. Cailin Hardell says the...
1 month ago - Startup Fabric Cryptography Inc., which sells chips optimized to run encryption algorithms, has raised $33 million in early-stage funding to support its product development efforts. Blockchain Capital and 1kx co-led the Series A...
1 week ago - A startup called Data.R.X LTD, which likes to be known as Datricks, has closed on a $15 million Series A funding round after developing a platform that uses artificial intelligence to try and uncover financial fraud at enterprises and...
Other stories
1 hour ago - YouTubers will soon be able to play with a host of new generative artificial intelligence-powered tools for creating content, including the ability to generate six-second YouTube Shorts clips, and backgrounds for their videos, using...
1 hour ago - Salesforce Inc. is making a major push to deploy AI agents on its CRM platform, an initiative the company views as the next step in enterprise artificial intelligence adoption. Building on its predictive Einstein platform for sales,...
1 hour ago - In a positive step forward and a possible sign of things to come, artificial intelligence video generation startup Runway AI Inc. has signed a deal with entertainment company Lions Gate Entertainment Corp. to explore the use of AI in...
1 hour ago - (Bloomberg) -- Asian equities braced for a tailwind from the Federal Reserve’s half-point rate cut and signs of further policy easing in the months ahead.Most Read from BloombergCalifornia’s Anti-Speeding Bill Can Be a Traffic Safety...
1 hour ago - (Bloomberg) -- US equities will climb through the rest of the year with the Federal Reserve’s aggressive interest-rate cut bolstering the chances of a soft landing for the economy, according to a survey of Bloomberg Terminal...