Infrastructure advancements tackling the memory wall in AI development

Advances in infrastructure, particularly the integration of artificial intelligence infrastructure with memory technology, are reshaping the data landscape.

The discussions at HPE Discover 2024 highlighted the industry’s relentless pursuit to overcome the memory wall, a challenge arising from the imbalance between CPU/GPU capabilities and memory performance. These developments are crucial in supporting the burgeoning demands of AI and large language models, paving the way for more efficient and powerful computing solutions, according to Alan Walker (pictured, right), senior director of sales at Samsung Semiconductor Inc.

Samsung Semiconductor’s R.C. Hurbanis and Alan Walker talk to theCUBE about AI infrastructure advancements.

“When we think about the traditional memory pyramid with the cache memory at the top, and then your system memory storage underneath, that pyramid is now growing several layers and becoming a much larger pyramid,” Walker said. “At the top, we have high bandwidth memory…under the memory itself, we’re now adding additional capabilities…and then even under that, you now have multiple different types of SSDs…all of that is to help us solve what we’re calling the memory wall.”
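The imbalance Walker describes can be made concrete with a back-of-the-envelope roofline estimate. The sketch below is illustrative only: the peak compute and memory bandwidth figures are hypothetical round numbers, not specifications of any Samsung, HPE or other product, and the function name is made up for this example. It shows that a workload performing few operations per byte moved is capped by memory bandwidth rather than by the processor.

```python
def attainable_flops(peak_flops: float, mem_bandwidth: float, intensity: float) -> float:
    """Roofline bound on throughput in FLOP/s.

    peak_flops    -- peak compute rate of the processor (FLOP/s)
    mem_bandwidth -- sustained memory bandwidth (bytes/s)
    intensity     -- arithmetic intensity of the workload (FLOP per byte moved)
    """
    # Throughput is limited by whichever is lower: the compute ceiling
    # or the rate at which memory can feed the processor.
    return min(peak_flops, mem_bandwidth * intensity)


# Hypothetical accelerator, chosen only to keep the arithmetic easy to follow.
PEAK_FLOPS = 100e12     # 100 TFLOP/s of compute (assumed)
MEM_BANDWIDTH = 1e12    # 1 TB/s of memory bandwidth (assumed)

for intensity in (1.0, 10.0, 100.0, 1000.0):
    bound = attainable_flops(PEAK_FLOPS, MEM_BANDWIDTH, intensity)
    limiter = "memory" if bound < PEAK_FLOPS else "compute"
    print(f"{intensity:7.1f} FLOP/byte -> {bound / 1e12:6.1f} TFLOP/s ({limiter}-bound)")
```

Under these assumed numbers, any workload below 100 FLOP per byte leaves compute idle while it waits on memory. Raising bandwidth or moving data closer to the processor lifts that ceiling.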

Walker and R.C. Hurbanis (left), senior manager of Device Solutions Americas business enablement at Samsung Semiconductor, spoke with theCUBE Research’s Dave Vellante and Rebecca Knight at HPE Discover, during an exclusive broadcast on theCUBE, SiliconANGLE Media’s livestreaming studio. They discussed AI infrastructure integration, advancements in memory technology and future developments in memory capacity and bandwidth. (* Disclosure below.)

How AI infrastructure advancements are changing memory technology

The future of memory technology is promising, driven by continuous innovations. These advancements aim to meet the growing demands of AI and large-scale data processing, Hurbanis noted.

“One of the challenges is HBM is difficult to make because it’s stacking a bunch of layers of silicon on top of one another and connecting them, that’s challenging to do,” he said. “That takes some very special technology to do … we’re not there. A big part of what the industry is doing is spending CapEx to increase … the capabilities to do that. That’s also why it’s going to take a little bit of time for the industry to catch up.”

AI’s growing demand for higher memory capacity and bandwidth is shaping industry priorities, Hurbanis explained. The focus remains on developing cost-effective and energy-efficient solutions.

“There’s new technologies that we’re working on, we’ve mentioned a few earlier…these are going to come into play to help ultimately also bring down that memory wall,” he said. “Once those guys come out with the next generation of their chips, then the industry starts again and we start attempting to improve the infrastructure and address whatever other bottleneck might come to be.”

(* Disclosure: TheCUBE is a paid media partner for HPE Discover. Neither Hewlett Packard Enterprise Co. and Intel Corp., the primary sponsors of theCUBE’s event coverage, nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE

Source: siliconangle.com
