pwshub.com

Cloudera AI Inference service boosts scalable AI with Nvidia NIM - SiliconANGLE

As artificial intelligence drives faster insights and real-time decision-making across the enterprise, the Cloudera AI Inference service, designed to operationalize machine learning at scale, is gaining traction.

To boost large language model performance and the private deployment of models, the Cloudera AI Inference service uses Nvidia NIM microservices and accelerated computing, according to Priyank Patel (pictured), vice president of artificial intelligence and machine learning at Cloudera Inc.

Priyank Patel, vice president of artificial intelligence and machine learning at Cloudera Inc., talks to theCUBE during Cloudera Evolve24 about how the Cloudera AI Inference service enables fast deployment of models.

Cloudera’s Priyank Patel talks to theCUBE about the transformative power of the Cloudera AI Inference service.

“What we are integrating is the software stack that the Nvidia team has built out, something called NIM — NIM microservices,” Patel stated. “It’s an integrated hardware-software layer that sits above their [graphics processing units]. We learned more of what goes into the NIM, and that really formed the basis of the Cloudera AI Inference service. It’s the model serving offering from Cloudera that works anywhere on public clouds as well as on-premises and fundamentally enables our customers and enterprises to have private endpoints for AI to be able to build and run AI privately.”

Patel spoke with theCUBE Research’s Bob Laliberte and co-host Rebecca Knight at the Cloudera Evolve24 event during an exclusive broadcast on theCUBE, SiliconANGLE Media’s livestreaming studio. They discussed how the Cloudera AI Inference service enables fast deployment of models. (* Disclosure below.)

Delving deeper into the Cloudera AI Inference service

Given that data is growing exponentially, broadening AI solutions with products such as the Cloudera AI Inference service is important. The solution helps enhance user experience, scalability and operational efficiency, Patel pointed out. 

“AI with Cloudera is about us building the best platform for our customers to build their AI applications with,” he noted. “AI in Cloudera is about us infusing AI within our platform without our customers ever needing to know about it, and that means there are dozens of teams internally within our organization who are building the copilots, the assistants and the capabilities that would ease the regular day-to-day user of the Cloudera platform. Cloudera manages a significant amount of data estate both on-premise and the cloud.” 

Making developers’ lives easier is top of mind for enterprises. As a result, AI fits into the picture since it transforms developers’ work through enhanced collaboration, improved productivity and automated code generation, according to Patel.

“When we started out two years ago, the core competence of actually building these AI systems was with the data science teams, the AI teams [and] the machine-learning teams because that’s the technology evolution of these deep learning networks,” he said. “As it has progressed to now, we see and … internally use the term gen AI builders, intentionally not calling them developers [or] scientists because we think that there is a simplification of the skill set and up-leveling of skill set that has gone through in the industry.”

Here’s the complete video interview, part of SiliconANGLE’s and theCUBE Research’s coverage of the Cloudera Evolve24 event

(* Disclosure: Cloudera Inc. sponsored this segment of theCUBE. Neither Cloudera nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE

Source: siliconangle.com

Related stories
21 hours ago - It has been a little over a year since Charles Sansbury (pictured) was appointed chief executive officer of Cloudera Inc., and it didn’t take him long to figure out that the AI experience would be his customers’ and his company’s focus...
2 hours ago - Cloudera Inc. has taken a team approach by expanding its enterprise AI ecosystem to include a diverse range of industry-leading AI providers, building a portfolio of enterprise AI solutions for its customers. The data management and...
7 hours ago - Data complexity is the primary pain point as companies make the jump to generative artificial intelligence. Data unification simplifies this complexity by reducing data silos and providing organizations with comprehensive insights. “I...
1 day ago - Amdocs Group Company, which provides software services for telecommunication and media service providers, is charging into the age of artificial intelligence. The company has not only incorporated AI into its software, but uses it to...
1 day ago - Despite cloud’s growing dominance, the ongoing artificial intelligence boom is giving rise to hybrid data management. Cloudera Inc. looks to capitalize on this trend by bringing data to customers on-premises and in the cloud, as well as...
Other stories
6 minutes ago - SoFi Technologies (SOFI) has positioned itself as one of the most exciting fintech companies, offering a wide range of services and products that many traditional banks struggle to match. While the stock has declined by about 10% this...
6 minutes ago - An ongoing strike by Boeing's biggest union, the International Association of Machinists and Aerospace Workers (IAM), is proving costly on several fronts for the company.
1 hour ago - (Reuters) -U.S. planemaker Boeing will cut 17,000 jobs, or 10% of its global workforce, delay first delivery of its 777X jet by a year and expects substantial new losses in its defense business as a month-long strike batters company...
1 hour ago - S&P 500 closes above 5,800 for first time, Dow notches fresh record as bank earnings impressThe Dow Jones Industrial Average (^DJI) rose nearly...
1 hour ago - Are you looking for reliable income stocks to add to your portfolio this month? Dividend Aristocrats – companies with at least 25 consecutive years of dividend growth – offer some of the most compelling opportunities due to their...