Red Hat is enhancing its artificial intelligence capabilities with the introduction of Red Hat AI Enterprise. This new platform aims to simplify the deployment and management of AI models, agents, and applications within hybrid cloud settings.

Launching alongside the latest version of Red Hat AI and a co-engineered software platform, Red Hat AI Factory with Nvidia, these innovations are designed to move enterprise AI projects beyond the pilot phase. Red Hat states that many organizations struggle to scale AI initiatives due to fragmented tools and inconsistent infrastructure.

Red Hat AI Enterprise addresses this by unifying model and application lifecycles, enabling AI to be managed like traditional enterprise systems. The platform offers a "foundation for AI production," providing tools for AI inference, model tuning, customization, deployment, and management. It supports any AI model across cloud or on-premises environments, built on Red Hat's OpenShift cloud application platform.

This offering promises fast, scalable, and cost-effective AI inference, integrated lifecycle management, and flexible deployment options. Red Hat emphasizes operationalizing AI as a core component of enterprise software stacks.

The Red Hat AI Factory with Nvidia combines Red Hat's model management tools with Nvidia's accelerated computing software. This aims to simplify the management of both infrastructure and complex AI computing stacks, accelerating the transition from pilot to production AI. It handles provisioning and optimization of infrastructure for AI workloads and provides access to preconfigured AI models from IBM and Nvidia, inheriting Red Hat’s security and compliance features.

Red Hat AI 3.3, a significant upgrade to its existing platform, offers an expanded library of AI models, including compressed versions of Mistral-Large-3 and new foundational models. It also introduces a technology preview for Model-as-a-Service, facilitating self-service access to privately-hosted models via an API gateway. Additionally, generative AI support is being previewed on Intel CPUs for more cost-effective small language models.

New features include the Red Hat AI Python Index for enterprise-grade tools, on-demand GPU resources, and enhanced observability and security.