Multimodal AI
4 stories
-
techGoogle Unveils Gemini Omni: New AI Model Family for Video Generation
Google DeepMind introduces Gemini Omni, a multimodal AI model for video generation and editing, integrated into YouTube Shorts.
-
techGoogle Unveils Gemini Omni, Its First Native Multimodal AI Model for Enterprise
Google launches Gemini Omni at I/O 2024: a native multimodal AI processing video, audio, images, and text from the architecture level.
-
techNvidia Unveils Nemotron 3 Nano Omni for Advanced Agentic AI
Nvidia launches Nemotron 3 Nano Omni, a multimodal AI model unifying text, vision, and speech for faster, smarter agentic AI applications.
-
techOpenAI's Sora Video Tool Set for ChatGPT Integration
OpenAI is reportedly planning to integrate its advanced AI video generator, Sora, directly into the ChatGPT platform, signaling a significant expansion into multimodal AI capabilities.