4 stories tagged #Multimodal AI

  1. Google Unveils Gemini Omni: New AI Model Family for Video Generation
    tech

    Google DeepMind introduces Gemini Omni, a multimodal AI model for video generation and editing, integrated into YouTube Shorts.

    last wk. 1 min read
  2. Google Unveils Gemini Omni, Its First Native Multimodal AI Model for Enterprise
    tech

    Google launches Gemini Omni at I/O 2024: a natively multimodal AI model that processes video, audio, images, and text at the architecture level.

    2w ago 1 min read
  3. Nvidia Unveils Nemotron 3 Nano Omni for Advanced Agentic AI
    tech

    Nvidia launches Nemotron 3 Nano Omni, a multimodal AI model unifying text, vision, and speech for faster, smarter agentic AI applications.

    last mo. 1 min read
  4. OpenAI's Sora Video Tool Set for ChatGPT Integration
    tech

    OpenAI is reportedly planning to integrate its advanced AI video generator, Sora, directly into the ChatGPT platform, signaling a significant expansion into multimodal AI capabilities.

    3mo ago 1 min read