2 stories tagged #multimodal

  1. OpenAI's ChatGPT Can Now See, Hear, and Complete Your Paperwork
    tech

    OpenAI's ChatGPT Can Now See, Hear, and Complete Your Paperwork

    ChatGPT's new voice and image features turn the AI into a multimodal personal assistant capable of processing documents in real time.

    last wk. 1 min read
  2. Thinking Machines Unveils Real-Time AI Model for Humanlike Interaction
    tech

    Thinking Machines Unveils Real-Time AI Model for Humanlike Interaction

    Startup founded by ex-OpenAI CTO Mira Murati unveils interaction model with sub-0.4 second latency, enabling real-time duplex communication.

    3w ago 2 min read