multimodal
2 stories
-
techOpenAI's ChatGPT Can Now See, Hear, and Complete Your Paperwork
ChatGPT's new voice and image features turn the AI into a multimodal personal assistant capable of processing documents in real time.
-
techThinking Machines Unveils Real-Time AI Model for Humanlike Interaction
Startup founded by ex-OpenAI CTO Mira Murati unveils interaction model with sub-0.4 second latency, enabling real-time duplex communication.