pwshub.com

Mistral unveils Pixtral 12B, a multimodal AI model that can process both text and images

Mistral AI, a Paris-based artificial intelligence startup, today unveiled its latest advanced AI model capable of processing both images and text.

The new model, called Pixtral 12B, employs about 12 billion parameters and is the first of its models capable of vision encoding, making it possible for it to “see” images alongside text.

The new model is based on Mistral’s Nemo 12B, an AI model previously released by the company capable of understanding text, with the addition of a 400 million-parameter vision adapter. The adapter allows users to add images through URLs or encode them via base64 within the inputted text.

Many other AI large language models have also added multimodal capabilities that allow users to input images such as Anthropic PBC’s Claude family, OpenAI’s GPT-4o and Google LLC’s Gemini. The addition of image reasoning capabilities to Pixtral 12B should provide it the ability similarly to answer questions about images, provide captioning, count objects and more.

The company released the parameters and code via a torrent link on GitHub and the AI distribution platform Hugging Face. The company has encouraged developers to start downloading and using it.

Now that the model is available for download, developers will be able to fine-tune and train the model for their own purposes. The company offers some of its models open-source under the Apache 2.0 license without restrictions. For others, Mistral offers a dev license that is free for development, but requires a paid license for commercial applications, but not for research uses. The company has not clarified what license Pixtral 12B will fall under.

 Sophia Yang, head of Mistral developer relations, said in a post on X, that the model will soon be available for testing on Mistral’s chatbot and application programming interface platforms, Le Chat and Le Platforme.

Source: siliconangle.com

Related stories
1 month ago - This was the week that Apple finally infused artificial intelligence into its new iPhones, Watches and AirPods, though some of features won’t be coming for a bit and overall, the AI stuff seemed a little underwhelming. The medical...
4 hours ago - Mistral AI, a Paris-based artificial intelligence startup, today introduced two new AI large language models, Ministral 3B and 8B, designed for on-device and edge computing thanks to their small size. The company called this new model...
1 month ago - Artificial intelligence code completion tool provider Tabnine Ltd. today introduced a new more intuitive way for developers to complete AI-assisted coding tasks directly in the editor with inline actions that work directly on selected...
1 month ago - Cisco Systems Inc. said Monday it’s finalizing the acquisition of an artificial intelligence-focused security startup called Robust Intelligence Inc. for an undisclosed price. Robust Intelligence has created a platform that’s designed to...
1 month ago - Two more tech giants may join the new funding round that OpenAI is rumored to be raising. Citing sources familiar with the matter, Bloomberg reported today that Nvidia Corp. and Apple Inc. may participate in the investment. OpenAI is...
Other stories
16 minutes ago - Enterprise software-as-a-service startup Everstage Inc. announced today that it had raised $30 million in new funding to increase product innovation and elevate its customer experience. Founded in 2020, Everstage offers a sales...
16 minutes ago - Machine learning in drug discovery, along with artificial intelligence, is transforming the pharmaceutical industry by accelerating the development of new treatments. Historically, the process of discovering and developing new drugs has...
25 minutes ago - Taiwan Semiconductor Manufacturing Co, the dominant producer of advanced chips used in artificial intelligence applications, is expected to report a 42% leap in third-quarter profit on Thursday thanks to soaring demand. TSMC is set to...
25 minutes ago - (Reuters) -Kinder Morgan fell short of Wall Street estimates for third-quarter profit on Wednesday and lowered its annual forecast as the U.S. pipeline operator contends with weaker commodity prices and lower crude volumes. Shares of the...
1 hour ago - Startup LatticeFlow AG today released COMPL-AI, a framework that can help companies check whether their large language models comply with the EU AI Act. Zurich-based LatticeFlow is backed by more than $14 million in venture funding. It...