pwshub.com

Mistral introduces Ministral 3B and 8B on-device AI computing models

Mistral AI, a Paris-based artificial intelligence startup, today introduced two new AI large language models, Ministral 3B and 8B, designed for on-device and edge computing thanks to their small size.

The company called this new model family “les Ministraux,” for its rating in the sub-10 billion-parameter category, which makes them small enough to run on platforms such as smartphones, tablets and internet of things devices. Mistral said the new frontier models can be tuned for common use cases including specialist tasks and work as AI agents via function-calling capabilities.

Customer and partners have increasingly been asking for “local, privacy-first inference for critical applications such as on-device translation, internet-less smart assistants, local analytics, and autonomous robotics,” the company said in the announcement. Les Ministraux is aimed at providing a compute-efficient and low-latency solution for those scenarios.

These smaller AI models can be used to moderate larger models, such as Mistral Large, as intermediaries in multistep workflows to handle input parsing, task routing and application calling to reduce costs.

The company said both models support a context length of up to 128,000 tokens, which puts them in line with OpenAI’s GPT-4 Turbo for how much data can be input. Ministral 8B also comes with a special “sliding window attention pattern,” which allows faster and more memory-efficient deployment.

The release of Ministral 3B and 8B comes a year after the release of Mistral 7B, an LLM that the company touted as a significant advancement in model architecture. The 8B and 3B regards the number of parameters in both models, 8 billion and 3 billion, and the company says the smallest model, Ministral 3B, already outperforms Mistral 7B in most benchmarks.

According to benchmarks, pretrained Ministral 3B beat Google LLC’s Gemma 2 2B and Meta Platforms Inc. Llama 3.2 3B models in the Multi-task Language Understanding evaluation with a score of 60.9 compared to 52.4 and 56.2, respectively. Ministral 8B also outperformed Llama 8B with a 65.0 score compared with 64.7.

The Ministraux model family closely follows Mistral’s introduction of Pixtral 12B last month, an advanced AI model that’s the first of the company’s models capable of vision encoding, making it possible to process both images and text.

Source: siliconangle.com

Related stories
1 month ago - Artificial intelligence code completion tool provider Tabnine Ltd. today introduced a new more intuitive way for developers to complete AI-assisted coding tasks directly in the editor with inline actions that work directly on selected...
1 month ago - This was the week that Apple finally infused artificial intelligence into its new iPhones, Watches and AirPods, though some of features won’t be coming for a bit and overall, the AI stuff seemed a little underwhelming. The medical...
1 week ago - SAP SE today debuted a new version of Joule, an artificial intelligence assistant that ships with many of its business applications. The company is also upgrading the developer tools that customers use to extend its software with custom...
6 days ago - Dell Technologies Inc. today launched five new PowerEdge servers using Advanced Micro Devices Inc.’s 5th Generation EPYC processors and targeted at artificial intelligence development and model deployment. The announcements represent “a...
1 month ago - Mistral AI, a Paris-based artificial intelligence startup, today unveiled its latest advanced AI model capable of processing both images and text. The new model, called Pixtral 12B, employs about 12 billion parameters and is the first of...
Other stories
2 minutes ago - Clerk Chat Inc., a started building a messaging platform that integrates with major telecommunications carriers, today said it is very $7 million in seed funding. The San Francisco-based firm plans to use the cash to improve its...
2 minutes ago - Artificial intelligence is taking center stage in the evolution of cybersecurity, as CrowdStrike Inc. revealed new innovations designed to unify, automate and streamline end-to-end protection. These AI in cybersecurity enhancements span...
3 minutes ago - Amazon.com Inc. today introduced the Kindle Colorsoft Signature Edition, its first e-reader with a color screen. The device debuted at a New York product event alongside upgrades to three existing Kindle devices. Amazon unveiled a new...
9 minutes ago - Nvidia's earnings report will once again be the biggest announcement this earnings season.
9 minutes ago - Lithium Americas and General Motors have formed a joint venture to extract lithium from the Thacker Pass site in Nevada.