pwshub.com

Ideogram Releases New AI Image Model to Compete With Flux, Midjourney

Ideogram AI unveiled Ideogram 2.0 on Wednesday. The next generation of its text-to-image model aims to challenge the dominance of established players in the generative AI space.

The release comes just days after the much-anticipated implementation of Flux.1 as the primary image generator for Grok on X (aka Twitter), a move that has solidified Flux.1's position as a powerful and versatile contender in the post-Stable Diffusion XL (SDXL) era. Open models vying for supremacy include Auraflow, KwaiKolors, Hunyuan, Lumina, and Kandinsky 3.

"Ideogram 2.0 significantly outperforms other text-to-image models across many quality metrics, including image-text alignment, overall subjective preference, and text rendering accuracy," the company said in its official announcement.

Alongside the new model, Ideogram also launched a new set of features to make their whole suite more competitive. These include an iOS app and an API for developers.

Founded by former Google alumni, Ideogram has long been recognized for its pioneering work in incorporating text generation capabilities into its image models. It was the first model to do it, alongside a lesser-known experiment from Stability AI called Deep Floyd IF.

Image generated by Ideogram using Ideogram 2.0
Image: Ideogram

With the release of Ideogram 2.0, the company has increased the overall quality of its model’s outputs, making it faster, more capable, and versatile thanks to a new option for five different presets: realism, drawing, 3D, anime, and a general purpose implementation.

The update also introduces a color palette to give users more control over the aesthetics and the composition.

The “Realistic” style in Ideogram 2.0 enables users to create images that look like real photographs. “Textures are significantly enhanced, and human skin and hair appear lifelike,” Ideogram says. On the other hand, the “Design” preset focuses on accurate and artistic text generation. “This enables you to create premium graphic designs for greeting cards, print on demand, posters, illustrations, and marketing and social media content with long, stylized text.” the announcement reads.

Besides these two styles, the “3D” preset focuses on generating images that mimic a computer render, the “Anime” preset stands as a strong competitor against MidJourney’s Niji style for Hapanese manga-inspired creations, and the “General” preset is a one-size-fits-all versatile setting that will adapt the output to the prompt.

Initial reactions from social media users have been pretty positive overall, with many sharing their Ideogram-generated creations that showcase the model's remarkable abilities in realism and the rendering of famous personalities. Our first tests were satisfactory, particularly when utilizing the "Realism" preset, which at first glance seems to match the performance of Flux.1.

Images generated by Decrypt using the same prompt on Ideogram and Flux Schnell NF4 4 Steps

However, this may not be the best one for power users who want to test it for free. The free version of Ideogram 2.0 comes with a daily limit of 20 images (five batches of four images), with paid plans starting at $8 per month, and an unlimited slow generations plan priced at $20 per month. However, it is still competitive against MidJourney, which asks for $10 for its lowest tier and $30 monthly for unlimited slow generations.

Ideogram's offerings are positioned as a more accessible alternative to MidJourney, as the model's natural language processing capabilities allow for a more intuitive and streamlined prompt experience similar to what ChatGPT offers with Dall-E 3, and in contrast to MidJourney's reliance on the traditional "SDXL" prompting style with specific keywords and commands.

If money is not an issue, users may want to evaluate features over output quality, as both models are pretty competitive. MidJourney offers a very powerful personalization feature that lets users create their own style. It also has a pretty capable image editor that makes it possible to tweak generations with pretty high levels of control.

In contrast, Ideogram 2.0 gives users a lot of control over their generations without having to rely on prompt engineering or additional tools like Style Transfer, LoRAs or IPAdapter. The color palette options and presets may be a great way to get personalized results, especially for new users.

Edited by Ryan Ozawa.

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

Source: decrypt.co

Related stories
1 month ago - Google’s Imagen 3 is available for testing on ImageFX, and is set to compete against MidJourney, Flux, or Dall-e 3, and future updates from Ideogram, MidJourney and the Open Model Initiative.
1 month ago - Flux is an advanced, open-source text-to-image model with 12 billion parameters. We compare it to three top contenders, and one comes out on top.
1 month ago - Grok 2 promises great performance when compared to the best LLMs in the industry—and may be the best image generators in specific cases.
1 month ago - MidJourney, the AI-powered image generation platform, has recently unveiled a major update to its web-based image editor.
1 month ago - Toncoin (TON), the cryptocurrency associated with the popular messaging app Telegram, is making headlines as its Masterchain achieves a record-breaking 946 active addresses per day. This achievement has driven a nearly 9% increase in...
Other stories
5 minutes ago - Bitcoin's bull pennant suggests a potential surge, with a price target of $158,000 by May 2025 following S&P 500's record high. The post Bitcoin enters ‘bull pennant’ breakout as S&P 500 hits all-time high appeared first on Crypto Briefing.
43 minutes ago - Finance giant BlackRock is saying that Bitcoin (BTC) is a unique investment opportunity offering investors something much different than traditional assets. BlackRock, a firm with over $10 trillion in assets under management, says in a...
1 hour ago - A new XRP price prediction by popular crypto pundit Egrag Crypto projects that the cryptocurrency could experience a historical price surge of about 9,468%. This predicted price gain would push XRP from its current price of $0.58 to $27,...
1 hour ago - Dookey Dash: Unclogged is the new free-to-play version of last year's NFT-gated game, with a $1 million competition for top players.
2 hours ago - So far, Bitcoin has seen a mixture of bulls and bears just in the past day alone. Although bulls appear to be taking the lead given its current market performance, will this be sustainable? Before the US Federal Reserve announcement of...