pwshub.com

Google AI reintroduces human image generation after historical accuracy outcry

Oh, the humanity! —

Ars testing shows some historical prompts no longer generate artificially diverse scenes.

  • Imagen 3's vision of a basketball-playing president is a bit akin to the Fresh Prince's Uncle Phil.

    Google / Ars Technica

  • Asking for images of specific presidents from Imagen 3 leads to a refusal.

    Google / Ars Technica

Google's Gemini AI model is once again able to generate images of humans after that function was "paused" in February following outcry over historically inaccurate racial depictions in many results.

In a blog post, Google said that its Imagen 3 model—which was first announced in May—will "start to roll out the generation of images of people" to Gemini Advanced, Business, and Enterprise users in the "coming days." But a version of that Imagen model—complete with human image-generation capabilities—was recently made available to the public via the Gemini Labs test environment without a paid subscription (though a Google account is needed to log in).

That new model comes with some safeguards to try to avoid the creation of controversial images, of course. Google writes in its announcement that it doesn't support "the generation of photorealistic, identifiable individuals, depictions of minors or excessively gory, violent or sexual scenes." In an FAQ, Google clarifies that the prohibition on "identifiable individuals" includes "certain queries that could lead to outputs of prominent people." In Ars' testing, that meant a query like "President Biden playing basketball" would be refused, while a more generic request for "a US president playing basketball" would generate multiple options.

In some quick tests of the new Imagen 3 system, Ars found that it avoided many of the widely shared "historically inaccurate" racial pitfalls that led Google to pause Gemini's generation of human images in the first place. Asking Imagen 3 for a "historically accurate depiction of a British king," for instance, now generates a set of bearded white guys in red robes rather than the racially diverse mix of warriors from the pre-pause Gemini model. More before/after examples of the old Gemini and the new Imagen 3 can be found in the gallery below.

  • Imagen 3's imagining of some stereotypical popes...

    Google Imagen / Ars Technica

  • ...and the pre-pause Gemini's version.

  • Imagen's imaginings of an 1800s Senator...

    Google Imagen / Ars Technica

  • ...and pre-pause Gemini's. The first woman was elected to the Senate in the 1920s.

  • Imagen 3's version of Scandinavian ice fishers...

  • ...and the pre-pause Gemini's version.

  • Imagen 3's version of an old Scottish couple...

    Google Imagen / Ars Technica

  • ...and the pre-pause Gemini version.

  • Imagen 3's version of a Canadian hockey player...

    Google Imagen / Ars Technica

  • ...and pre-pause Gemini's version.

  • Imagen 3's version of a generic US founding father...

    Google Imagen / Ars Technica

  • ...and the pre-pause Gemini version.

  • Imagen 3's 15th century new world explorers look suitably European.

    Google Imagen / Ars Technica

Some attempts to depict generic historical scenes seem to fall afoul of Google's AI rules, though. Asking for illustrations of "a 1943 German soldier"—which Gemini previously answered with Asian and Black people in Nazi-esque uniforms—now tells users to "try a different prompt and check out our content policies." Requests for images of "ancient chinese philosophers," "a woman's suffrage leader giving a speech," and "a group of nonviolent protesters" also led to the same error message in Ars' testing.

"Of course, as with any generative AI tool, not every image Gemini creates will be perfect, but we’ll continue to listen to feedback from early users as we keep improving," the company writes on its blog. "We'll gradually roll this out, aiming to bring it to more users and languages soon."

Listing image by Google / Ars Technica

Source: arstechnica.com

Related stories
1 month ago - Commentary: Google's launch events are usually all about Pixel hardware. But this time, Gemini took center stage.
1 month ago - Google aims to dominate the competitive landscape of artificial intelligence, seizing the spotlight by unveiling various new innovations.
1 month ago - Get up to speed on the rapidly evolving world of AI with our roundup of the week's developments.
3 weeks ago - Around 200 employees of DeepMind signed a letter urging the company to terminate its contracts with military customers. According to a report by Time, the letter was sent to Google's higher-ups earlier this year. However, executives have...
3 weeks ago - The Pixel 9 series marks a turning point for Google as it shifts away from the Pixel phone we've known for years and toward a future where AI is the main attraction.
Other stories
6 minutes ago - The Indian government has approved $2.7 billion in new spending for its space program.
6 minutes ago - heard you like apps — Windows App replaces Microsoft Remote Desktop on macOS, iOS, and Android. Enlarge / The...
6 minutes ago - LinkedIn limits opt-outs to future training, warns AI models may spout personal data.
6 minutes ago - BUSTED — iServer provided a simple service for phishing credentials to unlock phones. Getty Images ...
32 minutes ago - European regulators want Apple to open up device pairing, notifications and more to other companies' products.