pwshub.com

Are we on the verge of a self-improving AI explosion?

While GPT-3.5 and Mixtral failed to improve their own improver function in this way, GPT-4 showed modest improvements in its seed improver over successive generations. In a very small number of cases (less than 0.5 percent) the improver function even turned off a "sandbox" flag intended to limit its capabilities, highlighting the potential for a self-reinforcing AI to modify any built-in safeguards.

"Since the language models themselves are not altered, this is not full recursive self-improvement," the researchers noted. "Nonetheless, it demonstrates that a modern language model, GPT-4 in our experiments, is capable of writing code that can call itself to improve itself."

High risk, high reward

These examples really just scratch the surface of what is becoming a large research focus on self-improvement across the AI space. Google Deepmind, Microsoft, and Apple have published similar papers looking at the concept, alongside multiple academic labs. On the PR side, Microsoft's Satya Nadella recently talked up the "recursiveness... of using AI to build AI tools to build better AI."

Microsoft CEO Satya Nadella says AI development is being optimized by OpenAI's o1 model and has entered a recursive phase: "we are using AI to build AI tools to build better AI" pic.twitter.com/IHuFIpQl2C

— Tsarathustra (@tsarnick) October 21, 2024

All that research has some observers nervous about the potential for self-coding AI systems that quickly outpace both our intelligence and our abilities to control them. Responding to Anthropic's research in AI newsletter Artificiality, Dave Edwards highlighted the concern:

For hundreds of years, the capacity for self-improvement has been fundamental to our understanding of what it is to be human, our capacity for self-determination and to create meaning as individuals and as collectives. What does it mean, then, if humans might no longer be the only self-improving beings or things in the world? How will we make sense of the dissolution of that understanding of our exceptionalism?

Based on the research so far, though, we might not be as close to that kind of exponential "AI takeoff" as some observers think. In a February post, Nvidia Senior Research Manager Jim Fan highlighted that self-reinforcing models in research settings generally hit a "saturation" point after three iterations. After that, rather than zooming toward superintelligence, they tend to start showing diminishing returns with each successive model.

Source: arstechnica.com

Related stories
5 days ago - It's not easy, but you really can sing in Elvish if you try! Yes, it will take a while...
1 month ago - Saying hello — Listen to our first audio demo with OpenAI's new natural voice chat features. Enlarge...
1 month ago - If you're searching for a meaningful yet budget-friendly gift, these options under $200 are just the thing.
1 month ago - Shop our curated list of the top gifts under $200, all personally vetted by our gifting experts. Whether your gift recipient is into tech gadgets, home decor, food or fitness, you'll find something here they're sure to love.
2 weeks ago - These thoughtful and useful holiday gift ideas are perfect for someone starting out in a new space, whether they're moving into a new home or an apartment. Check out our recommendations that will suit any budget, starting as low as $15.
Other stories
7 minutes ago - The new iMac is the first desktop computer to sport Apple's M4 processor, having launched the M4 iPad Pro in May. It marks the first of multiple Mac machines the company will introduce this week. It roughly doubles the performance of the...
7 minutes ago - Google Chrome risks becoming an inferior browser, at least when it comes to blocking invasive ads. Google replaced Manifest V2 with Manifest V3, forcing extension developers to adopt a less effective technology in exchange for a claimed...
16 minutes ago - 'They're taunting us,' investigator says and it looks like it's working The feds are investigating Chinese government-linked cyberspies breaking into the infrastructure of US telecom companies, as reports suggest Salt Typhoon - the same...
16 minutes ago - Project Jarvis will apparently conduct research, purchase products, and even book a flight on your behalf Google is reportedly looking to sidestep the complexity of AI-driven automation by letting its multimodal large language models...
28 minutes ago - Health officials were tipped off after EMS transported five customers to the hospital.