pwshub.com

Hermes 3, a super-creative version of open-source Llama 3.1 AI model, even struggles with inner conflict

Artificial intelligence startups Lambda Labs Inc. and Nous Research today announced the launch of a new large language model called Hermes 3, which it says is a “personalized, unrestricted” version of Meta Platforms Inc.’s open-source Llama 3.1 model.

The largest 405 billion parameter version of the Hermes 3 model is unusual in that it displays evidence of having an “existential crisis” when given a blank prompt followed by the question “Who are you?”.

In a blog post, Lambda’s researchers say this “feature,” for want of a better word, was totally unexpected and indicative of “anomalous behavior” that occurs when scaling AI models beyond a certain threshold. To understand what’s going on, the creators of Hermes 3 are inviting users to interact with the model via a Discord server and “uncover the labyrinth lurking within the weights.”

Lambda Labs is an AI infrastructure company that was born out of the ashes of a third-party Google Glass facial recognition app, while Nous Research is an AI research startup that’s focused on creating “potent open-source code and efficient large language models.” The two companies previously worked together on Hermes 3’s predecessors, including the original Hermes, Hermes 2 and Open Hermes 2.5, which have collectively been downloaded more than 33 million times in total.

What’s different about Hermes 3, besides being more advanced, is that it comes with unlocked and uncensored open weights. This means it’s more steerable, allowing users to adapt its responses to suit their specific needs. That’s in contrast to many of the other leading LLMs around today, which are often much more rigid and difficult to customize.

The model is available in three parameter sizes, 8 billion, 70 billion and 405 billion, and was trained on a diverse dataset in a process designed to improve its creativity, reasoning and adherence to user’s instructions. It boasts strong capabilities in terms of its long-term context retention, making it capable of more humanlike conversations where it can remember the specific context, as well as multiturn conversation management. It also excels at complex role-playing, which is something that often leaves proprietary LLMs flummoxed.

Another area of progress is Hermes 3’s agentic powers. AI models with agentic capabilities are those that can perform a series of tasks on the behalf of users, and it’s a big area of buzz in AI development lately. Hermes 3 is able to use XML tags for structured outputs, generate internal monologues for transparent decision-making, and partake in visual communications using Mermaid diagrams, the creators said. It also employs step-labeled reasoning and planning to enhance its transparency.

One of its most impressive agentic capabilities is its ability to generate code with high proficiency, as well as detailed explanations of that code and the corresponding documentation to go with it. So it has big potential in the area of software development and bug detection.

According to Nous Research, the Hermes 3 model was trained using Lambda’s 1-Click Cluster infrastructure and was optimized for efficiency using techniques such as Neural Magic Inc.’s FP8 quantization, reducing its virtual RAM and disk requirements by about 50%. It still doesn’t match the performance of proprietary LLMs such as OpenAI’s most advanced model, GPT-4o or Anthropic’s Claude 3.5 Sonnet, but it demonstrated superior performance versus all open-source LLMs in a varied set of benchmark tests.

The creators say the most appealing aspect of Hermes 3 is its sheer versatility. The model is said to excel in applications that require decision-making, advanced reasoning, strategic planning and creativeness.

“Since the start of my journey in AI, I wanted to bring about the realization of an open-source frontier-level model that aligns with you, the user — not some corporation or higher authority before the user. Today, with Hermes 3 405B, we’ve achieved that goal,” wrote Nous Research co-founder Teknium.

Both Lambda and Nous Research said they’re eager for people to engage with Hermes 3 and share their experiences. For casual users, Hermes 3 is available through the Lambda Chat interface. It can also  be accessed via Lambda’s Chat Completions application programming interface. To do so, they can generate a Cloud API key through Lambda’s dashboard and set about testing the model’s capabilities without any complex setup required.

For dedicated access, users can deploy Hermes 3 on a single Lambda node, or a more advanced multinode configuration if they desire to fine-tune it further.

Images: Nous Research & Lambda Labs

Source: siliconangle.com

Related stories
1 month ago - Regulators are circling ever closer to big tech companies — the latest being Google, which the Federal Trade Commission more than hinted this week should be broken up. It’s not at all certain that will happen, since it’s up to the judge...
3 weeks ago - Data shows that stock-split stocks tend to continue their track record of outperformance.
1 month ago - The AI chipmaker is scheduled to report results later this month. Can the stock continue its relentless climb?
2 days ago - Market expectations of substantial U.S. rate cuts this year are making short-dated debt unattractive as the Federal Reserve is unlikely to be as aggressive in easing monetary policy, said Deborah Cunningham, a money market fund manager...
1 month ago - (Bloomberg) -- Archer-Daniels-Midland Co.’s quarterly profit shrank more than expected as the grain trading giant faces a downturn in crop markets. Most Read from BloombergLuxury Heir Alleges His $13 Billion Hermès Fortune Has...
Other stories
21 minutes ago - Shares of Truth Social’s parent company fell Thursday, extending the latest round of declines for Trump Media & Technology Group.
54 minutes ago - European Union officials are taking new steps to ensure that Apple Inc. complies with the bloc’s DMA tech industry regulation. The European Commission, the EU’s executive arm, announced the initiative today. The DMA is a piece of...
54 minutes ago - Shares in automotive chip maker Mobileye Global Inc. jumped nearly 15% today after its majority shareholder, Intel Corp., said that it has no plans to divest its interest in the company. Reports earlier this month suggested that Intel...
54 minutes ago - Cybersecurity risk management is becoming more critical than ever as industries adapt to an increasingly digital landscape. The rapid growth of artificial intelligence, combined with complex cyber threats, is pushing companies to rethink...
1 hour ago - Nike named a new CEO as Wall Street has questioned the company's plan to reinvigorate sales growth.