pwshub.com

xAI debuts new Grok-2 and Grok-2 mini language models

Elon Musk’s xAI Corp. has debuted two new language models, Grok-2 and Grok-2 mini, that it claims can perform some tasks with similar accuracy to OpenAI’s GPT-4o.

The models rolled out to X on Tuesday. Later this month, they will also become available to developers through an application programming interface. The API will make it possible to integrate Grok-2 and Grok-2 mini into third-party services.

Musk launched xAI early last year to develop large language models. The company released its first LLM, Grok-1, later in 2023 and subsequently raised $6 billion from investors to finance the development of additional models. Grok-2 and Grok 2 mini, the latest fruits of the engineering effort, are rolling out about four months after the previous addition to xAI’s LLM lineup.

Grok-2, the more advanced of the two models, can generate text, troubleshoot code and perform related tasks. It’s also capable of analyzing user-provided images. Grok-2 mini, in turn, is a scaled-down version of the LLM that trades off some output quality for faster response times and lower inference costs.

In an internal test, xAI compared Grok-2 against several competing models to assess the quality of its output. The evaluation comprised eight benchmark datasets that researchers commonly use to measure LLMs’ accuracy. According to xAI, Grok-2 achieved “performance levels competitive” with the most advanced LLMs on the market.

One of the benchmark datasets that xAI used, GPQA, comprises 448 multiple-choice questions spanning several scientific fields. LLMs that complete the test receive a score reflective of how many questions they answered correctly. Grok-2 achieved a score of 56, which put it ahead of both GPT-4o and Meta’s newly released Llama 3 405B model.

The only LLM that outperformed Grok-2 in the GPQA test is Anthropic PBC’s Claude 3.5 Sonnet. The latter model achieved higher scores across most of the benchmark datasets that xAI used in the evaluation with the exception of two that comprised math questions. Grok-2 mini, in turn, achieved lower scores than the other LLMs across nearly all the benchmark datasets. 

Both of xAI’s new models became available in X on Tuesday for users with paid Premium and Premium+ subscriptions. The LLMs are accessible through a ChatGPT-like chatbot interface.

X’s implementation of Grok-2 is integrated with a third-party AI model called FLUX.1. The latter model, which was developed by a startup called Black Forest Labs Inc., allows users to generate images with natural language prompts. The Verge reported that Grok-2’s image generation features currently appear to have few guardrails against harmful output.

Later this month, xAI plans to make Grok-2 and Grok-2 mini available through an API. The offering will enable developers to integrate the models into their own applications. The API includes cybersecurity controls, a traffic analytics tool and the option to deploy the models in data centers near end-users to reduce latency.

Source: siliconangle.com

Related stories
1 month ago - Regulators are circling ever closer to big tech companies — the latest being Google, which the Federal Trade Commission more than hinted this week should be broken up. It’s not at all certain that will happen, since it’s up to the judge...
1 month ago - The Irish Data Protection Commission, the regulator that oversees X Corp.’s business practices in the European Union, has sent the company questions over a newly added privacy setting. Users of the Elon Musk-owned social network noticed...
1 week ago - It’s no surprise that entrepreneurs with a pedigree like Ilya Sutskever’s can raise a billion dollars, as the OpenAI co-founder did this week for his startup, SSI. And he wasn’t alone, as Nvidia and others also invested in two other...
2 weeks ago - Elon Musk’s xAI Corp. has completed the assembly of an artificial intelligence training system that features 100,000 graphics cards. Musk announced the milestone in a Monday post on X. The system, which xAI calls Colossus, came online...
1 month ago - As Elon Musk’s xAI Corp. debuted two new language models today, they’re already under the spotlight for appearing to push the limits of freedom of speech. Though Musk announced today that Grok is “the most fun AI in the world!”, the...
Other stories
39 minutes ago - Shares of Truth Social’s parent company fell Thursday, extending the latest round of declines for Trump Media & Technology Group.
1 hour ago - European Union officials are taking new steps to ensure that Apple Inc. complies with the bloc’s DMA tech industry regulation. The European Commission, the EU’s executive arm, announced the initiative today. The DMA is a piece of...
1 hour ago - Shares in automotive chip maker Mobileye Global Inc. jumped nearly 15% today after its majority shareholder, Intel Corp., said that it has no plans to divest its interest in the company. Reports earlier this month suggested that Intel...
1 hour ago - Cybersecurity risk management is becoming more critical than ever as industries adapt to an increasingly digital landscape. The rapid growth of artificial intelligence, combined with complex cyber threats, is pushing companies to rethink...
1 hour ago - Nike named a new CEO as Wall Street has questioned the company's plan to reinvigorate sales growth.