­
Grok 3: Is It the Smartest AI Model Available? [VIDEO]

Grok 3: Is It the Smartest AI Model Available? [VIDEO]

TECH NEWS – xAI, Elon Musk’s AI-focused company, has unveiled its latest large language model (LLM).

 

In a live broadcast (which you can watch below), they demonstrated Grok 3, which is available to Twitter users who subscribe at the most expensive level, as it is available in the Premium+ category. While the artificial intelligence company continues to tout the new LLM’s capabilities as best-in-class, some experts point to critical shortcomings in the benchmarks released. Musk announced that the older Grok 2 LLM will be open-sourced in a few months.

The xAI was keen to note that Grok 3 LLM beat all other publicly released versions of the base model, including DeepSeek-V3 and GPT-4o, in math, science and coding benchmarks. The LLM achieved an unprecedented score of 1402 on the Arena benchmark. Meanwhile, Manifold Markets’ bet that Grok 3 is the world’s most powerful artificial intelligence is now expected to receive an overwhelming majority of yes votes. It should be added, however, that the probability of a yes win has dropped from 91 percent late Monday night to just 78 percent. Critical comments around xAI Grok 3 may have played a role in this.

Zihan Wang (who coincidentally also used to work at DeepSeek) showed Grok 3 a picture of two different sized iron balls hanging from the Leaning Tower of Pisa at different heights, and then asked which ball would land first. The logical answer could only be the one that was heavier and closer to the ground, but LLM replied that both balls would land at the same time. Others asked why xAI did not publish the Grok 3 score on the FrontierMath, Arc-AGI, or HLE benchmarks.

This may raise the question of whether it is really the best LLM in its category. Meanwhile, Bloomberg recently reported that xAI is in talks with existing investors to raise up to $10 billion in a new funding round that would value the startup at $75 billion. In its last funding round, xAI raised $6 billion at a valuation of $40 billion. Guodang Zhang of xAI confirmed that Grok 3 has been trained on 100,000 GPUs.

You’d have to back up that claim, wouldn’t you, Elon?

Source: WCCFTech

Spread the love
Avatar photo
Anikó, our news editor and communication manager, is more interested in the business side of the gaming industry. She worked at banks, and she has a vast knowledge of business life. Still, she likes puzzle and story-oriented games, like Sherlock Holmes: Crimes & Punishments, which is her favourite title. She also played The Sims 3, but after accidentally killing a whole sim family, swore not to play it again. (For our office address, email and phone number check out our IMPRESSUM)

No comments

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.