Elon Musk announces Grok-1.5, nearing GPT-4 level performance

Mar 29, 2024 | Technology

Mere weeks after open-sourcing Grok-1, Elon Musk’s xAI has announced an upgraded version of its proprietary large language model (LLM): Grok-1.5.

Set to release next week, Grok-1.5 brings enhanced reasoning and problem-solving capabilities and closes in on the performance of known open and closed LLMs, including OpenAI’s GPT-4 and Anthropic’s Claude 3. It is also capable of processing long contexts but remains behind Gemini 1.5 Pro’s context window of up to 1 million tokens.

Musk noted that Grok-1.5 will power xAI’s ChatGPT-challenging chatbot on the X platform, while Grok-2, the successor of the new model, is still in the training phase. He said the next version should be able to “exceed current AI on all metrics” but did not share specifics of when it might become available.

What does Grok-1.5 bring to the table?

xAI announced Grok-1 in November 2023, saying that the AI was modeled after “The Hitchhiker’s Guide to the Galaxy” and can answer almost anything to assist humanity in its quest for understanding and knowledge – regardless of background or political views. On benchmarks shared by xAI, such as GSM8K, HumanEval and MMLU, Grok-1 outperformed Llama-2-70B and GPT-3.5.

Now, with Grok-1.5, the company is building on that work, delivering significant improvements over the previous model across all major benchmarks, including those for coding and math tasks.

“In our tests, Grok-1.5 achieved a 50.6% score on the MATH benchmark and a 90% …

