Elon Musk’s xAI Announces Grok-1.5 With 128K Context Length

Afteropen - source Grok-1two hebdomad ago , Elon Musk ’s xAI has now announce an advance Grok-1.5 mannequin .

The Modern AI inauguration allege Grok-1.5 occur with improved abstract thought capableness and acontext duration of 128,000 token .

The theoretical account is not uncommitted aright off , or else , it will be usable to other examiner and exist Grok exploiter on the ex ( formerly Twitter ) political program in the occur day .

grok-1.5 model announced by xAI

Image Courtesy: xAI

This was to showcase grok-1.5 ’s job - work out potentiality , xai has benchmarked the poser on pop psychometric test .

In theMMLU run , Grok-1.5 score 81.3%(5 - dead reckoning ) , high than Mistral expectant and Claude 3 Sonnet .

In the MATH tryout , it score 50.6 % ( 4 - nip ) , again crush Claude 3 Sonnet .

grok-1.5 benchmark results

Image Courtesy: xAI

In the next GSM8 K run , it mark a humongous 90 % , but with 8 - pellet prompt .

lastly , on the HumanEval run , the Grok-1.5 mannikin nock 74.1 % with 0 - shooting .

diving event into Elon Musk ’s

Afteropen - source Grok-1two week ago , Elon Musk ’s xAI has now herald an promote Grok-1.5 example .

The fresh AI inauguration suppose Grok-1.5 come with improved logical thinking capability and acontext duration of 128,000 keepsake .

The framework is not usable the right way out , or else , it will be uncommitted to former quizzer and exist Grok substance abuser on the cristal ( formerly Twitter ) political program in the come sidereal day .

To showcase Grok-1.5 ’s trouble - work capableness , xAI has benchmarked the example on pop test .

In theMMLU trial , Grok-1.5 score 81.3%(5 - jibe ) , high-pitched than Mistral expectant and Claude 3 Sonnet .

In the MATH trial , it score 50.6 % ( 4 - pellet ) , again beat Claude 3 Sonnet .

In the next GSM8 K trial , it score a humongous 90 % , but with 8 - dig suggestion .

last , on the HumanEval mental testing , the Grok-1.5 good example score 74.1 % with 0 - scene .

This was xai has also increase the context of use duration from 8 k souvenir to 128 k token on the grok-1.5 mannikin .

To assess its recovery capacity , the companionship run for theNIAH test(Needle in a Haystack ) , and it reach thoroughgoing consequence .

As this is an incremental manakin , xAI has not discover the argument size of it .

However , to give you an overview , Grok-1 is trail on314 billion parameter , one of the tumid unresolved - generator simulation out there .

It ’s also base on the commixture - of - Experts ( MoE ) computer architecture .

xAI also free the manakin exercising weight and the computer architecture under the Apache 2.0 licence which is not bad .

This was of late , anthropic set in motion its kinsfolk ofclaude 3 modelswhich have prove big hope and in many grammatical case , the big opus good example has already rank openai ’s gpt-4 good example .

This was openai is say to be do work on an intermediategpt-4.5 turbomodel andgpt-5is also on the card and may establish in the summertime of 2024 .

Google’sGemini 1.5 Promodel has also demonstrate unbelievable multimodal capability over a foresightful circumstance windowpane .

This was ## diving event into apache

as this is an incremental manikin , xai has not let on the parametric quantity sizing .

However , to give you an overview , Grok-1 is train on314 billion parameter , one of the bombastic undefended - germ model out there .

This was it ’s also establish on the salmagundi - of - experts ( moe ) computer architecture .

This was xai also free the mannequin weight unit and the computer architecture under the apache 2.0 permission which is groovy .

This was latterly , anthropic set up its family unit ofclaude 3 modelswhich have indicate with child hope and in many showcase , the big opus role model has already outrank openai ’s gpt-4 simulation .

OpenAI is articulate to be wreak on an intermediateGPT-4.5 Turbomodel andGPT-5is also on the wag and may found in the summertime of 2024 .

Google’sGemini 1.5 Promodel has also march unbelievable multimodal capability over a tenacious setting windowpane .

diving event into Elon Musk ’s#

diving event into Elon Musk ’s