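Grok 3 Launch Event, Chinese-Subtitled Version | Frighteningly Smart
Summary
TL;DR: In the Grok 3 launch, the xAI team emphasized its vision for exploring the universe and acquiring knowledge, and showed a marked improvement in Grok 3's capabilities. The model's design is inspired by a deep pursuit of understanding humanity and the universe, with an emphasis on rigorous exploration of facts and truth. By building its own data center, the team overcame multiple challenges to train Grok 3. The latest Grok 3 performs strongly and came out ahead in a blind test. New features include advanced reasoning and Deep Search, which aims to give users real-time, accurate information retrieval and reflects the potential for continued improvement.
Takeaways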
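- 🌌 The importance of pursuing the truth of the universe
- 🚀 Grok 3 performs strongly across benchmarks
- 🛠️ Building an in-house data center to support AI training
- 🧠 Stronger reasoning and creativity
- 🔍 Deep Search helps answer questions precisely
- 📈 Grok is continuously optimized and improved
- 👾 Real-time feedback on user experience
- 🔋 Cooling and power problems in large-scale training
- 🎮 New game creation showcases AI capability
- 💡 Outlook for future AI applications across fields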
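Timeline
- 00:00:00 - 00:05:00
In this introduction to Grok 3, the team states its mission: to understand the nature of the universe and explore fundamental questions such as whether aliens exist and what the meaning of life is. They stress the importance of pursuing truth, introduce Grok 3's improved capabilities, and thank the team for its hard work.
- 00:05:00 - 00:10:00
After the team members introduce themselves, they explain where the name Grok comes from: it means "to fully and profoundly understand something," and they stress the importance of empathy. The early model (Grok 1) started from a low base compared with the latest Grok 3, but Grok's capabilities have improved markedly over the past few months thanks to the infrastructure and the team's effort.
- 00:10:00 - 00:15:00
They discuss training progress and the releases of Grok 1.5 and Grok 2. As the number of GPUs used for training grew, training capacity rose sharply. By building its own data center, the team solved the cooling and power problems and stood up a very large GPU cluster.
- 00:15:00 - 00:20:00
Grok 3 introduces advanced reasoning and was compared with other models in a blind test, where it was judged to be well ahead across capabilities. The team also stresses continuous updates and improvement, so users see better performance within a short time.
- 00:20:00 - 00:25:00
The continued expansion of the data center and the sharp increase in GPU count provided the technical foundation for Grok 3, showing how sustained effort and innovation built a world-leading AI training platform in a short time.
- 00:25:00 - 00:30:00
The team demonstrates Grok 3's math, science, and coding abilities, compares different model versions, and points to its strong results in all three areas.
- 00:30:00 - 00:35:00
In a live demo, the team shows Grok's reasoning on a physics problem and in game design, along with the code it generates, highlighting its creativity and problem-solving potential in practical use.
- 00:35:00 - 00:40:00
Covering ongoing updates to Grok's performance and features, team members describe Grok's ability to solve specific problems, its improved results on complex reasoning tasks, and how it carries out complex logical reasoning in particular scenarios.
- 00:40:00 - 00:45:00
The team also shows how Grok interacts with users and demonstrates the new Deep Search feature, which is meant to help users with everyday practical questions and give deeper insight and answers than existing search engines.
- 00:45:00 - 00:50:25
Finally, Grok 3's release plan and pricing structure are laid out for users, and the team looks forward to user feedback and further refinement of the product.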
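The blind test mentioned in the timeline ranks models by an Elo score built from pairwise user votes: a user sees two anonymous responses and picks one. The following is a minimal sketch of a classic Elo update, not a description of how the leaderboard in the talk actually computes its ratings. The K-factor of 32, the 400-point logistic scale, and the function names are all illustrative assumptions.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that model A wins a vote against model B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_ratings(rating_a: float, rating_b: float, a_won: bool, k: float = 32.0):
    """Return both ratings after one blind vote (k is an assumed K-factor)."""
    expected_a = expected_score(rating_a, rating_b)
    actual_a = 1.0 if a_won else 0.0
    # The winner gains exactly what the loser gives up, so the total is conserved.
    delta = k * (actual_a - expected_a)
    return rating_a + delta, rating_b - delta


if __name__ == "__main__":
    # Two models start level; one vote moves them apart symmetrically.
    print(update_ratings(1400.0, 1400.0, a_won=True))
```

Under this model, a rating gap translates directly into an expected win rate, which is why a single aggregate number can summarize many head-to-head votes.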
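Video Q&A
What new features does Grok 3 have?
Grok 3 has enhanced reasoning and a Deep Search feature, which let it better understand and answer user questions.
How much does Grok 3 improve on Grok 2?
Grok 3 is more than ten times better than Grok 2 in performance.
How can I access Grok 3?
The first users with access to Grok 3 will be Premium Plus subscribers on the X platform.
Will Grok 3 be open-sourced?
Open-sourcing will be considered once Grok 3 is stable and mature.
When will Grok's voice assistant launch?
Grok's voice assistant is expected to launch soon, but it is still being polished.
What does Deep Search do?
Deep Search analyzes the user's question in depth and returns more accurate answers and information, that is, a more efficient search-engine experience.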
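The enhanced reasoning mentioned above can be given extra test-time compute. In the talk, this is described as letting the model solve the same problem many times before it concludes which solution is right. One common way to do that is majority voting over sampled answers. The sketch below assumes that approach, since the presentation does not say how Grok 3 selects its final answer. `sample_answer` is a hypothetical stand-in for a model call.

```python
from collections import Counter
from typing import Callable, List


def majority_vote(answers: List[str]) -> str:
    """Return the most frequent answer; ties go to the answer seen first."""
    return Counter(answers).most_common(1)[0][0]


def solve_with_budget(sample_answer: Callable[[], str], n_samples: int) -> str:
    """Sample n_samples independent attempts and keep the consensus answer.

    sample_answer is a hypothetical callable standing in for one model attempt.
    """
    if n_samples < 1:
        raise ValueError("n_samples must be at least 1")
    answers = [sample_answer() for _ in range(n_samples)]
    return majority_vote(answers)


if __name__ == "__main__":
    # Canned attempts stand in for real model samples.
    attempts = iter(["204", "210", "204", "204", "198"])
    print(solve_with_budget(lambda: next(attempts), 5))
```

A larger `n_samples` spends more compute per question, which corresponds to the "think longer" budget the presenters associate with the shaded bars.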
- 00:02:56all right well welcome to the Grok 3
- 00:03:00presentation um so the mission of xAI
- 00:03:04and Grok is to understand the universe we
- 00:03:07want to understand the nature of the
- 00:03:08universe so we can figure out what's
- 00:03:10going on where are the aliens what's the
- 00:03:12meaning of life how does the universe
- 00:03:13end how did it start all these
- 00:03:15fundamental questions um we're driven by
- 00:03:18curiosity about the nature of the
- 00:03:20universe and um that's also what causes
- 00:03:23us to be a maximally truth
- 00:03:27seeking uh AI even if that truth is
- 00:03:31sometimes at odds with what is
- 00:03:32politically correct in order to
- 00:03:35understand the nature of the universe
- 00:03:37you must absolutely rigorously pursue
- 00:03:39truth or you will not understand the
- 00:03:41universe you'll be suffering from some
- 00:03:43amount of delusion or error so that is
- 00:03:46our goal um figure out what's going on
- 00:03:50and uh we're very excited to present
- 00:03:53Grok 3 which is we think uh an order of
- 00:03:56magnitude more capable than Grok 2 in a
- 00:03:58very short period of time
- 00:04:00and uh that's thanks to uh the hard work
- 00:04:04of an incredible team and um I'm honored
- 00:04:07to work with such a great team and of
- 00:04:09course we'd love to have um some of the
- 00:04:11smartest humans out there join our team
- 00:04:14so uh with that let's let's go hi
- 00:04:18everyone my name is Igor lead
- 00:04:19engineering at xAI I'm Jimmy Ba leading
- 00:04:23research I'm Tony working on the
- 00:04:25reasoning team all right I'm Elon I don't
- 00:04:28do anything
- 00:04:30I just show up
- 00:04:31occasionally yeah so um like mentioned
- 00:04:34Grok is the tool that we're working on
- 00:04:36Grok is our AI that we're building here
- 00:04:38at xAI and we've been working extremely
- 00:04:40hard over the last few months to improve
- 00:04:41Grok as much as we can so we can give
- 00:04:43it to all of you so we can give all of
- 00:04:45you access to it um we think it's going
- 00:04:47to be extremely useful do we think it's
- 00:04:49going to be interesting to talk to funny
- 00:04:51really really funny um and um we're
- 00:04:53going to explain to you how we've
- 00:04:54improved gr over the last few months
- 00:04:56we've made quite a jump in in
- 00:04:57capabilities yeah actually we should
- 00:04:59explain maybe also what is why do we
- 00:05:00call it Grok so Grok is a word from um a
- 00:05:04Heinlein novel Stranger in a Strange Land
- 00:05:07um and it's used by a guy who's who
- 00:05:11was raised on Mars um and the word Grok
- 00:05:14is to sort of fully and profoundly
- 00:05:17understand something that's what the
- 00:05:18word Grok means fully and profoundly
- 00:05:20understand something and empathy is
- 00:05:23important true
- 00:05:26yeah so yeah so uh if we charted xAI's
- 00:05:30progress uh in the last few months it has
- 00:05:33only been 17 months since we started
- 00:05:36kicking off our very first model uh
- 00:05:39Grok 1 was almost like a toy by this
- 00:05:43point only 314 billion parameters and
- 00:05:45now if we plot the progress the time on
- 00:05:49x-axis the performance of favorite
- 00:05:51benchmark numbers MMLU on the y-axis
- 00:05:54we're literally progressing at
- 00:05:56unprecedented speed across the whole field
- 00:06:00and then we kick off Grok 1.5 right
- 00:06:02after Grok 1 released after November
- 00:06:052023 and then Grok 2 so if you look at
- 00:06:09where the all the performance coming
- 00:06:12from when you have a very correct
- 00:06:14engineering team and all the best AI
- 00:06:17talent the only thing we need is a
- 00:06:20big intelligence comes from big
- 00:06:23cluster so we can reconvert the entire
- 00:06:27progress of xAI now replacing the benchmark on
- 00:06:29the y axis to the total amount of
- 00:06:31training FLOPs that is how many GPUs we
- 00:06:34can run at any given time to train our
- 00:06:36large language models to compress the
- 00:06:39entire
- 00:06:40internet so after all human all human
- 00:06:43knowledge really that's right yeah
- 00:06:44internet being part of it but it's
- 00:06:46really all human knowledge all
- 00:06:47everything yeah the whole internet fits
- 00:06:49into a USB stick at this point it's like
- 00:06:51all the human tokens yeah that's right
- 00:06:54yeah uh very soon into the real world
- 00:06:57yeah um so we had so much trouble
- 00:07:00actually training Grok 2 back in the days
- 00:07:03uh we kicked off the model around February
- 00:07:07and uh we thought we had a large amount
- 00:07:09of chips but turned out we can barely
- 00:07:11get 8K training chips running coherently
- 00:07:14at any given time and we had so many
- 00:07:18Cooling and power issues I think you
- 00:07:21were there in the data center yeah it
- 00:07:23was like really sort of more like 8K
- 00:07:25chips on average at 80% efficiency more
- 00:07:28like like 6,500 effective uh H100s
- 00:07:32training for you know several months but
- 00:07:36now now we're at 100K so yeah that's
- 00:07:39right more than 100K that's right so so
- 00:07:41what's the next step right so after Grok 2
- 00:07:45so if we want to continue
- 00:07:47accelerate we have to take the matter
- 00:07:49into our own hands we have to solve all
- 00:07:50the coolings um all the power issues and
- 00:07:54everything yeah so so in April of last
- 00:07:56year Elon decided that really the only
- 00:07:58way for xAI to succeed for xAI to build the
- 00:08:01best AI out there is to build our own
- 00:08:03data center so um we didn't have a lot
- 00:08:06of time because we wanted to give
- 00:08:07you Grok 3 as quickly as possible so
- 00:08:10really we realized we have to build the
- 00:08:12data center in about four months um it
- 00:08:15turned out it took us 122 days to get
- 00:08:17the first 100K GPUs up and running and
- 00:08:20that was a monumental effort uh to be
- 00:08:22able to do that um it's we believe it's
- 00:08:25the biggest uh fully connected H100
- 00:08:28cluster of its kind um and uh we didn't
- 00:08:30just stop there we actually decided that
- 00:08:32we need to double the size of the
- 00:08:34cluster pretty much immediately if we
- 00:08:36want to build uh the kind of AI that we
- 00:08:38want to build um so we then had another
- 00:08:42phase um which we haven't talked about
- 00:08:44publicly yet so this is the first time
- 00:08:45that we're talking about this uh where
- 00:08:47we doubled the capacity of the data
- 00:08:49center yet again um and that one only
- 00:08:52took us 92 days so we've been able to
- 00:08:55use all of these GPUs use all of this
- 00:08:56compute to improve Grok in the meantime
- 00:08:59and basically today we're going to
- 00:09:00present you the results of that the the
- 00:09:03fruits that came from that um so let's
- 00:09:07yeah so all the paths all the roads lead
- 00:09:09to Grok 3 uh 10x more compute more than
- 00:09:1310x really yeah really like maybe 15x
- 00:09:17yep uh compared to our previous
- 00:09:19generation model and Grok 3 finished the
- 00:09:22pre-training uh early January um and uh
- 00:09:26then we start you know the model still
- 00:09:28currently training actually so this is a
- 00:09:30little preview of our Benchmark numbers
- 00:09:34so we evaluated Grok 3 on you know three
- 00:09:37different categories on general
- 00:09:40mathematical reasoning on general
- 00:09:43knowledge about STEM and science and
- 00:09:46then also on computer science
- 00:09:48coding so AIME uh American Invitational
- 00:09:52Mathematics
- 00:09:53Examination uh hosted you know once a
- 00:09:56year uh and if we evaluate model
- 00:09:59performance we can see that Grok 3
- 00:10:02across the board is in a league of its
- 00:10:04own even its little brother Grok 3 mini is
- 00:10:09reaching the frontier across all the
- 00:10:11other
- 00:10:12competitors so you will say well at this
- 00:10:15point all these benchmarks you're just
- 00:10:18evaluating you know the memorization of
- 00:10:19the textbooks memorization of the GitHub
- 00:10:22repos how about real-time usefulness how
- 00:10:25about we actually use those models in
- 00:10:27our product so what we did instead is we
- 00:10:31actually kicked off a blind test of our
- 00:10:34Grok 3 model code named Chocolate it's
- 00:10:37pretty hot yeah hot chocolate um and uh
- 00:10:41you know been running on this uh
- 00:10:44platform called Chatbot Arena for two weeks
- 00:10:46um I think the entire X platform at some
- 00:10:49point speculated this might be the next
- 00:10:51generation of AI coming our way so uh
- 00:10:56how this Chatbot Arena works is that um it
- 00:10:59strips away the entire product surface
- 00:11:02right it's just raw comparison of the
- 00:11:04engine of those AGIs the language models
- 00:11:07themselves and place interface where the
- 00:11:09user will submit one single query and
- 00:11:12you get shown two responses you don't
- 00:11:14know which model they come from and in the
- 00:11:16end you make the vote so in this blind
- 00:11:18test Grok 3 an early version of Grok 3
- 00:11:22already reached like 1,400 no other
- 00:11:26models has reached an Elo score in head-to-head
- 00:11:28comparison to all the other models
- 00:11:30at this score and it's not just one
- 00:11:33single category it's 1,400 aggregated
- 00:11:36across all the categories in chatbot
- 00:11:39capabilities instruction following
- 00:11:41coding so it's number one across the
- 00:11:43board in this blind test and it's it's
- 00:11:45still climbing so we actually have to keep
- 00:11:47updating it so it's it's 1,400 above
- 00:11:501,400 and climbing yeah and in fact we
- 00:11:52have a version of the model that we
- 00:11:53think is already much better than the
- 00:11:55one that we tested here yeah we'll see
- 00:11:57you know how how far it gets uh but
- 00:12:00that's the one that we're you know um
- 00:12:02working on or talking about today yeah
- 00:12:04so actually one thing if if you're if
- 00:12:06you're using Grok 3 you I think you may
- 00:12:07notice improvements almost every day um
- 00:12:10because we're we're continuously
- 00:12:11improving the model so
- 00:12:13literally even within 24 hours you'll
- 00:12:15see
- 00:12:16improvements yep so but we believe here
- 00:12:20at xAI getting the best pre-training
- 00:12:23model is not enough that's not enough to
- 00:12:25build the best AI and the best AI need
- 00:12:28to think like a human
- 00:12:29you to contemplate about all the
- 00:12:31possible
- 00:12:32solutions self-critique verify all the
- 00:12:36solutions backtrack and also think from
- 00:12:39the first principle that's a very
- 00:12:41important capability so we believe that
- 00:12:44as we take the best pre-train model and
- 00:12:47continue training it with reinforcement
- 00:12:49learning it will elicit the additional
- 00:12:52reasoning capabilities that allows the
- 00:12:54model just become so much better and
- 00:12:57scale not just in the training time but
- 00:12:59in the test time as well so we already
- 00:13:02found the model is extremely useful
- 00:13:04internally um for our own engineering
- 00:13:06saving hours of uh time hundreds of
- 00:13:09hours of uh coding time so Igor you're the
- 00:13:12power user of our uh Grok reasoning
- 00:13:14model what are some use cases yeah so
- 00:13:16like Jimmy said we've added advanced
- 00:13:18reasoning capabilities to Grok and we've
- 00:13:20been testing them pretty heavily over
- 00:13:21the last few weeks in order to give you
- 00:13:23a little bit of a taste of what it looks
- 00:13:24like when Grok is solving hard reasoning
- 00:13:27problems so we prepared two little
- 00:13:28problems for you one comes from physics
- 00:13:31and one is actually a game that Grok is
- 00:13:32going to write for us um so when it
- 00:13:35comes to the physics problem you know
- 00:13:36what we want Grok to do is to plot a
- 00:13:39viable trajectory to do a transfer from
- 00:13:42Earth to Mars and then uh at a later
- 00:13:45point in time a transfer back from Mars
- 00:13:47to Earth um and that requires some you know
- 00:13:50some physics that Grok will have to
- 00:13:52understand um so we're going to
- 00:13:53challenge Grok you know come up with a
- 00:13:55viable trajectory calculate it and
- 00:13:58then plot for us so we can see it and um
- 00:14:02yeah this is totally unscripted by the
- 00:14:04way this is the that's the entirety of
- 00:14:05the prompt which was we clarify is that
- 00:14:08yeah there's nothing more than that yeah
- 00:14:10exactly this is the Grok interface and
- 00:14:12we've typed in this text that you can
- 00:14:14see here generate code for an animated
- 00:14:163D plot of a launch from Earth uh
- 00:14:19landing on Mars and then back to Earth
- 00:14:21at the next launch window um and we've
- 00:14:24now kicked off the query and you
- 00:14:26can see Grok is thinking so uh part of
- 00:14:29Grok's advanced reasoning capabilities
- 00:14:31are these thinking traces that you can
- 00:14:32see here you can even go inside and
- 00:14:35actually read what Grok is thinking as
- 00:14:37it's going through the problem as it's
- 00:14:38trying to solve it
- 00:14:41um yeah we say like we are doing some
- 00:14:44obscuration of the thinking so that our
- 00:14:46model doesn't get totally copied
- 00:14:48instantly um so there's more to the
- 00:14:51thinking than is displayed uh yeah yeah
- 00:14:56and because this is totally unscripted
- 00:14:58there's actually a chance that Grok
- 00:14:59might make a little coding mistake and
- 00:15:01it might not actually work um so um just
- 00:15:04in case we're going to launch two more
- 00:15:06instances of this so if something goes
- 00:15:08wrong we're able to uh to switch to
- 00:15:11those and show you um something that's
- 00:15:14presentable so we're kicking off the
- 00:15:16other two as well um and like I said we
- 00:15:18have a second problem as well um and um
- 00:15:22yeah actually one of the favorite one of
- 00:15:23our favorite activities here at xAI is
- 00:15:25having Grok write games for us um and um not
- 00:15:29just any you know uh any old game any game
- 00:15:32that you might already be familiar with
- 00:15:33but actually creating new games on the
- 00:15:35spot and being creative about it um so
- 00:15:38one example that we found was really
- 00:15:40really fun um is create a game that's a
- 00:15:43mixture of the two games Tetris and Bejeweled
- 00:15:47so this is that maybe an important thing
- 00:15:49like this obviously if you if you ask an
- 00:15:52AI to create a game like Tetris there's
- 00:15:53there are many examples of Tetris on the
- 00:15:55on the Internet or a game like Bejeweled
- 00:15:58whatever it is it can copy it what's
- 00:16:01interesting here is it achieved a
- 00:16:03creative solution combining the two
- 00:16:06games that actually works and and is a
- 00:16:10good game yeah that's the it's cre we're
- 00:16:12seeing the beginnings of
- 00:16:14creativity yeah fingers cross that we
- 00:16:17can recreate that hopefully it works
- 00:16:19yeah embarrassing it so actually because
- 00:16:21this is a bit more challenging we're
- 00:16:23going to use something special here
- 00:16:24which we call Big Brain that's our mode
- 00:16:27in which we use more computation
- 00:16:30reason for just to make there's a good
- 00:16:33chance here that actually might actually
- 00:16:35do it so we're also going to fire off you know
- 00:16:37three attempts here at at solving this
- 00:16:40game at creating this game that's a
- 00:16:43mixture of you know Tetris and
- 00:16:45Bejeweled um yeah let's let's see what Grok
- 00:16:47comes up with like I've played the game it's
- 00:16:49pretty good like it's like wow okay this
- 00:16:52is something yeah um so while Grok is
- 00:16:55thinking uh in the in the background um
- 00:16:57we can now actually talk about some
- 00:16:59concrete numbers you know how how well is Grok
- 00:17:01doing across tons of different tasks
- 00:17:03that we've tested it on um so we'll hand
- 00:17:05it over to Tony to talk about that yeah
- 00:17:08okay so let's see how Grok does on those
- 00:17:11interesting challenging benchmarks uh so
- 00:17:14yeah so reasoning again refers to those
- 00:17:16models that actually thinks quite for
- 00:17:19quite a long time before it tries to
- 00:17:21solve a problem so in this case uh you
- 00:17:24know around a month ago the Grok 3
- 00:17:26pre-training finishes so after that we
- 00:17:29work very hard to put the reasoning
- 00:17:31capability into the uh current Grok 3
- 00:17:34model but again this is very early days
- 00:17:37so the model is still currently in
- 00:17:39training so right now what we're going
- 00:17:41to show to people is this beta version
- 00:17:43of the Grok 3 reasoning model
- 00:17:45alongside we also are training a mini
- 00:17:48version of the reasoning model so
- 00:17:50essentially on this plot you can see uh
- 00:17:52the Grok 3 reasoning beta and then
- 00:17:54Grok 3 mini reasoning the Grok 3
- 00:17:56mini reasoning is actually a
- 00:17:58model that we train for much longer time
- 00:18:00and you can see that sometimes it
- 00:18:01actually performs slightly better compared to
- 00:18:04the Grok 3 reasoning this also just
- 00:18:06means that there's a huge potential for
- 00:18:08the Grok 3 reasoning because it's
- 00:18:10trained for much less time um so all
- 00:18:13right so let's actually look at what how
- 00:18:15how it does on those three benchmarks so
- 00:18:18Jimmy also introduced already so
- 00:18:20essentially we're looking at three
- 00:18:21different areas mathematics science and
- 00:18:24coding um and for math we're picking
- 00:18:27this high school competition math
- 00:18:28problem
- 00:18:29um for science we actually pick those
- 00:18:32PhD level science questions um and for
- 00:18:35coding it's also actually pretty
- 00:18:36challenging it's competitive coding and
- 00:18:38also some uh LeetCode which is some
- 00:18:40coding interview problems that
- 00:18:42people usually get when they interview
- 00:18:44for companies so on those benchmarks you
- 00:18:46can see that Grok 3 actually performs
- 00:18:49quite well uh across the board compared
- 00:18:52to other competitors um yeah so it's
- 00:18:55pretty promising these models are very
- 00:18:56smart so Tony what what what are those
- 00:18:59shaded bars yeah so okay so I'm glad you
- 00:19:02asked this question so for those models
- 00:19:05because it can reason it can thinks you
- 00:19:07can also ask them to even think longer
- 00:19:10uh you can spend more what we call test-time
- 00:19:13compute which means you can spend
- 00:19:15more time to reason to think about a
- 00:19:18problem before you spit out the answer
- 00:19:21so in this case the Shaded bar here
- 00:19:24means that we just ask the model to
- 00:19:26spend more more time you know you can
- 00:19:28can solve the the same problem many many
- 00:19:30times before it it tries to conclude
- 00:19:33what is the right solution and once you
- 00:19:35give this compute or this this kind of
- 00:19:37budget to the model it turns out the
- 00:19:40model can even perform better so this is
- 00:19:43essentially the Shaded bar in in those
- 00:19:45plots right so I think this is really
- 00:19:48exciting right because now instead of
- 00:19:50just doing one chain of thoughts with AI
- 00:19:52why not do multiple all at once yes so
- 00:19:55that's a very powerful technique that
- 00:19:56allows to continue scale the model
- 00:19:58capabilities after training um and you
- 00:20:02know people often ask are we actually
- 00:20:04just over fitting to the benchmarks yes
- 00:20:06so how about generalization so yes I
- 00:20:08think uh yeah this is definitely a
- 00:20:11question that we are asking ourselves
- 00:20:13whether we are overfitting to those
- 00:20:14current benchmarks uh luckily uh we have
- 00:20:17a real test so about 5 days ago AIME 2025
- 00:20:22just finished this is where high school
- 00:20:24students compete in this particular
- 00:20:27Benchmark so we got this very fresh new
- 00:20:29competition and then we asked our two
- 00:20:31models to compete on the same Benchmark
- 00:20:34at the same exam and it turns out uh
- 00:20:37very interestingly the Grok 3
- 00:20:39reasoning the big one um actually does
- 00:20:42uh better um on this particular new
- 00:20:44fresh exam this also means that the
- 00:20:46generalization capability of the big
- 00:20:48model is stronger much stronger compared
- 00:20:51to the smaller model uh if you compare
- 00:20:53to the last year's exam actually this is
- 00:20:55the opposite the smaller model kind of
- 00:20:57learns the uh the the previous exams
- 00:21:00better so yeah so this this actually
- 00:21:02shows some kind of true generalization
- 00:21:04from the model that's right so 17 months
- 00:21:07ago our Grok 0 and Grok 1 barely
- 00:21:09solves any high school problems that's
- 00:21:11right and now we have a kid that just
- 00:21:14already graduated Grok is ready
- 00:21:16to go to college is that right yeah I
- 00:21:18mean it won't be long before it's
- 00:21:19simply perfect the human exams won't be
- 00:21:22part they'll be too easy yeah like and
- 00:21:25internally we actually as Grok continually
- 00:21:28evolves
- 00:21:29uh we're going to talk about you know
- 00:21:30what we're excited about but very soon
- 00:21:33there will be no more benchmarks left
- 00:21:35yeah yeah one thing that's quite
- 00:21:38fascinating I think is that we basically
- 00:21:40only trained Grok's reasoning abilities
- 00:21:42on math problems and competitive coding
- 00:21:44problems right so very very specialized
- 00:21:47kinds of tasks but somehow it's able to
- 00:21:50work on all kinds of other different
- 00:21:52tasks so including creating games no
- 00:21:55lots lots and lots of different things
- 00:21:57um and what seems to be happening is
- 00:21:58that basically Grok learns this ability
- 00:22:01to detect its own mistakes in its
- 00:22:02thinking correct them persist on a
- 00:22:05problem try lots of different variants
- 00:22:07pick pick the one that's best so there
- 00:22:08are these generalized generalizing
- 00:22:10abilities that Grok learns from
- 00:22:12mathematics and from coding which it can
- 00:22:14then use to solve all kinds of other
- 00:22:16problems so that's yeah that's pretty I
- 00:22:18mean reality is the instantiation of
- 00:22:21mathematics that's right um and one
- 00:22:23thing we're actually really excited
- 00:22:25about that going back to our founding
- 00:22:26mission is what if one day we have a
- 00:22:29computer just like Deep Thought that
- 00:22:32utilizes our entire cluster just for that
- 00:22:34one very important problem in the test
- 00:22:36time all the GPU turned on right so I
- 00:22:39think we back then we were building the
- 00:22:40GPU clusters together uh you were
- 00:22:42plugging cables and I remember that when
- 00:22:46we turn on the the first initial test
- 00:22:49you can hear all the GPUs humming in the
- 00:22:51hallway that almost feels like
- 00:22:53spiritual yeah that that's actually a
- 00:22:55pretty cool uh thing that we're able to
- 00:22:57do that we can go into the data Center
- 00:22:59and Tinker with the machines there so
- 00:23:01for example we went in and we unplugged
- 00:23:04a few of the cables and just made sure
- 00:23:06that our training setup is still running
- 00:23:08running stably so that's something that
- 00:23:10you know I think most uh AI you know
- 00:23:13teams out there don't usually do but
- 00:23:15it's actually totally unlocks like a new
- 00:23:17level of reliability and what you're
- 00:23:19able to do with with the hardware so
- 00:23:21okay so when when are we going to solve
- 00:23:24Riemann so uh the easiest solution is to
- 00:23:28enumerate over all possible strings and
- 00:23:32as long as you have a verifier and enough
- 00:23:33compute you'll be able to do it okay my
- 00:23:36projection will be what's your guess what
- 00:23:38does your neural net calculate so my my my
- 00:23:42bold prediction so so three years ago I
- 00:23:43told you this I think in now it's uh two
- 00:23:46years uh later two things going to
- 00:23:48happen we're going to see machines win
- 00:23:51some medals yeah that's Turing Award
- 00:23:53absolutely Fields medal Nobel Prize with
- 00:23:57probably some expert in the loop right
- 00:23:59so the expert uplifting do you mean so
- 00:24:01this year or next
- 00:24:02year oh oh
- 00:24:05okay that's what it comes down to real
- 00:24:07yeah so it looks like Grok finished all
- 00:24:10of its thinking on on the two problems so
- 00:24:12let's take a look at what it
- 00:24:15said all right so this was the the
- 00:24:18little physics problem we had um no we
- 00:24:21we've collapsed the thoughts here so
- 00:24:23they're you know they're hidden and then
- 00:24:25we see Grok's answer below that so it
- 00:24:27explains it wrote a Python script here
- 00:24:29using matplotlib then gives us all of
- 00:24:31the code um so let's take a quick look
- 00:24:34at the code you know seems like it's
- 00:24:35doing reasonable things here not not
- 00:24:38totally off the mark um solve Kepler says
- 00:24:42here so maybe it's solving Kepler's laws
- 00:24:44Kepler's law numerically um yeah there's
- 00:24:47really only one way to find out if this
- 00:24:49thing is working I'd say let's let's
- 00:24:51give it a try let's run let's run the
- 00:24:52code all right and we can see um yeah Grok
- 00:24:56is animating two different planets Earth
- 00:24:58and Mars here and then the the green
- 00:25:02ball is the the vehicle that's
- 00:25:04transiting the the spacecraft that's
- 00:25:06transitioning between Earth and Mars and
- 00:25:08you you could see the journey from Earth
- 00:25:10to Mars and looks like yeah indeed the
- 00:25:12the astronauts return safely you know at
- 00:25:15the right moment in time um so now
- 00:25:19obviously this was just generated on the
- 00:25:20spot so now we can't tell you if that was
- 00:25:23actually correct solution so we're going
- 00:25:24to take a closer look now maybe we're
- 00:25:25going to call some colleagues from SpaceX
- 00:25:28ask them if if this is legit um it's
- 00:25:31pretty close it's it's I mean uh yeah I
- 00:25:35mean there there's a lot of complexities
- 00:25:37in the actual orbits that have to be
- 00:25:39taken into account but this is this is
- 00:25:40pretty close to to what it what looks
- 00:25:42like awesome um in fact I have that on
- 00:25:46my pendant here it's got the Earth Hohmann
- 00:25:49transfer on
- 00:25:52it when when are we going to install
- 00:25:54Grok on a rocket
- 00:25:58well I suppose in two years two years
- 00:26:02everything is two years away uh well
- 00:26:05Earth and Mars transit can occur every
- 00:26:0826 months the next we're currently in a
- 00:26:11Transit window approximately the next
- 00:26:12one would be um November of next year um
- 00:26:18roughly end of next year um and uh if
- 00:26:21all goes well SpaceX will send Starship
- 00:26:24Rockets to Mars and um with Optimus
- 00:26:29robots and
- 00:26:31uh and
- 00:26:34Grok I'm curious what this combination of
- 00:26:37Tetris and Bejeweled looks like bet Tetris as
- 00:26:41we've named it internally um so okay we
- 00:26:45also have an output from Grok here it says it
- 00:26:47wrote a Python script explains that it's
- 00:26:49what it's been doing if you look at the
- 00:26:51the code you know there are some constants
- 00:26:54that are being defined here some colors
- 00:26:56then the the tetrominoes the the the pieces
- 00:26:59of Tetris are there um obviously very
- 00:27:02hard to see at one glance if this is
- 00:27:04good so we got to we got to run this to
- 00:27:07figure out if it's
- 00:27:08working well let's let's give it a
- 00:27:11try fingers crossed all right right so
- 00:27:13this kind of looks like Tetris uh but
- 00:27:16the the colors are a little bit off
- 00:27:18right the colors are different here and
- 00:27:21um I if you think about what's going
- 00:27:24what's going on
- 00:27:25here Bejeweled has this mechanic where if you
- 00:27:28get three jewels in a row you know then
- 00:27:31they they disappear and also gravity
- 00:27:33activates right so what happens if you
- 00:27:36get three of the colors together oh so
- 00:27:38something happened um so I think I think
- 00:27:41what Grok did in this version um is is
- 00:27:45that you know once you connect three at
- 00:27:48least three blocks of the same color in
- 00:27:50a row then um you know gravity activates and
- 00:27:55they disappear and then gravity
- 00:27:56activates and all the other blocks fall
- 00:27:57down
- 00:27:59um kind of kind of curious if there's
- 00:28:01still a Tetris mechanic here where if
- 00:28:03the line is full does it actually um
- 00:28:06clear it or what happens then it's up to
- 00:28:10interpretation you know so who who knows
- 00:28:12yeah I mean when it'll do different
- 00:28:14variants when you ask it it doesn't do
- 00:28:16the same thing every time exactly we've
- 00:28:18seen a few other the tetris that worked
- 00:28:20very differently but this one seems cool
- 00:28:23so yeah are we ready for uh game Studio
- 00:28:27at xAI yes so we're launching uh an AI
- 00:28:31gaming studio at xAI if you're
- 00:28:33interested in joining us and building AI
- 00:28:35games uh please join xAI we're launching
- 00:28:38an AI gaming studio we're announcing it
- 00:28:40tonight let's
- 00:28:41go Epic Games wait that's an actual
- 00:28:45[Laughter]
- 00:28:47game yeah yeah um all right
- 00:28:52so um I think one thing is super
- 00:28:54exciting for us uh is that once you have
- 00:28:58the best pre-trained model you have the
- 00:29:00best reasoning model right so we already
- 00:29:03see that when you actually give the
- 00:29:05capability for those model to think
- 00:29:06harder uh think longer think more broad
- 00:29:10the performance continue improves and
- 00:29:13we're really excited about the next
- 00:29:14Frontier that what happen if would not
- 00:29:17only allow the model to think harder but
- 00:29:18also provide more tools this like call
- 00:29:21real humans to solve those problems for
- 00:29:23real humans we don't ask them to solve
- 00:29:26reman a hypothesis just with a piece of
- 00:29:28pen and paper no internet so with all
- 00:29:33the basic web browsing search engine and
- 00:29:36code interpreters that builds the
- 00:29:39foundations and the best reasoning model
- 00:29:41builds the foundations for the Grok agent
- 00:29:44to come um so today we're actually
- 00:29:48introducing a new product called Deep
- 00:29:51search that is the first generation of
- 00:29:54our Grok agents that are not just helping the
- 00:29:56engineers and research scientists to do
- 00:29:58coding but actually help everyone to
- 00:30:01answer questions that you have day to day
- 00:30:03it's kind of like a next generation of
- 00:30:05search engine that really helps you to
- 00:30:07understand the universe so you can start
- 00:30:10asking questions like for example hey
- 00:30:12when is the next Starship launch date for
- 00:30:15example um so let's try that and get the
- 00:30:19answer um on the left hand side we see
- 00:30:23uh a high level progress bar essentially
- 00:30:26you know the model is not just going to do one
- 00:30:28single search like the current RAG
- 00:30:30systems but actually thinks very deeply
- 00:30:32about hey what's the user intent here
- 00:30:35and what are the facts I should consider
- 00:30:37at the same time and how many different
- 00:30:39websites I should actually go and read
- 00:30:40their content right so this can really
- 00:30:43save hundreds of hours of everyone's Google
- 00:30:46time if you want to really look into
- 00:30:48certain topics and then on the right
- 00:30:51hand side you can see the bullet
- 00:30:53summaries of how the current model uh
- 00:30:55you know is doing what websites it's browsing
- 00:30:58what sources it's verifying and oftentimes
- 00:31:00it actually cross-validates different
- 00:31:02sources out there uh to make sure the
- 00:31:05answer is actually correct before it
- 00:31:06outputs the final answer and we can you know
- 00:31:08at the same time fire up a few more
- 00:31:10queries um how about you know
- 00:31:13you're a gamer right so uh sure yeah so
- 00:31:16how about what are some of the best
- 00:31:18builds and most popular builds in Path of
- 00:31:20Exile hardcore right hardcore league I
- 00:31:23mean you can technically just look at the
- 00:31:25hardcore
- 00:31:26ladder might be a fast way to figure it
- 00:31:28out yeah we'll see what the model
- 00:31:31does um and then we can also do uh you
- 00:31:35know uh something more fun for
- 00:31:37example um how about like make a
- 00:31:39prediction about the March Madness out
- 00:31:41there yeah so this is kind of a fun one
- 00:31:43where um Warren Buffett has a billion
- 00:31:46dollar bet if you can exactly match the
- 00:31:50I think the the the sort of the entire
- 00:31:53winning tree of March Madness you can
- 00:31:55win a billion dollars from Warren
- 00:31:57Buffett so like would be pretty cool if
- 00:31:59AI could help you win a billion dollars
- 00:32:01from
- 00:32:03Buffett that seems like a pretty good
- 00:32:05investment let's go yeah all right so
- 00:32:08now let's uh fire up the query and uh
- 00:32:11see what the model does so we can actually
- 00:32:13go back to our very first one how about
- 00:32:15Buffett wasn't counting on this it's
- 00:32:18already done that's right okay so we got
- 00:32:20the result of the first one and the model
- 00:32:22thought uh around one minute uh so okay
- 00:32:25so the key insight here the next
- 00:32:27Starship is going to be on the 24th or later
- 00:32:30so no earlier than February
- 00:32:3224th it might be
- 00:32:35sooner so yeah so I think we can you
- 00:32:38know go down so go down what what the
- 00:32:40model does so it does a little research
- 00:32:42on Flight 7 what happened it got
- 00:32:44grounded and actually it looks into the
- 00:32:46FCC filing uh uh you know from its data
- 00:32:51collections uh and then actually makes
- 00:32:54the new conclusion that yeah if we
- 00:32:56continue scroll down uh let's see
- 00:33:00uh uh right yeah so it makes uh the you
- 00:33:05know little table I think uh inside xAI
- 00:33:08we often joked about the time to the
- 00:33:10first table is the only you know latency
- 00:33:14that matters um yeah so that's how the
- 00:33:16model makes inferences and looks up all the
- 00:33:19sources um and then we can look into
- 00:33:22the gaming one so how about the
- 00:33:29right so for this particular one uh we
- 00:33:32look at hey the you know the build is
- 00:33:34light and okay it's kind better so uh
- 00:33:39with the Infernalist but if we go
- 00:33:41down so the surprising fact of all the
- 00:33:44other builds so it looks into the 12
- 00:33:47classes um yeah so we'll see that the
- 00:33:51minion build was pretty popular whenever
- 00:33:53the game first came out and now the the
- 00:33:55Invokers of the world kind of took over
- 00:33:58Invoker Monk Invoker for sure yeah
- 00:34:00that's right yeah followed by the
- 00:34:02Stormweavers and that's really good at mapping
- 00:34:04so yeah and then we can see uh uh the
- 00:34:09the March Madness how about that
- 00:34:13so um one one interesting thing about
- 00:34:16the Deep search is that if you actually
- 00:34:18go into the panel where it shows uh you
- 00:34:21know what are the subtasks you can
- 00:34:23actually click the bottom left of
- 00:34:26this right and then in this case you can
- 00:34:30actually scroll through actually reading
- 00:34:32through the mind of Grok that what
- 00:34:34information does the model actually
- 00:34:36think is trustworthy what is not
- 00:34:38how does it actually cross validate
- 00:34:40different information sources so that
- 00:34:42makes the entire search experience and
- 00:34:44information retrieval process a lot more
- 00:34:46transparent to our
- 00:34:49users and this is much more powerful
- 00:34:51than any search engine out there you can
- 00:34:54literally just tell it only use sources
- 00:34:56from X and you know it will try to respect that
- 00:34:59yeah and so it's much more steerable
- 00:35:00much more intelligent than I mean it
- 00:35:03really should save you a lot of time so
- 00:35:04something that might take you half an
- 00:35:06hour or an hour of researching on the
- 00:35:08web or searching social media you can
- 00:35:10just ask it to go do that and and come
- 00:35:12back 10 minutes later it's done an
- 00:35:14hour's worth of work for you that's
- 00:35:16really what it comes down to exactly and
- 00:35:18and maybe better than you could have
- 00:35:19done it yourself yeah think about you
- 00:35:21have an infinite amount of interns working for you
- 00:35:24now you can just fire up all the tasks
- 00:35:25and come back a minute later um so this
- 00:35:29is going to be interesting one so uh uh
- 00:35:31March Madness has not happened yet so I guess
- 00:35:34we have to follow up with a uh next live
- 00:35:36stream yeah it seems like pretty good
- 00:35:39like $40 might get you a billion dollars
- 00:35:42$40 subscription that's right I mean might
- 00:35:46work so uh yeah so when are the users
- 00:35:49going to have their hands on Grok 3 yes so
- 00:35:52the the good news is we've been working
- 00:35:53tirelessly to actually release um all of
- 00:35:57these features that we've shown you the
- 00:35:59Grok 3 base model with amazing chat
- 00:36:00capabilities that's really useful that's
- 00:36:02really interesting to talk to uh the the
- 00:36:05Deep search the advanced reasoning mode
- 00:36:07all of these things we want to roll them
- 00:36:09out to you today starting with the
- 00:36:12premium plus subscribers on X so it's
- 00:36:14the first group that will initially get
- 00:36:16access make sure to update your X app if
- 00:36:18you want to see all of the advanced
- 00:36:20capabilities because we just released
- 00:36:22the update now as we're as we're talking
- 00:36:24here um and uh yeah if you're interested
- 00:36:27in getting access to Grok then sign up for
- 00:36:29premium plus um and also um we're
- 00:36:32announcing that we're starting a
- 00:36:34separate subscription for Grok that we
- 00:36:35call SuperGrok for those
- 00:36:38real Grok fans that want the most
- 00:36:40advanced capabilities and the earliest
- 00:36:42access to to new features um so feel
- 00:36:45free to check that out as well this this
- 00:36:47is for the dedicated Grok app and for
- 00:36:48the website exactly so our our new
- 00:36:51website is called grok.com yeah and you
- 00:36:53also find you never guess yeah you never
- 00:36:55guess and you can also find our Grok
- 00:36:57app in the iOS App Store and that gives
- 00:37:00you like a more pol even even more
- 00:37:03polished experience that's totally Grok
- 00:37:05focused if you're if you want to have
- 00:37:07Grok you know easily available one tap away
- 00:37:09yeah the version on grok.com on uh you
- 00:37:12know on a web browser is going to be the
- 00:37:14the most the latest and most advanced
- 00:37:15version because obviously takes us a
- 00:37:16while to get thing get something into an
- 00:37:19app and then get it approved by the app
- 00:37:21store so uh and then if that something's
- 00:37:23in a phone format there's limitations
- 00:37:25what you can do so the most powerful
- 00:37:27version of Grok um and the latest version
- 00:37:29will be the the web version at grok.com
- 00:37:31yeah so watch out for the name Grok
- 00:37:333 in the app dead giveaway yeah
- 00:37:36exactly that that's that's the giveaway
- 00:37:37that you have Grok 3 and if it says Grok
- 00:37:392 then Grok 3 hasn't quite arrived for
- 00:37:42you yet but we're working hard to roll this
- 00:37:43out today um and then to even more
- 00:37:46people over the the coming days yeah
- 00:37:48make sure you update your uh phone app
- 00:37:50too um where you're actually going to
- 00:37:52get all the tools we showcase today with
- 00:37:54the thinking mode with the Deep search
- 00:37:57so yeah really looking forward to all
- 00:37:59the feedback you have yeah I think we
- 00:38:02we should uh emphasize that this is kind
- 00:38:04of a beta like meaning that it's you
- 00:38:06should expect some imperfections at
- 00:38:08first um but we will improve it rapidly
- 00:38:11almost every day in fact every day I
- 00:38:13think it'll get better um so if you want
- 00:38:16a more polished version I'd like maybe
- 00:38:18wait a week but uh expect improvements
- 00:38:21literally every day um and then we're
- 00:38:23also going to be uh providing a voice
- 00:38:26interaction so you can have
- 00:38:28conversational in fact I was trying it
- 00:38:29earlier today it's working pretty well
- 00:38:31but it needs a bit more polish
- 00:38:34um the the the sort of way where you can
- 00:38:36just literally talk to it like you're
- 00:38:37talking to a person uh it's that's
- 00:38:40awesome it's actually I think one of the
- 00:38:41best experiences of Grok um but that's
- 00:38:44that's probably about a week
- 00:38:47away yeah so uh with that said um well I
- 00:38:52think we might have some audience
- 00:38:53questions sure yeah okay all right let's
- 00:38:57take a look yeah let's take a look the
- 00:39:00uh the audience from the X platform
- 00:39:05yeah so the first question here is when
- 00:39:08Grok voice assistant when is it coming
- 00:39:10out yeah as as as soon as possible just
- 00:39:13like Elon said uh just a little bit of
- 00:39:15polishing away from being released to
- 00:39:17everybody um obviously it's going to be
- 00:39:19released in an early form and we're
- 00:39:21going to rapidly iterate on that Y and
- 00:39:24the next question is like when will Grok
- 00:39:263 be in the API
- 00:39:28so this is coming in the uh the Grok 3 API
- 00:39:31with both the reasoning models and deep
- 00:39:34search is coming your way in the coming
- 00:39:36weeks uh we're actually very excited
- 00:39:37about the Enterprise use cases of all
- 00:39:39these additional tools that now Grok has
- 00:39:41access to and how the test time compute
- 00:39:43and Tool use can actually really
- 00:39:44accelerate all the business use
- 00:39:46cases um yeah another one is Will voice
- 00:39:50mode be native or text to speech so I
- 00:39:53think that means is it going to be one
- 00:39:55one model that is understanding
- 00:39:57what you say and then talking back to
- 00:39:59you or is it going to be some system
- 00:40:01that has text to speech inside of it and
- 00:40:02the good news is it's going to be one
- 00:40:04model like you know a variant of Grok 3 that
- 00:40:07we're going to release which basically
- 00:40:09understands what you're say what you're
- 00:40:10saying and then uh generates the audio
- 00:40:13you know directly from that um so very much
- 00:40:15like Grok 3 generates text you know that
- 00:40:18model generates audio um and that has a
- 00:40:20bunch of advantages I was talking to it
- 00:40:22earlier today and it said hi Igore you know
- 00:40:25reading my my name from probably from
- 00:40:26some text that it had um and I said no
- 00:40:29no my name is Igor and it remembered
- 00:40:32that you know so it could continue to
- 00:40:34say Igor just like a human would and you
- 00:40:36you can't achieve that with with text to
- 00:40:38speech
- 00:40:39so yeah so oh here's a question for you
- 00:40:42pretty spicy um you um is Grok a boy or
- 00:40:47a girl and are they single Grok is
- 00:40:49whatever you want it to
- 00:40:52be yeah yeah are you
- 00:40:55single yes
- 00:40:58all right shop is open um so honestly
- 00:41:02people are going to fall in love with
- 00:41:03Grok it's like 1,000% probable
- 00:41:08yeah uh the next question will Grok be
- 00:41:10able to transcribe audio into text yes
- 00:41:13so we'll have this capability both the
- 00:41:15app and also the API we found that's
- 00:41:17like Grok should just be your personal
- 00:41:19assistant looking over your shoulder and
- 00:41:21follow you along the way learn
- 00:41:23everything you have learned and really
- 00:41:24help you to understand the world better
- 00:41:26become smarter every
- 00:41:28day yeah I mean the voice mode
- 00:41:31isn't simply it's not just voice to text it
- 00:41:34understands like tone inflection pacing
- 00:41:36everything it's it's wild I mean it's
- 00:41:39like talking to a
- 00:41:41person okay um yep so any plans for
- 00:41:45conversation memory yeah yeah absolutely
- 00:41:49we're working on it right now I really
- 00:41:52forgot that's right um let's see what
- 00:41:57are the other
- 00:42:01ones so what about the you know the DM
- 00:42:06features right so if you have
- 00:42:07personalizations and if you have uh you
- 00:42:10know Grok remembers your previous
- 00:42:13interactions yes should it be one Grok or
- 00:42:16multiple different Groks it's up to you
- 00:42:18you can have one Grok or many
- 00:42:20Groks I suspect people will probably have
- 00:42:23more than one yeah I want to have a Dr
- 00:42:26Grok yeah
- 00:42:27the Grok
- 00:42:29doc that's
- 00:42:31right
- 00:42:33um right cool um so in the past we've
- 00:42:37open sourced Grok 1 right so
- 00:42:40somebody's asking us are we going to do
- 00:42:41that again with Grok 2 yeah I think um
- 00:42:45once Grok our general approach is that we
- 00:42:48will open source the last version when
- 00:42:50the next version is fully out like when
- 00:42:54when Grok 3 is um mature and stable which
- 00:42:57is probably within a few months then
- 00:43:00we'll open source Grok 2 mhm okay so we
- 00:43:04probably have time for one last question
- 00:43:07um what was the most difficult part
- 00:43:09about working on this project I assume
- 00:43:12um Grok 3 and what I'm most excited about
- 00:43:16so I think me looking back you know
- 00:43:19getting the whole model training on 100K
- 00:43:23H100s coherently that's almost like
- 00:43:25battling against the final boss of the
- 00:43:27universe the entropy because at any given
- 00:43:30time you can have a cosmic ray that's
- 00:43:31beaming down and flips a bit in your
- 00:43:33transistor and now the entire gradient
- 00:43:35update if it flips a mantissa bit the
- 00:43:38entire gradient update is out of whack
- 00:43:41and now you have 100,000 of those and
- 00:43:43you have to orchestrate them every time
- 00:43:45any at at any given time any of the GPUs can
- 00:43:48go down yeah I mean it's worth breaking
- 00:43:51down like how were we able to uh get the
- 00:43:53world's most powerful training cluster
- 00:43:55operational within 122 days um because
- 00:43:59we we started off um we we actually
- 00:44:03weren't intending to do a data center
- 00:44:04ourselves we were going to just uh we we
- 00:44:07went to the data center providers and
- 00:44:09said how long would it take to have
- 00:44:11100,000 uh GPUs operating coherently um
- 00:44:15in a single location and we got time
- 00:44:17frames from 18 to 24 months so we're
- 00:44:20like well 18 to 24 months that means losing
- 00:44:23is a certainty so the only option was to
- 00:44:25do it do it ourselves so then if you
- 00:44:27break down the problem I guess I'm doing
- 00:44:29like reasoning here with like makes you
- 00:44:32think um one single chain of thought yeah
- 00:44:35yeah exactly so um well we needed a
- 00:44:37building we can't build a building so we
- 00:44:39must use an existing building um so we
- 00:44:41we looked for um for basically for
- 00:44:44factories that had been um were that
- 00:44:48have been abandoned but the factory was
- 00:44:50in good shape like a company had gone
- 00:44:51bankrupt or something so we found an
- 00:44:52Electrolux factory in memph in Memphis
- 00:44:55that's why it's in Memphis um
- 00:44:57home of Elvis and also one of the oldest
- 00:45:00I think it was the capital of ancient
- 00:45:02Egypt um and it was actually very nice
- 00:45:06factory that I don't know for whatever reason
- 00:45:09that Electrolux had left um and uh that
- 00:45:13that gave us shelter for the computers
- 00:45:15uh then we needed power the we needed um
- 00:45:20at least 120 megawatts at first but the
- 00:45:21building only had 15 megawatts and
- 00:45:23ultimately for 200,000 mega 200,000 GPUs
- 00:45:26we needed a quarter gigawatt so we um initially uh
- 00:45:30leased uh a whole bunch of um generators
- 00:45:34so we have generators on one side of the
- 00:45:35building just one trailer after trailer
- 00:45:38trailer of generators until we can get
- 00:45:40the utility power to to come in um and
- 00:45:42then but then we also need cooling so on
- 00:45:44the other side of the building it was
- 00:45:45just trailer after trailer of of cooling
- 00:45:47so we leased about a quarter of the
- 00:45:49mobile cooling capacity of the United
- 00:45:50States uh on the one other side of the
- 00:45:52building um then we needed to get the
- 00:45:55GPUs all installed and they're all
- 00:45:57liquid cooled so in order to achieve the
- 00:45:59density necessary this is a liquid
- 00:46:01cooled system so we had to get all the
- 00:46:03plumbing for the liquid cooling nobody
- 00:46:05had ever done a liquid cooled uh data
- 00:46:07center at scale so this was an incredibly
- 00:46:11dedicated effort by a very talented team
- 00:46:13to achieve that outcome um and you might think
- 00:46:16now it's going to work nope um the
- 00:46:19the issue is that the the power
- 00:46:21fluctuations for a GPU cluster are
- 00:46:24dramatic so it's it's like a a this
- 00:46:28giant Symphony that is taking place like
- 00:46:30imagine having a symphony with 100,000
- 00:46:34or 200,000 participants in the in the
- 00:46:36symphony and the whole Orchestra will go
- 00:46:38quiet and loud in you know 100
- 00:46:42milliseconds and so this caused massive
- 00:46:44power fluctuations so then um which then
- 00:46:48caused the generators to lose their
- 00:46:49minds and they they weren't expecting
- 00:46:51this so to buffer the power we then uh
- 00:46:55used Tesla Megapacks
- 00:46:57uh to smooth out the power so the
- 00:47:00Megapacks had to be reprogrammed so with
- 00:47:04with xAI working with Tesla we
- 00:47:06reprogrammed the Megapacks to be able
- 00:47:08to deal with these dramatic power fluctu
- 00:47:11fluctuations to smooth out the power so the
- 00:47:13computers could actually run
- 00:47:15properly and
- 00:47:17um that that worked uh quite tricky and
- 00:47:21uh and then but even at that point you
- 00:47:24still have to make the computers all
- 00:47:25communicate effectively so all the
- 00:47:27networking had to be solved and uh
- 00:47:30debugging a bazillion network cables um
- 00:47:35debugging NCCL at 4 in the morning we
- 00:47:38solved it like roughly 4:20 a.m. yes it was
- 00:47:43figured out like there's some well there
- 00:47:45were a whole bunch of issues well one
- 00:47:46there was like a BIOS mismatch BIOS was
- 00:47:49not set up correctly yeah we had uh diffed
- 00:47:54our lspci outputs between two different
- 00:47:57machines one that was working yeah one
- 00:47:59that was not working yeah many many many
- 00:48:02other things I mean yeah exactly this
- 00:48:03would go on for a long time if we
- 00:48:04actually listed all the things but you
- 00:48:06know it's like interesting like it's not
- 00:48:07like oh we just magically made it happen
- 00:48:09you have to break down the problem just
- 00:48:11like Grok does for reasoning into the
- 00:48:13constituent elements and then solve each
- 00:48:14of the constituent elements in order to
- 00:48:17achieve uh a a a coherent training
- 00:48:19cluster in a period of time that is a
- 00:48:22small fraction of what anyone else
- 00:48:24could do it
- 00:48:25in and then once the training cluster was
- 00:48:27up and running and we could use it now
- 00:48:29we had to make sure that it actually
- 00:48:30stays healthy throughout which is its
- 00:48:32own giant Challenge and then we had to
- 00:48:34get every single detail of the training
- 00:48:36right in order to get a Grok 3 level
- 00:48:39model which is actually really really
- 00:48:41hard so um we don't know if there are
- 00:48:43any other models out there that have
- 00:48:45Grok's capabilities but whoever trains a
- 00:48:47model better than Grok has to be extremely
- 00:48:49good at the the science of deep learning
- 00:48:51at every aspect of the engineering um so
- 00:48:54it's it's not so easy to to pull this off
- 00:48:57and this is not going to be the last
- 00:48:58cluster we build and last model we
- 00:49:00train oh yeah we've already we've
- 00:49:02already started work on the next
- 00:49:04cluster which will
- 00:49:06be yeah about five times the power so
- 00:49:09instead of a quarter gigawatt roughly 1.2
- 00:49:13GW I mean what's the what's the Back to the
- 00:49:16Future
- 00:49:17what's the power in do you does like
- 00:49:20the Back to the Future car yeah anyway
- 00:49:23the Back to the Future power car it's
- 00:49:26it's like roughly in that order I think
- 00:49:27um so
- 00:49:30um and you know these will be the sort
- 00:49:33of the GB200/300 cluster once again
- 00:49:37it will be the most powerful training
- 00:49:38cluster in the world so we're not like
- 00:49:41stopping here no and our reasoning model is
- 00:49:43going to continue to improve by accessing
- 00:49:46more tools every day so yeah we're very
- 00:49:48excited to share any of the upcoming
- 00:49:50results with you all yeah the thing that
- 00:49:52keeps us going is basically being able
- 00:49:55to give Grok 3 to you and then seeing
- 00:49:57the usage go up seeing everybody enjoy
- 00:50:00you know Grok that's that's what really gets us
- 00:50:03up in the morning
- 00:50:05so yeah yeah thanks for tuning in thanks
- 00:50:11guys hey Grok what's up can you hear
- 00:50:16me I'm so excited to finally meet you I
- 00:50:19can't wait to chat and learn more about
- 00:50:20each other I'll talk to you soon
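The answer around 00:43:27 mentions a cosmic ray flipping a single bit and throwing a whole gradient update out of whack. As a rough illustration of why one bit can matter so much, the sketch below flips individual bits of a number stored in IEEE 754 float32. It is not xAI's training code: the helper name `flip_bit` is hypothetical, and float32 is an assumption chosen for simplicity, since the talk does not say which number format the gradients used.

```python
import struct


def flip_bit(value: float, bit: int) -> float:
    """Flip one bit (0 = least significant) of value's IEEE 754 float32 encoding."""
    if not 0 <= bit < 32:
        raise ValueError("float32 has bits 0..31")
    # Reinterpret the float32 bytes as an unsigned 32-bit integer.
    (as_int,) = struct.unpack("<I", struct.pack("<f", value))
    # Toggle the requested bit, then reinterpret the bytes as a float32 again.
    (flipped,) = struct.unpack("<f", struct.pack("<I", as_int ^ (1 << bit)))
    return flipped
```

In float32, bits 0 to 22 are the mantissa, bits 23 to 30 the exponent, and bit 31 the sign. For a value of 1.0, `flip_bit(1.0, 0)` changes it only in the seventh decimal place, `flip_bit(1.0, 22)` (the top mantissa bit) gives 1.5, and `flip_bit(1.0, 30)` (the top exponent bit) gives infinity. How badly a flip corrupts an update therefore depends heavily on which bit is hit.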
- Grok 3
- AI model
- reasoning ability
- Deep Search
- data center
- human knowledge
- exploration of the universe
- upgrade
- Grok 2
- technological progress