Grok 3发布会中文字幕版本|聪明得让人害怕
摘要
TLDR在Grock 3的发布中,XAI团队强调了其在宇宙探索和知识获取上的愿景,并展示了Grock 3在能力上的显著提升。模型的设计灵感来源于对人类和宇宙理解的深刻追求,注重事实和真相的严谨探索。通过建立自有数据中心,团队克服了多重挑战以实现Grock 3的训练,最新的Grock 3表现出色,并通过Blind Test获胜。新功能包括高级推理能力和Deep Search,致力于为用户提供实时准确的信息检索体验,体现了持续改进的潜力。
心得
- 🌌 追求宇宙真理的重要性
- 🚀 Grock 3在各项基准测试中表现出色
- 🛠️ 建立自有数据中心以支持AI训练
- 🧠 加强推理和创造力的能力
- 🔍 Deep Search帮助精确回答问题
- 📈 Grock持续进行优化与改进
- 👾 实时反馈用户使用体验
- 🔋 大规模训练面临的冷却与电力问题
- 🎮 新游戏创作展示AI能力
- 💡 未来AI在各领域的应用展望
时间轴
- 00:00:00 - 00:05:00
在这次GR 3的介绍中,团队的使命是深入了解宇宙的本质,探索宇宙中的一系列基本问题,例如外星人存在与否、生命的意义等。他们表达了追求真理的重要性,并介绍了GR 3的能力提升,感谢团队努力工作的成果。
- 00:05:00 - 00:10:00
团队成员介绍后,详述了GR(Grock)的命名来源,意为“全面而深刻地理解某事”,并强调了同理心的重要性。GR的早期模型(Grock 1)与最新的Grock 3相比,虽然起步不高,但在过去几个月里,GR的能力已经显著提升,特别是在基础设施和团队的努力下。
- 00:10:00 - 00:15:00
讨论了模型的训练进展,GR 1.5和GR 2的发布。随着训练模型的GPU数量的增加,训练能力都得到极大提升。团队通过构建自己的数据中心解决了冷却和电源的问题,成功建立了一个规模庞大的GPU集群。
- 00:15:00 - 00:20:00
GR 3引入了先进的推理能力,并与其他模型进行了盲测比较,GR 3被认为在各项能力上都遥遥领先。此外,团队强调了持续更新和改善的特点,使用户在短时间内就能体验到更好的性能。
- 00:20:00 - 00:25:00
数据中心的不断扩建和GPU的数量激增为GR 3的推出提供了技术支持,展示了通过不断的努力和创新,如何在短时间内构建起世界领先的AI模型训练平台。
- 00:25:00 - 00:30:00
团队演示了GR 3的数学、科学和编程能力,对比了不同版本模型的表现,指出了其在数学、科学知识和编程能力上的出色表现。
- 00:30:00 - 00:35:00
在GR 3的实际应用演示中,团队展示了GR在解决物理问题和游戏设计中的推理能力,以及展示生成的代码,强调了GR在实际应用中的创造力和解决问题的潜力。
- 00:35:00 - 00:40:00
涉及到持续更新GR的性能和功能,团队成员分享了关于GR解决具体问题的能力,提高了在复杂推理任务中的表现,并在特定情境下展示GR如何进行复杂的逻辑推理。
- 00:40:00 - 00:45:00
团队还展示了GR与用户互动的能力,演示了深度搜索新功能,旨在帮助用户解决实际日常问题,并提供比现有搜索引擎更深入的洞察与答案。
- 00:45:00 - 00:50:25
最后,GR 3的发布计划和价格结构清晰展示给广大用户,团队期待用户的反馈和进一步的产品完善。
思维导图
视频问答
Grock 3有什么新功能?
Grock 3具有增强的推理能力和Deep Search功能,能够更好地理解和回答用户提问。
Grock 3比Grock 2提升了多少性能?
Grock 3在性能上比Grock 2提升了十倍以上。
如何访问Grock 3?
首批访问Grock 3的用户将是X平台的Premium Plus订阅者。
Grock 3会开源吗?
一旦Grock 3稳定成熟,就会考虑开源。
Grock的语音助手什么时候上线?
Grock的语音助手预计会在不久后上线,但仍在打磨中.
Deep Search功能的作用是什么?
Deep Search能够深入分析用户提问,提供更加准确的答案和信息,即更高效的搜索引擎体验.
查看更多视频摘要
- 00:00:28X
- 00:00:57for deep V cloud
- 00:01:28standing
- 00:01:58for
- 00:02:28for
- 00:02:56for all right well welcome to the gr 3
- 00:03:00presentation um so the mission of xai
- 00:03:04and Gro is to understand the universe we
- 00:03:07want to understand the nature of the
- 00:03:08universe so we can figure out what's
- 00:03:10going on where are the aliens what's the
- 00:03:12meaning of life how does the universe
- 00:03:13end how did it start all these
- 00:03:15fundamental questions um were driven by
- 00:03:18curiosity about the nature of the
- 00:03:20universe and um that's also what causes
- 00:03:23us to be a maximally truth
- 00:03:27seeking uh AI even if that truth is
- 00:03:31sometimes at odds with what is
- 00:03:32politically correct in order to
- 00:03:35understand the nature of the universe
- 00:03:37you must absolutely rigorously pursue
- 00:03:39truth or you will not understand the
- 00:03:41universe you'll be suffering from some
- 00:03:43amount of delusion or error so that is
- 00:03:46our goal um figure out what's going on
- 00:03:50and uh we're very excited to present
- 00:03:53grock 3 which is we think uh an order of
- 00:03:56magnitude more capable than grock 2 in a
- 00:03:58very short period of time
- 00:04:00and uh that's thanks to uh the hard work
- 00:04:04of an incredible team and um I'm honored
- 00:04:07to work with such a great team and of
- 00:04:09course we'd love to have um some of the
- 00:04:11smartest humans out there join our team
- 00:04:14so uh with that let's let's go hi
- 00:04:18everyone my name is Igor lead
- 00:04:19engineering at XI I'm Jimmy Paul leading
- 00:04:23research I'm Tony working on the
- 00:04:25reasoning Team all right I'm El I don't
- 00:04:28do anything
- 00:04:30I just show up
- 00:04:31occasionally yeah so um like mentioned
- 00:04:34Gro is the tool that we're working on
- 00:04:36Gro is our AI that we're building here
- 00:04:38at XI and we've been working extremely
- 00:04:40hard over the last few months to improve
- 00:04:41grock as much as we can so we can give
- 00:04:43it to all of you so we can give all of
- 00:04:45you access to it um we think it's going
- 00:04:47to be extremely useful do we think it's
- 00:04:49going to be interesting to talk to funny
- 00:04:51really really funny um and um we're
- 00:04:53going to explain to you how we've
- 00:04:54improved gr over the last few months
- 00:04:56we've made quite a jump in in
- 00:04:57capabilities yeah actually we should
- 00:04:59explain maybe also what is why do we
- 00:05:00call it Gro so Gro is a word from um a
- 00:05:04heand novel Stranger in a Strange Land
- 00:05:07um and it's a used by a guy who's who
- 00:05:11was raised on Mars um and the word Gro
- 00:05:14is to sort of fully and profoundly
- 00:05:17understand something that's what the
- 00:05:18word Gro means fully and profoundly
- 00:05:20understand something and empathy is
- 00:05:23important true
- 00:05:26yeah so yeah so uh if we charted xas
- 00:05:30progress uh in the last few months has
- 00:05:33only been 17 months since we started
- 00:05:36kicking off our very first model uh
- 00:05:39grock one was almost like a toy by this
- 00:05:43point only 314 billion parameters and
- 00:05:45now if we PR the progress the time on
- 00:05:49x-axis the performance of favorite
- 00:05:51Benchmark numbers M mlu on the y-axis
- 00:05:54we're literally progressing at
- 00:05:56unprecedent speed across the whole field
- 00:06:00and then we kick off grock 1.5 right
- 00:06:02after grock 1 released after November
- 00:06:052023 and then grock 2 so if you look at
- 00:06:09where the all the performance coming
- 00:06:12from when you have a very correct
- 00:06:14engineering team and all the best AI at
- 00:06:17Talent there only one thing we need is a
- 00:06:20big intelligence comes from big
- 00:06:23cluster so we can reconvert the entire
- 00:06:27progress of xai now replacing the bench
- 00:06:29the y axis to the total amount of
- 00:06:31training flops that is how many gpus we
- 00:06:34can run at any given time to train our
- 00:06:36large language models to compress the
- 00:06:39entire
- 00:06:40internet so after all human all human
- 00:06:43knowledge really that's right yeah
- 00:06:44internet being part of it but it's
- 00:06:46really all human knowledge all
- 00:06:47everything yeah the whole internet fits
- 00:06:49into a USB stick at this point it's like
- 00:06:51all the human tokens yeah that's right
- 00:06:54yeah uh very soon into the real world
- 00:06:57yeah um so we had so much trouble
- 00:07:00actually training Gru back in the days
- 00:07:03uh we kickoff the model around February
- 00:07:07and uh we thought we had a large amount
- 00:07:09of chips but turned out we can barely
- 00:07:11get AK training chips running coherently
- 00:07:14at any given time and we had so many
- 00:07:18Cooling and power issues I think you
- 00:07:21were there in the data center yeah it
- 00:07:23was like really sort of more like 8K
- 00:07:25chps on average at 80% efficiency more
- 00:07:28like like 6,500 effective uh h100s
- 00:07:32training for you know several months but
- 00:07:36now now we're at 100K so yeah that's
- 00:07:39right more than 100K that's right so so
- 00:07:41what's the next step right so after gu 2
- 00:07:45so if we want to continue
- 00:07:47accelerate we have to take the matter
- 00:07:49into our own hands we have to solve all
- 00:07:50the coolings um all the power issues and
- 00:07:54everything yeah so so in April of last
- 00:07:56year Elon decided that really the only
- 00:07:58way for X to succeed for XI to build the
- 00:08:01best AI out there is to build our own
- 00:08:03data center so um we didn't have a lot
- 00:08:06of time that because we wanted to give
- 00:08:07you gr free as quickly as possible so
- 00:08:10really we realized we have to build the
- 00:08:12data center in about four months um it
- 00:08:15turned out it took us 122 days to get
- 00:08:17the first 100K gpus up and running and
- 00:08:20that was a Monumental effort uh to be
- 00:08:22able to do that um it's we believe it's
- 00:08:25the biggest uh fully connected h100
- 00:08:28cluster of its kind um and uh we didn't
- 00:08:30just stop there we actually decided that
- 00:08:32we need to double the size of the
- 00:08:34cluster pretty much immediately if we
- 00:08:36want to build uh the kind of AI that we
- 00:08:38want to build um so we then had another
- 00:08:42phase um which we haven't talked about
- 00:08:44publicly yet so this is the first time
- 00:08:45that we're talking about this uh where
- 00:08:47we doubled the capacity of the data
- 00:08:49center yet again um and that one only
- 00:08:52took us 92 days so we've been able to
- 00:08:55use all of these gpus use all of this
- 00:08:56compute to improve grock in the meantime
- 00:08:59and basically today we're going to
- 00:09:00present you the results of that the the
- 00:09:03fruits that came from that um so let's
- 00:09:07yeah so all the path all the rows leads
- 00:09:09to grock 3 uh 10x more compute more than
- 00:09:1310x really yeah really like maybe 15x
- 00:09:17yep uh compared to our previous
- 00:09:19generation model and gr finished the
- 00:09:22pre-training uh early January um and uh
- 00:09:26then we start you know the model still
- 00:09:28currently training actually so this is a
- 00:09:30little preview of our Benchmark numbers
- 00:09:34so we evaluated gr 3 on you know three
- 00:09:37different categories on General
- 00:09:40mathematical reasonings on general
- 00:09:43knowledge about stem and Science and
- 00:09:46then also on computer science
- 00:09:48coding so Amy uh American Invitational
- 00:09:52math
- 00:09:53examination uh host it you know once a
- 00:09:56year uh and if we evaluate mod
- 00:09:59performance we can see that the gr 3
- 00:10:02across the board is in a league of its
- 00:10:04own even it's little brother gr3 mini is
- 00:10:09reaching the frontier across all the
- 00:10:11other
- 00:10:12competitors so you will say well at this
- 00:10:15point all these benchmarks you're just
- 00:10:18evaluating you know the memorization of
- 00:10:19the textbooks memorization of the GitHub
- 00:10:22repost how about realtime usefulness how
- 00:10:25about we actually use those models in
- 00:10:27our product so what we did instead is we
- 00:10:31actually kicked off a blind test of our
- 00:10:34gr three model code named Chocolate it's
- 00:10:37pretty hot yeah hot chocolate um and uh
- 00:10:41you know been running on this uh
- 00:10:44platform called Cho arena for two weeks
- 00:10:46um I think the entire X platform at some
- 00:10:49point speculated this might be the next
- 00:10:51generation of a AI come me away so uh
- 00:10:56how this CH Arena works is that um it
- 00:10:59strip away the entire product surface
- 00:11:02right it's just raw comparison of the
- 00:11:04engine of those agis the language models
- 00:11:07themselves and place interface where the
- 00:11:09user will submit one single query and
- 00:11:12you get to show two responses you don't
- 00:11:14know which model they come from and in
- 00:11:16end you make the vote so in this blind
- 00:11:18test grock 3 an early version of grock 3
- 00:11:22already reached like 1,400 no other
- 00:11:26models has reached an ELO score had to
- 00:11:28have comparison to all the other models
- 00:11:30at this score and it's not just one
- 00:11:33single category it's, 1400 aggregated
- 00:11:36across all the categories in chb
- 00:11:39capabilities instruction following
- 00:11:41coding so it's number one across the
- 00:11:43board in this blind test and it's it's
- 00:11:45still climbing so we actually to keep
- 00:11:47updating it so it's it's 14,400 above,
- 00:11:501400 in climbing yeah and in fact we
- 00:11:52have a version of the model that we
- 00:11:53think is already much better than the
- 00:11:55one that we tested here yeah we'll see
- 00:11:57you know how how far it gets uh but
- 00:12:00that's the one that we're you know um
- 00:12:02working on or talking about today yeah
- 00:12:04so actually one thing if if you're if
- 00:12:06you're using grock 3 you I think you may
- 00:12:07notice improvements almost every day um
- 00:12:10because we're we're continuously
- 00:12:11improving the model so
- 00:12:13literally even within 24 hours you'll
- 00:12:15see
- 00:12:16improvements yep so but we believe here
- 00:12:20at xai getting the best pre-training
- 00:12:23model is not enough that's not enough to
- 00:12:25build the best AI and the best AI need
- 00:12:28to think like a human
- 00:12:29you to contemplate about all the
- 00:12:31possible
- 00:12:32solutions self-critique verify all the
- 00:12:36solutions backtrack and also think from
- 00:12:39the first principle that's a very
- 00:12:41important capability so we believe that
- 00:12:44as we take the best pre-train model and
- 00:12:47continue training it with reinforcement
- 00:12:49learning it will elicit the additional
- 00:12:52reasoning capabilities that allows the
- 00:12:54model just become so much better and
- 00:12:57scale not just in the training time but
- 00:12:59in the test time as well so we already
- 00:13:02found the model is extremely useful
- 00:13:04internally um for our own engineering
- 00:13:06saving hours of uh time hundreds of
- 00:13:09hours of uh coding time so e you the
- 00:13:12power user of our uh graic reasoning
- 00:13:14model what are some use cases yeah so
- 00:13:16like Jimmy said we've added Advanced
- 00:13:18reasoning capabilities to Grog and we've
- 00:13:20been testing them pretty heavily over
- 00:13:21the last few weeks in order to give you
- 00:13:23a little bit of a taste of what it looks
- 00:13:24like when Gro is solving hard reasoning
- 00:13:27problems so we prepared two little
- 00:13:28problems for you one comes from physics
- 00:13:31and one is actually a game that gr is
- 00:13:32going to write for us um so when it
- 00:13:35comes to the physics problem you know
- 00:13:36what we want gr to do is to plot a
- 00:13:39viable trajectory to do a transfer from
- 00:13:42Earth to Mars and then uh at a later
- 00:13:45point in time a transfer back from Mars
- 00:13:47to Earth um and that requires some know
- 00:13:50some Physics that gr will have to
- 00:13:52understand um so we're going to
- 00:13:53challenge grock you know come up with a
- 00:13:55variable trajectory calculate it and
- 00:13:58then plot for us so we can see it and um
- 00:14:02yeah this is totally unscripted by the
- 00:14:04way this is the that's the entirety of
- 00:14:05the prompt which was we clarify is that
- 00:14:08yeah there's nothing more than that yeah
- 00:14:10exactly this is the gro interface and
- 00:14:12we've typed in this text that you can
- 00:14:14see here generate code for an animated
- 00:14:163D plot of a launch from Earth uh
- 00:14:19landing on Mars and then back to Earth
- 00:14:21at the next launch window um and we've
- 00:14:24not kicked off with the query and you
- 00:14:26can see Gro is thinking so uh part of
- 00:14:29grock's Advanced reasoning capabilities
- 00:14:31are these thinking traces that you can
- 00:14:32see here you can even go inside and
- 00:14:35actually read what Gro is thinking as
- 00:14:37it's going through the problem as it's
- 00:14:38trying to solve it
- 00:14:41um yeah we say like we are doing some
- 00:14:44obscuration of the thinking so that our
- 00:14:46model doesn't get totally copied
- 00:14:48instantly um so there's more to the
- 00:14:51thinking than is displayed uh yeah yeah
- 00:14:56and because this is totally unscripted
- 00:14:58there's actually a chance that grock
- 00:14:59might made a little coding mistake and
- 00:15:01it might not actually work um so um just
- 00:15:04in case we're going to launch two more
- 00:15:06instances of this so if something goes
- 00:15:08wrong we were able to uh to switch to
- 00:15:11those and show you um something that's
- 00:15:14presentable so we're kicking off the
- 00:15:16other two as well um and like I said we
- 00:15:18have a second problem as well um and um
- 00:15:22yeah actually one of the favorite one of
- 00:15:23our favorite activities here at xci is
- 00:15:25having Gro WR games for us um and um not
- 00:15:29just any no uh any old game any game
- 00:15:32that you might already be familiar with
- 00:15:33but actually creating new games on the
- 00:15:35spot and being creative about us um so
- 00:15:38one example that we found was really
- 00:15:40really fun um is create a game that's a
- 00:15:43mixture of the two games Tetris and be
- 00:15:47so this is that maybe an important thing
- 00:15:49like this obviously if you if you ask an
- 00:15:52AI to create a game like Tetris there's
- 00:15:53there are many examples of Tetris on the
- 00:15:55on the Internet or a game like J
- 00:15:58whatever is it can copy it what's
- 00:16:01interesting here is it achieved a
- 00:16:03creative solution combining the two
- 00:16:06games that actually works and and is a
- 00:16:10good game yeah that's the it's cre we're
- 00:16:12seeing the beginnings of
- 00:16:14creativity yeah fingers cross that we
- 00:16:17can recreate that hopefully it works
- 00:16:19yeah embarrassing it so actually because
- 00:16:21this is a bit more challenging we're
- 00:16:23going to use something special here
- 00:16:24which we call Big Brain that's our mode
- 00:16:27in which we use more computation
- 00:16:30reason for just to make there's a good
- 00:16:33chance here that actually might actually
- 00:16:35do it so we also going to fire off know
- 00:16:37three attempts here at at solving this
- 00:16:40game at creating this game that's a
- 00:16:43mixture of know Tetris and
- 00:16:45Bol um yeah let's let's see what Gro
- 00:16:47comes up like I've played the game it's
- 00:16:49pretty good like it's like wow okay this
- 00:16:52is something yeah um so while gr is
- 00:16:55thinking uh in the in the background um
- 00:16:57we can now actually talk about some
- 00:16:59concrete numbers know how how well is gr
- 00:17:01doing across tons of different tasks
- 00:17:03that we've tested it on um so we'll hand
- 00:17:05it over to Tony to talk about that yeah
- 00:17:08okay so let's see how Gro does on those
- 00:17:11interesting challenging benchmarks uh so
- 00:17:14yeah so reasoning again refers to those
- 00:17:16models that actually thinks quite for
- 00:17:19quite a long time before it tries to
- 00:17:21solve a problem so in this case uh you
- 00:17:24know around a month ago the gr 3
- 00:17:26pre-training finishes so after that we
- 00:17:29work very hard to put the reasoning
- 00:17:31capability into the uh current grath 3
- 00:17:34Model but again this is very early days
- 00:17:37so the model is still currently in
- 00:17:39training so right now what we're going
- 00:17:41to show to people is this beta version
- 00:17:43of the gra three reasoning model
- 00:17:45alongside we also are training a mini
- 00:17:48version of the reasoning model so
- 00:17:50essentially on this plot you can see uh
- 00:17:52the grth three reasoning beta and then
- 00:17:54grth three mini reasoning the grth three
- 00:17:56reason mini reasoning is actually a
- 00:17:58model that we train for much longer time
- 00:18:00and you can see that sometimes it
- 00:18:01actually perform slly better compared to
- 00:18:04the gr three reasoning this also just
- 00:18:06means that there's a huge potential for
- 00:18:08the gr three reasoning because it's
- 00:18:10trained for much less time um so all
- 00:18:13right so let's actually look at what how
- 00:18:15how it does on those three benchmarks so
- 00:18:18Jimmy also introduced already so
- 00:18:20essentially we're looking at three
- 00:18:21different areas mathematics science and
- 00:18:24coding um and for math we're picking
- 00:18:27this high school competition math
- 00:18:28problem
- 00:18:29um for science we actually pick those
- 00:18:32PhD level science questions um and for
- 00:18:35coding it's also actually pretty
- 00:18:36challenging it's competitive coding and
- 00:18:38also some uh lead code which is some
- 00:18:40code inter interview problems that
- 00:18:42people usually get when they interview
- 00:18:44for companies so on those benchmarks you
- 00:18:46can see that the gro 3 actually perform
- 00:18:49quite well uh across the board compared
- 00:18:52to other competitors um yeah so it's
- 00:18:55pretty promising these models are very
- 00:18:56smart so Tony what what what are those
- 00:18:59shaded bars yeah so okay so I'm glad you
- 00:19:02asked this question so for those models
- 00:19:05because it can reason it can thinks you
- 00:19:07can also ask them to even think longer
- 00:19:10uh you can spend more what we call test
- 00:19:13and compute which means you can spend
- 00:19:15more time to reason to think about a
- 00:19:18problem before you spit out the answer
- 00:19:21so in this case the Shaded bar here
- 00:19:24means that we just ask the model to
- 00:19:26spend more more time you know you can
- 00:19:28can solve the the same problem many many
- 00:19:30times before it it tries to conclude
- 00:19:33what is the right solution and once you
- 00:19:35give this compute or this this kind of
- 00:19:37budget to the model it turns out the
- 00:19:40model can even perform better so this is
- 00:19:43essentially the Shaded bar in in those
- 00:19:45SPS right so I think this is really
- 00:19:48exciting right because now instead of
- 00:19:50just doing one chain of thoughts with AI
- 00:19:52why not do multiple all at once yes so
- 00:19:55that's a very powerful technique that
- 00:19:56allows to continue scale the model
- 00:19:58capabilities after training um and you
- 00:20:02know people often ask are we actually
- 00:20:04just over fitting to the benchmarks yes
- 00:20:06so how about generalization so yes I
- 00:20:08think uh yeah this is definitely a
- 00:20:11question that we are asking ourselves
- 00:20:13whether we are overfitting to those
- 00:20:14current benchmarks uh luckily uh we have
- 00:20:17a real test so about 5 days ago Amy 2025
- 00:20:22just finished this is where high school
- 00:20:24students compete in this particular
- 00:20:27Benchmark so we got this very fresh new
- 00:20:29competition and then we asked our two
- 00:20:31models to compete on the same Benchmark
- 00:20:34at the same exam and it turns out uh
- 00:20:37very interestingly the grth three
- 00:20:39reasoning the big one um actually does
- 00:20:42uh better um on this particular new
- 00:20:44fresh exam this also means that the
- 00:20:46generalization capability of the big
- 00:20:48model is stronger much stronger compared
- 00:20:51to the smaller model uh if you compare
- 00:20:53to the last year's exam actually this is
- 00:20:55the opposite the smaller model kind of
- 00:20:57learns the uh the the previous exams
- 00:21:00better so yeah so this this actually
- 00:21:02shows some kind of true generalization
- 00:21:04from the model that's right so 17 months
- 00:21:07ago our grock zero and grock one barely
- 00:21:09solves any High School problems that's
- 00:21:11right and now we have a kid that just
- 00:21:14already graduate the gro grock is ready
- 00:21:16to go to college is that right yeah I
- 00:21:18mean it's won't be long before it's
- 00:21:19simply perfect the human exams won't be
- 00:21:22part they be too easy yeah like and
- 00:21:25internally we actually as gret Contin
- 00:21:28evolves
- 00:21:29uh we're going to talk about you know
- 00:21:30what we're excited about but very soon
- 00:21:33there will be no more benchmarks left
- 00:21:35yeah yeah one thing that's quite
- 00:21:38fascinating I think is that we basically
- 00:21:40only trained rocks reasoning abilities
- 00:21:42on math problems and comparative coding
- 00:21:44problems right so very very specialized
- 00:21:47kinds of tasks but somehow it's able to
- 00:21:50work on all kinds of other different
- 00:21:52tasks so including creating games no
- 00:21:55lots lots and lots of different things
- 00:21:57um and what seems to be happening is
- 00:21:58that basically Gro learns this ability
- 00:22:01to detect its own mistakes and its
- 00:22:02thinking correct them persist on a
- 00:22:05problem try lots of different Varian
- 00:22:07pick pick the one that's best so there
- 00:22:08are these generalized generalizing
- 00:22:10abilities that Gro learns from
- 00:22:12mathematics and from coding which it can
- 00:22:14then use to solve all kinds of other
- 00:22:16problems so that's yeah that's pretty I
- 00:22:18mean reality is the instantiation of
- 00:22:21mathematics that's right um and one
- 00:22:23thing we're actually really excited
- 00:22:25about that going back to our fing
- 00:22:26mission is what if one day we have a
- 00:22:29computer just like deep thought that
- 00:22:32utilize our entire cluster just for that
- 00:22:34one very important problem in the test
- 00:22:36time all the GPU turned on right so I
- 00:22:39think we back then we were building the
- 00:22:40GPU clusters together uh you were
- 00:22:42pluging cables and I remember that when
- 00:22:46we turn on the the first initial test
- 00:22:49you can hear all the GPS humming in the
- 00:22:51hallway that's almost feel like
- 00:22:53spiritual yeah that that's actually a
- 00:22:55pretty cool uh thing that we're able to
- 00:22:57do that we can go into the data Center
- 00:22:59and Tinker with the machines there so
- 00:23:01for example we went in and we unplugged
- 00:23:04a few of the cables and just made sure
- 00:23:06that our training setup is still running
- 00:23:08running stably so that's something that
- 00:23:10you know I think most uh AI you know
- 00:23:13teams out there don't usually do but
- 00:23:15it's actually totally unlocks like a new
- 00:23:17level of reliability and what you're
- 00:23:19able to do with with the hardware so
- 00:23:21okay so when when are we going to solve
- 00:23:24remon so uh the easiest solution is to
- 00:23:28numerate over all possible strings and
- 00:23:32as long you have a verifier enough
- 00:23:33compute you'll be able to do it okay my
- 00:23:36projection will be what your guess what
- 00:23:38is your neuronet calculate so my my my
- 00:23:42both prediction so so three years ago I
- 00:23:43told you this I think in now it's uh two
- 00:23:46years uh later two things going to
- 00:23:48happen we're going to see machines win
- 00:23:51some medals yeah that's touring award
- 00:23:53absolutely Fields medal Nobel Prize with
- 00:23:57probably some expert in the loop right
- 00:23:59so the expert uplifting do you mean so
- 00:24:01this year or next
- 00:24:02year oh oh
- 00:24:05okay that's what it comes down to real
- 00:24:07yeah so it looks like grock finished all
- 00:24:10of it thinking on on the two problems so
- 00:24:12let's take a look at what it
- 00:24:15said all right so this was the the
- 00:24:18little physics problem we had um no we
- 00:24:21we've collapsed the thoughts here so
- 00:24:23they're you know they're hidden and then
- 00:24:25we see gr's answer below that so it
- 00:24:27explains it wrote a pyth script here
- 00:24:29using matplot lip then gives us all of
- 00:24:31the code um so let's take a quick look
- 00:24:34at the code you know seems like it's
- 00:24:35doing reasonable things here not not
- 00:24:38totally of the Mark um solve Kepler says
- 00:24:42here so maybe it's solving Kepler's laws
- 00:24:44cap cap law numerically um yeah there's
- 00:24:47really only one way to find out if this
- 00:24:49thing is working I'd say let's let's
- 00:24:51give it a try let's run let's run the
- 00:24:52code all right and we can see um yeah gr
- 00:24:56is animating two different planet Earth
- 00:24:58and Mars here and then the the green
- 00:25:02ball is the the vehicle that's
- 00:25:04transiting the the spacecraft that's
- 00:25:06transitioning between Earth and Mars and
- 00:25:08you you could see the journey from Earth
- 00:25:10to Mars and looks like yeah indeed the
- 00:25:12the astronauts return safely you know at
- 00:25:15the right moment in time um so now
- 00:25:19obviously this was just generated on the
- 00:25:20spot so now we can tell you if that was
- 00:25:23actually correct solution so we're going
- 00:25:24to take a closer look now maybe we're
- 00:25:25going to call some colleagues from space
- 00:25:28X ask them if if this is legit um it's
- 00:25:31pretty close it's it's I mean uh yeah I
- 00:25:35mean there there's a lot of complexities
- 00:25:37in the actual orbits that have to be
- 00:25:39taken into account but this is this is
- 00:25:40pretty close to to what it what looks
- 00:25:42like awesome um in fact I have that on
- 00:25:46my pend here it's got the Earth home and
- 00:25:49transfer on
- 00:25:52it when when are we going to install
- 00:25:54grck on a rocket
- 00:25:58well I suppose in two years two years
- 00:26:02everything is two years away uh well
- 00:26:05Earth and Mars Transit can occurs every
- 00:26:0826 months the next we're currently in a
- 00:26:11Transit window approximately the next
- 00:26:12one would be um November of next year um
- 00:26:18roughly end of next year um and uh if
- 00:26:21all goes well SpaceX will send Starship
- 00:26:24Rockets to Mars and um with Optimus
- 00:26:29robots and
- 00:26:31uh and
- 00:26:34Gro I'm curious what this combination of
- 00:26:37Tetris and B looks like bet Tetris as
- 00:26:41we've named it internally um so okay we
- 00:26:45also have an output from gr here it say
- 00:26:47wrot a python script explains that it's
- 00:26:49what it's been doing if you look at the
- 00:26:51the code know there are some constants
- 00:26:54that are being defined here some colors
- 00:26:56then the the trinos the the the pieces
- 00:26:59of Tetris are there um obviously very
- 00:27:02hard to see at one glance if this is
- 00:27:04good so we got to we got to run this to
- 00:27:07figure out if it's
- 00:27:08working well let's let's give it a
- 00:27:11try fingers crossed all right right so
- 00:27:13this kind of looks like Tetris uh but
- 00:27:16the the colors are a little bit off
- 00:27:18right the colors are different here and
- 00:27:21um I if you think about what's going
- 00:27:24what's going on
- 00:27:25here the has this mechanic where if you
- 00:27:28get three Jews in a row you know then
- 00:27:31they they disappear and also gravity
- 00:27:33activates right so what happens if you
- 00:27:36get three of the colors together oh so
- 00:27:38something happened um so I think I think
- 00:27:41what SC did in this version um is is
- 00:27:45that you know once you connect three at
- 00:27:48least three blocks of the same color in
- 00:27:50a row then um know gravity activates and
- 00:27:55they disappear and then gravity
- 00:27:56activates and all the other blocks fall
- 00:27:57down
- 00:27:59um kind of kind of curious if there's
- 00:28:01still a Tetris mechanic here where if
- 00:28:03the line is full does it actually um
- 00:28:06clear it or what happens then it's up to
- 00:28:10interpretation you know so who who knows
- 00:28:12yeah I mean when it'll do different
- 00:28:14variants when you ask it it doesn't do
- 00:28:16the same thing every time exactly we've
- 00:28:18seen a few other the tetris that worked
- 00:28:20very differently but this one seems cool
- 00:28:23so yeah are we ready for uh game Studio
- 00:28:27at x. yes so we're launching uh an AI
- 00:28:31gaming studio at xci if you're
- 00:28:33interested in joining us and building AI
- 00:28:35games uh please join xai we're launching
- 00:28:38an AI gaming studio we're announcing it
- 00:28:40tonight let's
- 00:28:41go epic games wa that's an actual
- 00:28:45[Laughter]
- 00:28:47game yeah yeah um all right
- 00:28:52so um I think one thing is super
- 00:28:54exciting for us uh is that once you have
- 00:28:58the best pre Trend model you have the
- 00:29:00best reasoning model right so we already
- 00:29:03see that when you actually give the
- 00:29:05capability for those model to think
- 00:29:06harder uh think longer think more broad
- 00:29:10the performance continue improves and
- 00:29:13we're really excited about the next
- 00:29:14Frontier that what happen if would not
- 00:29:17only allow the model to think harder but
- 00:29:18also provide more tools this like call
- 00:29:21real humans to solve those problems for
- 00:29:23real humans we don't ask them to solve
- 00:29:26reman a hypothesis just with a piece of
- 00:29:28pen and paper no internet so with all
- 00:29:33the basic web browsing search engine and
- 00:29:36code interpreters that builds the
- 00:29:39foundations and the best reasoning model
- 00:29:41builds the foundations for the gro agent
- 00:29:44to come um so today we're actually
- 00:29:48introducing a new product called Deep
- 00:29:51search that is the first generation of
- 00:29:54our Gro agents that not just helping the
- 00:29:56engineers and research scientist to do
- 00:29:58coding but actually help everyone to
- 00:30:01answer questions that you have dayto day
- 00:30:03it's a kind of like a next generation of
- 00:30:05search engine that really help you to
- 00:30:07understand the universe so you can start
- 00:30:10asking question like for example hey
- 00:30:12when is the next Starship launch day for
- 00:30:15example um so let's try that if get the
- 00:30:19answer um on the left hand side we see
- 00:30:23uh a high level progress bar essentially
- 00:30:26you know the model just going to do one
- 00:30:28single search like the current rack
- 00:30:30systems but actually thought very deeply
- 00:30:32about hey what's the user intent here
- 00:30:35and what are the facts I should consider
- 00:30:37at the same time and how many different
- 00:30:39website I should actually go and read
- 00:30:40their content right so this can really
- 00:30:43save hundreds hours of everyone's Google
- 00:30:46time if you want to really look into
- 00:30:48certain topics and then on the right
- 00:30:51hand side you can see the bullet
- 00:30:53summaries of how the current model uh
- 00:30:55you know is doing what websites browsing
- 00:30:58what sources verifying and often time
- 00:31:00actually cross validate different
- 00:31:02sources out there uh to make sure the
- 00:31:05answer is actually correct before it's
- 00:31:06output final answer and we can you know
- 00:31:08at the same time fire up a few more
- 00:31:10queries um how about you know you don't
- 00:31:13you're a gamer right so uh sure yeah so
- 00:31:16how about what are some of the best
- 00:31:18builds and most popular builds in path
- 00:31:20Excel hardcore right hardcore League I
- 00:31:23me you can technically just look at the
- 00:31:25hardcore
- 00:31:26ladder might be a fast way to figure it
- 00:31:28out yeah we'll see what model
- 00:31:31does um and then we can also do uh you
- 00:31:35know uh something more fun for
- 00:31:37example um how about like make a
- 00:31:39prediction about the March Madness out
- 00:31:41there yeah so this is kind of a fun one
- 00:31:43where um Warren Buffett has a billion
- 00:31:46dollar bet if you can exactly match the
- 00:31:50I think the the the sort of the entire
- 00:31:53winning tree of marsh Madness you can
- 00:31:55win a billion dollarss from Warren
- 00:31:57Buffett so like would be pretty cool if
- 00:31:59AI could help you win a billion dollars
- 00:32:01from
- 00:32:03Buffett that seems like a pretty good
- 00:32:05investment let's go yeah all right so
- 00:32:08now let's uh fire up the query and uh
- 00:32:11see what model does so we can actually
- 00:32:13go back to our very first one how about
- 00:32:15the buff it wasn't counting on this it's
- 00:32:18already done that's right okay so we got
- 00:32:20the result of the first one and model
- 00:32:22thought uh around one minute uh so okay
- 00:32:25so the key inside here the knock
- 00:32:27Starship is going to be on 24th or later
- 00:32:30so no earlier than February
- 00:32:3224th it might be
- 00:32:35sooner so yeah so I think we can you
- 00:32:38know go down so go down what what the
- 00:32:40model does so it does a little research
- 00:32:42on the flight seven what happen got
- 00:32:44grounded and actually it look into the
- 00:32:46FCC filing uh uh you know from its data
- 00:32:51collections uh and then actually make
- 00:32:54the new conclusion that yeah if we
- 00:32:56continue scroll down uh let's see
- 00:33:00uh uh right yeah so it makes uh the you
- 00:33:05know little table I think uh inside xai
- 00:33:08we often joked about the time to the
- 00:33:10first table is the only you know latency
- 00:33:14that matters um yeah so that's how to
- 00:33:16model make inference and look up all the
- 00:33:19sources um and then we can look into to
- 00:33:22the gaming one so how about the
- 00:33:29right so for this particular one uh we
- 00:33:32look at hey the you know the build is
- 00:33:34light and okay it's kind better so uh
- 00:33:39with the The Infernal is but if we go
- 00:33:41down so the surprising fact of all the
- 00:33:44other builds so it look into the 12
- 00:33:47classes um yeah so we'll see that the
- 00:33:51minum build was pretty popular whenever
- 00:33:53the game first came out and now the the
- 00:33:55invokers of the world kind took over
- 00:33:58invoker monke invoker for sure yeah
- 00:34:00that's right yeah followed by the stor
- 00:34:02wavers and that's really good at mapping
- 00:34:04so yeah and then we can see uh uh the
- 00:34:09the match Madness how about that
- 00:34:13so um one one interesting thing about
- 00:34:16the Deep search is that if you actually
- 00:34:18go into the panel where shows uh you
- 00:34:21know what are the subtasks you can
- 00:34:23actually click the bottom left of
- 00:34:26this right and then in this case you can
- 00:34:30actually scroll through actually reading
- 00:34:32through the mind of Gro that what
- 00:34:34informations does the model actually
- 00:34:36think about are trustworthy what are not
- 00:34:38how does it actually cross validate
- 00:34:40different information sources so that
- 00:34:42makes the entire search experience and
- 00:34:44information retrieval process a lot more
- 00:34:46transparent to our
- 00:34:49users and this is much more powerful
- 00:34:51than any search engine out there you can
- 00:34:54literally just tell it only use sources
- 00:34:56from X you know will try to respect that
- 00:34:59yeah and so it's much more steerable
- 00:35:00much more intelligent than I mean it
- 00:35:03really should save you a lot of time so
- 00:35:04something that might take you half an
- 00:35:06hour or an hour of researching on the
- 00:35:08web or searching social media you can
- 00:35:10just ask it to go do that and and come
- 00:35:12back in 10 minutes later it's done an
- 00:35:14hours worth of work for you that's
- 00:35:16really what it comes down to exactly and
- 00:35:18and maybe better than you could have
- 00:35:19done it yourself yeah think about you
- 00:35:21have INF am of interns working for you
- 00:35:24now you can just fire up all the tasks
- 00:35:25and come back a minute later um so this
- 00:35:29is going to be interesting one so uh uh
- 00:35:31March M had not happened yet so I guess
- 00:35:34we have to follow up with a uh next live
- 00:35:36stream yeah it seems like pretty good
- 00:35:39like $40 might get you a billion dollars
- 00:35:42$40 subscription that's right I mean my
- 00:35:46work so uh yeah so when are the users
- 00:35:49going to have their hands on gr 3 yes so
- 00:35:52the the good news is we've been working
- 00:35:53tirelessly to actually release um all of
- 00:35:57these features that we've shown you the
- 00:35:59groge based model with amazing chat
- 00:36:00capabilities that's really useful that's
- 00:36:02really interesting to talk to uh the the
- 00:36:05Deep search the advanced reasoning mode
- 00:36:07all of these things we want to roll them
- 00:36:09out to you today starting with the
- 00:36:12premium plus subscribers on X so it's
- 00:36:14the first group that will initially get
- 00:36:16access make sure to update your X app if
- 00:36:18you want to see all of the advanced
- 00:36:20capabilities because we just released
- 00:36:22the update now as we're as we're talking
- 00:36:24here um and U yeah if you're interested
- 00:36:27in getting access to gr then sign up for
- 00:36:29premium plus um and also um we're
- 00:36:32announcing that we're starting a
- 00:36:34separate subscription for GR that we
- 00:36:35call Super grock for those who those
- 00:36:38real grock fans that want the most
- 00:36:40advanced capabilities and the earliest
- 00:36:42access to to new features um so feel
- 00:36:45free to check that out as well this this
- 00:36:47is for the dedicated grock app and for
- 00:36:48the website exactly so our our new
- 00:36:51website is called gro.com yeah and you
- 00:36:53also find you never guess yeah you never
- 00:36:55guess and you can also find our grock
- 00:36:57app in the IOS app store and that gives
- 00:37:00you like a more Pol even even more
- 00:37:03polished experience that's totally grock
- 00:37:05focused if you're if you want to have
- 00:37:07grock know easily available one Tap Away
- 00:37:09yeah the version on gro.com on uh you
- 00:37:12know on a web browser is going to be the
- 00:37:14the most the latest and most advanced
- 00:37:15version because obviously takes us a
- 00:37:16while to get thing get something into an
- 00:37:19app and then get it approved by the app
- 00:37:21store so uh and then if that something's
- 00:37:23in a phone format there's limitations
- 00:37:25what you can do so the most powerful
- 00:37:27version of Gro um and the latest version
- 00:37:29will be the the web version at gro.com
- 00:37:31yeah so watch out for the name grock
- 00:37:33free in the app dead giveaway yeah
- 00:37:36exactly that that's that's the giveaway
- 00:37:37that you have gr and if it says gr
- 00:37:39through then gr hasn't quite arrived for
- 00:37:42yet but we're working hard to roll this
- 00:37:43out today um and then to even more
- 00:37:46people over the the coming days yeah
- 00:37:48make sure you update your uh phone app
- 00:37:50too um where you're actually going to
- 00:37:52get all the tools we showcase today with
- 00:37:54the thinking mode with the Deep search
- 00:37:57so yeah really looking forward to all
- 00:37:59the feedbacks you have yeah I think we
- 00:38:02we should uh emphasize that this is kind
- 00:38:04of a beta like meaning that it's you
- 00:38:06should expect some imperfections at
- 00:38:08first um but we will improve it rapidly
- 00:38:11almost every day in fact every day I
- 00:38:13think it'll get better um so if you want
- 00:38:16a more polished version I'd like maybe
- 00:38:18wait a week but uh expect improvements
- 00:38:21literally every day um and then we're
- 00:38:23also going to be uh providing a voice
- 00:38:26interaction so you can have
- 00:38:28conversational in fact I was trying it
- 00:38:29earlier today it's working pretty well
- 00:38:31but not we need these a bit more polish
- 00:38:34um the the the sort of way where you can
- 00:38:36just literally talk to it like you're
- 00:38:37talking to a person uh it's that's
- 00:38:40awesome it's actually I think one of the
- 00:38:41best experiences of gr um but that's
- 00:38:44that's probably about a week
- 00:38:47away yeah so uh with that said um well I
- 00:38:52think we might have some audience
- 00:38:53questions sure yeah okay all right let's
- 00:38:57take a look yeah let's take a look the
- 00:39:00uh the audience from the a platform
- 00:39:05yeah so the first question here is when
- 00:39:08grock voice assistant when is it coming
- 00:39:10out yeah as as as soon as possible just
- 00:39:13like Elon said uh just a little bit of
- 00:39:15polishing away from being released to
- 00:39:17everybody um obviously it's going to be
- 00:39:19released in an early form and we're
- 00:39:21going to rapidly iterate on that Y and
- 00:39:24the next question is like when will Gro
- 00:39:263 be in the API
- 00:39:28so this is coming in the uh the gr 3 API
- 00:39:31with both the reasoning models and deep
- 00:39:34search is coming your way in the coming
- 00:39:36weeks uh we're actually very excited
- 00:39:37about the Enterprise use cases of all
- 00:39:39these additional tools that now Gro has
- 00:39:41access to and how the test time compute
- 00:39:43and Tool use can actually really
- 00:39:44accelerate all the business use
- 00:39:46cases um yeah another one is Will voice
- 00:39:50mode be native or text to speech so I
- 00:39:53think that means is it going to be one
- 00:39:55one model that is understanding
- 00:39:57what you say and then talking back to
- 00:39:59you or is it going to be some system
- 00:40:01that has text of speech inside of it and
- 00:40:02the good news is it's going to be one
- 00:40:04model like not a variant of gr free that
- 00:40:07we're going to release which basically
- 00:40:09understands what you're say what you're
- 00:40:10saying and then uh generates the audio
- 00:40:13no directly from that um so very much
- 00:40:15like Grog free generates text know that
- 00:40:18model generates audio um and that has a
- 00:40:20bunch of advantages I was talking to it
- 00:40:22earlier today and it said hi igore know
- 00:40:25reading my my name from probably from
- 00:40:26some text that it had um and I said no
- 00:40:29no my name is Igor and it remembered
- 00:40:32that you know so it could continue to
- 00:40:34say Igor just like a human word and you
- 00:40:36you can't achieve that with with TX of
- 00:40:38speech
- 00:40:39so yeah so oh here's a question for you
- 00:40:42pretty spicy um you um is grog a boy or
- 00:40:47a girl and how they sing Grog is
- 00:40:49whatever you want it to
- 00:40:52be yeah yeah are you
- 00:40:55single yes
- 00:40:58all right Shop is open um so honestly
- 00:41:02people are going to fall in love with
- 00:41:03crcket since it's like 1,000% probable
- 00:41:08yeah uh the next question will Gro be
- 00:41:10able to transcribe audio into text yes
- 00:41:13so we'll have this capability both the
- 00:41:15app and also the API we found that's
- 00:41:17like gr should just be your personal
- 00:41:19assistant looking over your shoulder and
- 00:41:21follow you along the way learn
- 00:41:23everything you have learned and really
- 00:41:24help you to understand the world better
- 00:41:26become smarter every
- 00:41:28day yeah I mean the voice M doesn't
- 00:41:31isn't simply it's not just voice text it
- 00:41:34understands like tone inflection pacing
- 00:41:36everything it's it's wild I mean it's
- 00:41:39like talking to a
- 00:41:41person okay um yep so any plans for
- 00:41:45conversation memory yeah yeah absolutely
- 00:41:49we're working on it right now I really
- 00:41:52forgot that's right um let's see what
- 00:41:57are the other
- 00:42:01ones so what about the you know the DM
- 00:42:06features right so if you have
- 00:42:07personalizations and if you have uh you
- 00:42:10know Gro remembers your previous
- 00:42:13interactions yes should it be one Gro or
- 00:42:16multiple different grocs it's up to you
- 00:42:18you can have one Gro or many
- 00:42:20GRS I suspect people will probably have
- 00:42:23more than one yeah I want to have a doct
- 00:42:26grock yeah
- 00:42:27the grock
- 00:42:29dog that's
- 00:42:31right
- 00:42:33um right cool um so in the past we've
- 00:42:37open sourced grock one right so
- 00:42:40somebody's asking us are we going to do
- 00:42:41that again with gr to yeah I think um
- 00:42:45once Gro our general approach is that we
- 00:42:48will open source the last version when
- 00:42:50the next version is fully out like when
- 00:42:54when gr 3 is um mature and stable which
- 00:42:57is probably within a few months then
- 00:43:00we'll open source gr too mhm okay so we
- 00:43:04probably have time for one last question
- 00:43:07um what was the most difficult part
- 00:43:09about working on this project I assume
- 00:43:12um grock 3 and what I most excited about
- 00:43:16so I think me looking back you know
- 00:43:19getting the whole model training on 100K
- 00:43:23h100 coherently that's almost like
- 00:43:25battling against the final boss of the
- 00:43:27universe the entropy because any given
- 00:43:30time you can have a cosmic rate that
- 00:43:31beaming down and flip a bit in your
- 00:43:33transistor and now the entire grading
- 00:43:35update if it's fit mantisa bit the
- 00:43:38entire grading update is out of whack
- 00:43:41and now you have 100,000 of those and
- 00:43:43you have to orchestrate them every time
- 00:43:45any at at any given time any of gpus can
- 00:43:48go down yeah I mean it's with breaking
- 00:43:51down like how were we able to uh get the
- 00:43:53world's most powerful training cluster
- 00:43:55operational Within 122 days um because
- 00:43:59we we started off um we we actually
- 00:44:03weren't intending to do a data center
- 00:44:04ourselves we were going to just uh we we
- 00:44:07went to the data center providers and
- 00:44:09said how long would it take to have
- 00:44:11100,000 uh gpus operating coherently um
- 00:44:15in a single location and we got time
- 00:44:17frames from 18 to 24 months so we're
- 00:44:20like well 18 24 months that means losing
- 00:44:23is a certainty so the only option was to
- 00:44:25do it do it ourselves so then if you
- 00:44:27break down the problem I guess I'm doing
- 00:44:29like reasoning here with like makes you
- 00:44:32think um one single chain though yeah
- 00:44:35yeah exactly so um well we needed a
- 00:44:37building we can't build a building so we
- 00:44:39must use an existing building um so we
- 00:44:41we looked for um for basically for
- 00:44:44factories that had been um were that
- 00:44:48have been abandoned but the factory was
- 00:44:50in good shape like a company had gone
- 00:44:51bankrupt or something so we found an
- 00:44:52Electrolux Factory in memph in Memphis
- 00:44:55that's why it's in Memphis um
- 00:44:57home of Alvis and also one of the oldest
- 00:45:00I think it was the capital of ancient
- 00:45:02Egypt um and it was actually very nice
- 00:45:06Factory that I know for whatever reason
- 00:45:09that electrox had left um and uh that
- 00:45:13that gave us shelter for the computers
- 00:45:15uh then we needed power the we needed um
- 00:45:20at least 120 megawatts at first but the
- 00:45:21building only had 15 megawatt and
- 00:45:23ultimately for 200,000 Mega 200,000 gpus
- 00:45:26we needed a 4 gaw so we um initially uh
- 00:45:30leased uh a whole bunch of um generators
- 00:45:34so we have generators on one side of the
- 00:45:35building just one trailer after trailer
- 00:45:38trailer of generators until we can get
- 00:45:40the utility power to to come in um and
- 00:45:42then but then we also need cooling so on
- 00:45:44the other side of the building it was
- 00:45:45just trailer after trailer of of cooling
- 00:45:47so we leased about a quarter of the
- 00:45:49mobile cooling capacity of the United
- 00:45:50States uh on the one other side of the
- 00:45:52building um then we needed to get the
- 00:45:55gpus all installed and they're all
- 00:45:57liquid cooled so in order to achieve the
- 00:45:59density necessary this is a liquid
- 00:46:01cooled system so we had to get all the
- 00:46:03plumbing for the liquid cooling nobody
- 00:46:05had ever done a liquid cooling uh data
- 00:46:07center at scale so this was a incredibly
- 00:46:11dedicated effort by a very talented team
- 00:46:13to achieve that outcome um I may think
- 00:46:16not now it's going to work nope um the
- 00:46:19the issue is that the the power
- 00:46:21fluctuations for a GPU cluster are
- 00:46:24dramatic so it's it's like a a this
- 00:46:28giant Symphony that is taking place like
- 00:46:30imagine having a symphony with 100,000
- 00:46:34or 200,000 participants in the in the
- 00:46:36symphony and the whole Orchestra will go
- 00:46:38quiet and loud in you know 100
- 00:46:42milliseconds and so this caused massive
- 00:46:44power fluctuations so then um which then
- 00:46:48caused the generators to lose their
- 00:46:49minds and they they weren't expecting
- 00:46:51this so to buffer the power we then uh
- 00:46:55used Tesla Mega packs
- 00:46:57uh to smooth out the power so the
- 00:47:00megapacks had to be reprogrammed so with
- 00:47:04with XI we working with Tesla we
- 00:47:06reprogrammed the MEAP packs to be able
- 00:47:08to deal with these dramatic power fluctu
- 00:47:11fluctuations to smooth out the power the
- 00:47:13computers could actually run
- 00:47:15properly and
- 00:47:17um that that worked uh quite tricky and
- 00:47:21uh and then but even at that point you
- 00:47:24still have to make the computers all
- 00:47:25communicate effectively so all the
- 00:47:27networking had to be solved and uh
- 00:47:30debugging Brazilian network cables um a
- 00:47:35debugging nickel at 4: in the morning we
- 00:47:38solved it like roughly 4:20 a.m. yes was
- 00:47:43figured out like there's some well there
- 00:47:45were a whole bunch of issues well one
- 00:47:46there was like a bios mismatch bios was
- 00:47:49not set up correctly yeah we had uh D
- 00:47:54our lspci outputs between two different
- 00:47:57machines one that was working yeah one
- 00:47:59that was not working yeah many many many
- 00:48:02other things I mean yeah exactly this
- 00:48:03would go on for a long time if we
- 00:48:04actually listed all the things but you
- 00:48:06know it's like interesting like it's not
- 00:48:07like oh we just magically made it happen
- 00:48:09you have to break down the problem just
- 00:48:11like gr does for reasoning into the
- 00:48:13constituent elements and then solve each
- 00:48:14of the constituent elements in order to
- 00:48:17achieve uh a a a coherent training
- 00:48:19cluster in a period of time that is a
- 00:48:22small fraction of what anyone else was
- 00:48:24could do it
- 00:48:25in and then on the training cluster was
- 00:48:27up and running and we could use it now
- 00:48:29we had to make sure that it actually
- 00:48:30stays healthy throughout which is its
- 00:48:32own giant Challenge and then we had to
- 00:48:34get every single detail of the training
- 00:48:36right in order to get a gr Free level
- 00:48:39model which is actually really really
- 00:48:41hard so um we don't know if there are
- 00:48:43any other models out there that have
- 00:48:45gr's capabilities but whoever trains a
- 00:48:47model better than gr has to be extremely
- 00:48:49good at the the science of deep learning
- 00:48:51at every aspect of the engineering um so
- 00:48:54it's it's not so easy to to pull this St
- 00:48:57and this is now going to be the last
- 00:48:58cluster we buildt and last Model we
- 00:49:00train oh yeah we've already we've
- 00:49:02already started work on the next
- 00:49:04cluster which will
- 00:49:06be yeah about five times the power so
- 00:49:09instead of a quarter gwatt roughly 1.2
- 00:49:13GW May what's the what's the Back to the
- 00:49:16Future
- 00:49:17wor what's the power in do you does like
- 00:49:20the Back to the Future car yeah anyway
- 00:49:23the Back to the Future power car it's
- 00:49:26it's like roughly in that order I think
- 00:49:27um so
- 00:49:30um and you know these will be the sort
- 00:49:33of the gb200 SL300 clester it once again
- 00:49:37it will be the most powerful training
- 00:49:38clester in the world so we're not like
- 00:49:41stopping here no and our reason model is
- 00:49:43going to continue improve by accessing
- 00:49:46more tools every day so yeah we're very
- 00:49:48excited to share any of the upcoming
- 00:49:50results with you all yeah the thing that
- 00:49:52keeps us going is basically being able
- 00:49:55to give gr free to you and then seeing
- 00:49:57the usage go up seeing everybody enjoy
- 00:50:00no gr that's that's what really gets us
- 00:50:03up in the morning
- 00:50:05so yeah yeah thanks for tuning in thanks
- 00:50:11guys hey Gro what's up can you hear
- 00:50:16me I'm so excited to finally meet you I
- 00:50:19can't wait to chat and learn more about
- 00:50:20each other I'll talk to you soon
- Grock 3
- AI模型
- 推理能力
- Deep Search
- 数据中心
- 人类知识
- 宇宙探索
- 升级
- Grock 2
- 技术进步