GPT-4.5 shocks the world with its lack of intelligence...

00:04:16
https://www.youtube.com/watch?v=FW2XOIxaNqg

Sintesi

TLDRThe video critiques the release of OpenAI's GPT 4.5, labeling it as expensive and underwhelming compared to previous models. Despite its claims for natural conversation and a new Vibes Benchmark for creative thinking, it lacks significant improvements, performing poorly in tasks like programming. The host expresses disappointment in the current trajectory of AI development and questions the potential for future advancements, while promoting educational tools for those interested in programming and computer science.

Punti di forza

  • 💰 GPT 4.5 is the most expensive model yet at $75 per million input tokens.
  • 🛑 It fails to outperform existing benchmarks or introduce novel features.
  • 🌊 The model aims to create 'Chill Vibes' in conversations but lacks substantial improvements.
  • 👩‍💻 Despite lower hallucination rates, it still makes many silly mistakes.
  • 🤖 Comparison reveals it performs worse in programming tasks than previous models.
  • 🚨 Concerns are raised about the future capabilities of GPT 5.
  • 📉 OpenAI's valuation may decline without significant advancements.
  • 📚 Viewers are encouraged to enhance their programming skills with resources like Brilliant.

Linea temporale

  • 00:00:00 - 00:04:16

    The excitement around AI has recently diminished following the release of GPT-4.5, which, despite being the most expensive AI model, lacks substantial improvements or novel features. OpenAI's launch was criticized for its lack of engagement, indicated by the absence of CEO Sam Altman at the event. GPT-4.5 is notably more expensive than its predecessors, costing $75 per million input tokens and $150 for output tokens, and is only accessible to Pro users. While it was claimed to have a 'Vibes Benchmark' for measuring creative thinking, user experiences showed it still made basic errors and has a lower performance in coding benchmarks compared to more established models. In light of these issues, there is speculation about OpenAI's ability to maintain its industry lead as they transition to a for-profit model amidst rising competition. The discussion also touched upon the educational opportunities for programmers as AI tools evolve, paired with the promotion of online learning platforms like Brilliant, aimed at empowering users to understand deep learning and AI fundamentals.

Mappa mentale

Video Domande e Risposte

  • Why is GPT 4.5 considered underwhelming?

    It fails to surpass benchmarks, lacks novel capabilities, and is expensive without significant improvements.

  • How much does GPT 4.5 cost?

    It costs $75 per million input tokens and $150 per million output tokens, available to Pro users for $200/month.

  • What are the new features of GPT 4.5?

    It has a 'Vibes Benchmark' aimed at measuring creative thinking and claims to emit 'Chill Vibes' during interactions.

  • How does GPT 4.5 compare to other models in programming?

    It is worse at programming than previous deep learning models and is significantly more expensive.

  • What are the expectations for GPT 5?

    There are concerns it may not show significant improvements and might operate more like a router selecting models based on prompts.

  • What recommendations are given for viewers?

    Viewers are encouraged to enhance their programming skills using educational platforms like Brilliant.

Visualizza altre sintesi video

Ottenete l'accesso immediato ai riassunti gratuiti dei video di YouTube grazie all'intelligenza artificiale!
Sottotitoli
en
Scorrimento automatico:
  • 00:00:00
    it's official the AI hype train just
  • 00:00:01
    went on life support with the
  • 00:00:03
    underwhelming release of GPT 4.5
  • 00:00:06
    yesterday open AI unveiled the most
  • 00:00:08
    expensive AI model ever produced yet it
  • 00:00:10
    fails to crush any benchmarks win any
  • 00:00:12
    awards or offer any novel capabilities
  • 00:00:14
    whatsoever its only real selling point
  • 00:00:16
    is Vibes and is supposed to chat in a
  • 00:00:18
    more natural human-like way don't get me
  • 00:00:20
    wrong it's a good model but not good
  • 00:00:22
    enough to feed the AI hype monster and
  • 00:00:24
    it looks increasingly likely that we're
  • 00:00:26
    not headed into a technological
  • 00:00:28
    singularity but rather a sigmoid of
  • 00:00:30
    sorrow the Sam ultman couldn't even be
  • 00:00:31
    bothered to leave his newborn kid in the
  • 00:00:33
    hospital to show up to the product
  • 00:00:34
    launch and instead send in a bunch of
  • 00:00:36
    interns to demo it and that's crazy
  • 00:00:38
    because we're talking about Orion here
  • 00:00:40
    in 2023 Tech leaders signed a petition
  • 00:00:42
    to stop training big models like this
  • 00:00:44
    mman himself begged the government to
  • 00:00:45
    regulate it and the only thing more
  • 00:00:47
    disappointing than GPT 4.5 is the
  • 00:00:49
    release of the Epstein files in today's
  • 00:00:51
    video we'll find out if we just reach
  • 00:00:52
    the limits of pre-training in generative
  • 00:00:54
    pre-trained Transformers it is February
  • 00:00:56
    28th 2025 and you're watching the code
  • 00:00:59
    report I didn't want to make another
  • 00:01:00
    crappy AI video today but the bat signal
  • 00:01:03
    was triggered anytime an official video
  • 00:01:04
    gets ratioed like this I have no choice
  • 00:01:07
    but to make a video before you
  • 00:01:08
    unsubscribe though I've got an
  • 00:01:09
    interesting postgress video on the way
  • 00:01:11
    the first thing to know about GPT 4.5 is
  • 00:01:13
    that it's extremely expensive if you
  • 00:01:15
    thought Claude was expensive at $15 per
  • 00:01:17
    million tokens GPT 4.5 is five times
  • 00:01:20
    more expensive at $75 per million output
  • 00:01:23
    tokens actually no correction that's
  • 00:01:25
    input tokens it's $150 per million
  • 00:01:28
    output tokens and to chat with it it's
  • 00:01:29
    it's currently only available to the
  • 00:01:31
    $200 per month Pro users I tried it out
  • 00:01:33
    myself and it does seem to emit Chill
  • 00:01:35
    Vibes but the problem is that's highly
  • 00:01:37
    subjective however in the launch open
  • 00:01:39
    aai talked about a new Vibes Benchmark
  • 00:01:42
    that's supposed to measure creative
  • 00:01:43
    thinking the best way to get a f for the
  • 00:01:45
    model is to talk to it so let's jump
  • 00:01:47
    into a demo a lot of people on the
  • 00:01:48
    internet criticize this presentation but
  • 00:01:50
    as an introvert myself I think they did
  • 00:01:52
    a great job in addition it apparently
  • 00:01:53
    has a far lower hallucination rate but
  • 00:01:55
    what I found is that it still makes a
  • 00:01:57
    lot of silly mistakes it's not
  • 00:01:58
    self-aware and has has no idea what gbt
  • 00:02:00
    4.5 even is and says its training cut
  • 00:02:03
    off is October 2023 it was however able
  • 00:02:06
    to tell me how many RS are in Strawberry
  • 00:02:08
    that felt like a huge leap forward but I
  • 00:02:10
    quickly became disappointed when it gave
  • 00:02:11
    me the wrong number of L's in laap paloa
  • 00:02:14
    now when it comes to programming and
  • 00:02:15
    science I didn't even try because we
  • 00:02:17
    already know it's not going to perform
  • 00:02:18
    as well as the deep thinking models like
  • 00:02:20
    03 then to make matters worse on the AER
  • 00:02:22
    polyglot coding Benchmark it's not only
  • 00:02:24
    worse at programming than deep seek but
  • 00:02:26
    also hundreds of times more expensive
  • 00:02:28
    now if you're an Elon Musk hater you'll
  • 00:02:29
    want take a bong rip of copium right now
  • 00:02:31
    because currently xai's Gro is the best
  • 00:02:33
    model in the world that's not my opinion
  • 00:02:35
    it's the opinion of the betting Market
  • 00:02:37
    although by the end of 2025 open AI is
  • 00:02:39
    still the favorite to have the best
  • 00:02:41
    model but its odds are on the decline
  • 00:02:43
    that's problematic for open AI though
  • 00:02:44
    because they're raising billions and
  • 00:02:46
    billions of dollars as they transition
  • 00:02:47
    to for-profit and will need to maintain
  • 00:02:49
    a massive valuation Alman says there is
  • 00:02:51
    no wall and believes they can scale
  • 00:02:53
    these models almost infinitely that's
  • 00:02:55
    assuming he gets trillions of dollars
  • 00:02:56
    from soft Bank in the Saudis to build
  • 00:02:58
    these data centers my theory as an
  • 00:03:00
    unqualified ship poster is that they
  • 00:03:01
    failed to train GPT 5 with any
  • 00:03:03
    significant Improvement despite scaling
  • 00:03:05
    up the number of parameters in compute
  • 00:03:07
    GPT 4.5 is the biggest model they've
  • 00:03:09
    ever created and now they're lowering
  • 00:03:10
    the bar for gbt 5 which ultman described
  • 00:03:13
    a few weeks ago being more like a router
  • 00:03:15
    that automatically chooses the best
  • 00:03:16
    model based on your prompt and that's
  • 00:03:17
    highly disappointing because I was
  • 00:03:19
    expecting to be a post-apocalyptic
  • 00:03:21
    warlord by now the battling robots and
  • 00:03:23
    barbecuing rats over burning garbage
  • 00:03:24
    cans for dinner but instead I live in
  • 00:03:26
    this dystopia where artificial super
  • 00:03:28
    intelligence never comes and nothing
  • 00:03:30
    ever happens but if you're a computer
  • 00:03:31
    science student the plateau is great
  • 00:03:33
    news AI coding tools are incredible but
  • 00:03:35
    they're most useful to real human
  • 00:03:37
    programmers who know what they're doing
  • 00:03:38
    and I don't see that changing anytime
  • 00:03:40
    soon and you can start getting really
  • 00:03:41
    good at programming for free thanks to
  • 00:03:43
    this video sponsor brilliant their
  • 00:03:45
    platform provides interactive Hands-On
  • 00:03:47
    lessons that demystify the complexity of
  • 00:03:49
    deep learning with just a few minutes of
  • 00:03:51
    effort each day you can understand the
  • 00:03:53
    math and computer science behind this
  • 00:03:55
    seemingly magic technology I'd recommend
  • 00:03:57
    starting with python then check out
  • 00:03:59
    their full how large language models
  • 00:04:01
    work course if you really want to look
  • 00:04:02
    under the hood of chat gbt try
  • 00:04:04
    everything brilliant has to offer for
  • 00:04:06
    free for 30 days by going to
  • 00:04:08
    brilliant.org fireship or use the QR
  • 00:04:11
    code on screen this has been the code
  • 00:04:12
    report thanks for watching and I will
  • 00:04:14
    see you in the next one
Tag
  • GPT 4.5
  • OpenAI
  • AI development
  • programming
  • vibes benchmark
  • technology critique
  • AI performance
  • deep learning
  • cost
  • future of AI