AI Generated Videos Just Changed Forever

00:12:02
https://www.youtube.com/watch?v=NXpdyAWLDas

Résumé

TLDRVideoen diskuterer de imponerende fremskridt inden for AI-genererede videoer med introduktionen af OpenAI's nye model, Sora. Denne model kan skabe realistik videoindhold ud fra tekstinput, betydeligt forbedret fra tidligere AI-teknologier, som Will Smiths spaghetti-spisende klip. Sora kræver forståelse for, hvordan lys, materialer og fysik interagerer over tid i videoform. Selvom teknologien er imponerende, er den også skræmmende, da det er nu muligt at lave videoer, der kan forveksles med ægte indhold. Dette har store konsekvenser for mediefremstilling, da AI-genererede videoer kan bruges til kommercielle formål som stock video uden behov for traditionel videolicensering. Der er dog etiske bekymringer om potentialet for misbrug, især i et valgår i USA. OpenAI har inkluderet vandmærker på videoer for at indikere AI-generering, men opfordres til at være forsigtige med anvendelsen af teknologien.

A retenir

  • 🚀 AI-videoteknologien har gjort store fremskridt på kun et år.
  • 🤖 OpenAI's model Sora kan skabe realistiske videoer fra bare tekst.
  • ⚠️ Der er etiske bekymringer angående AI-genererede videoer under valgkampagner.
  • 💡 AI-videoer kan betydeligt reducere behovet for licenseret videomateriale.
  • 🛡️ OpenAI har indført vandmærker for at identificere AI-genererede videoer.
  • 🔍 AI-videoer kan stadig have små unøjagtigheder og glitches.
  • 📈 AI-modeller forbedrer sig i et forbløffende tempo.
  • 🎮 Nogle AI-genererede videoer kan virke mere computeragtige eller spilagtige.
  • 📚 Tidligere videoeksempler, som AI-genererede Will Smith, er meget simple sammenlignet med nu.
  • 💻 AI-modeller som Sora kræver kompleks forståelse af fysik og materialer i video.

Chronologie

  • 00:00:00 - 00:05:00

    Videoen starter med en diskussion om, hvordan AI-genereret video har udviklet sig betydeligt siden den tidlige 'Will Smith spiser spaghetti'-fase. Værten diskuterer lanceringen af OpenAIs nye model, Sora, som kan generere op til et minuts video baseret på tekstinput. Sammenligningen med tidligere AI-modeller som DALL.E fremhæver den teknologiske udvikling. AI skaber nu videoer med realistiske lys, teksturer og bevægelser. Eksempler som en kvinde, der går ned ad en Tokyo-gade viser hvor overbevisende AI kan være, selvom mindre fejl stadig kan ses ved nærmere inspektion.

  • 00:05:00 - 00:12:02

    Videoen fortsætter med yderligere eksempler på Soras evner, såsom en bil på en stejl vejbane og gyldne retrieverhvalpe i sneen. Værten diskuterer også potentialet for AI-genereret indhold til reklamer og præsentationer, hvilket kunne erstatte behovet for stock videooptagelser og derved påvirke professionel footage-licensering. Han bemærker dog, at der stadig er mangler i teknologien, som vist i en video af en bedstemor med mærkeligt udformede hænder. Diskussionen afsluttes med refleksioner over de større konsekvenser og fremtiden for AI-genererede videoer, især hvad det betyder for kreativitet og innovation.

Carte mentale

Mind Map

Questions fréquemment posées

  • Hvad er Sora?

    Sora er en ny model fra OpenAI, der kan generere op til et minuts videoindhold fra tekstinput.

  • Hvordan adskiller Sora sig fra tidligere AI-modeller som DALL-E?

    Sora kan generere videoindhold, hvilket kræver forståelse for bevægelse, lysreflektion, materialer og fysik, hvilket gør det mere komplekst end billedgenererende AI som DALL-E.

  • Hvordan var AI-genererede videoer tidligere?

    Tidligere AI-genererede videoer, såsom dem, der involverer kendte personer som Will Smith, var meget mere primitive og ikke særlig realistiske.

  • Er der nogle typiske fejl i AI-generede videoer?

    Ja, AI-genererede videoer kan have unøjagtigheder, som mærkelige bevægelser, inkonsistente billedrater og forvrængninger i refleksioner.

  • Hvor hurtigt har AI-videoteknologien udviklet sig?

    Udviklingen har været meget hurtig, og de AI-videomodeller, der findes i dag, er langt mere avancerede sammenlignet med, hvad vi havde for blot et år siden.

  • Hvilke etiske overvejelser er der ved AI-genererede videoer?

    Der er bekymringer om, at AI-genererede videoer kan forveksles med ægte indhold, hvilket kan være farligt i kontekster som politiske valg og sociale medier.

  • Hvordan påvirker dette videolicensering?

    AI-genererede videoer kan erstatte behovet for licenseret videooptagelse, da virksomheder kan generere specifikt indhold billigere og hurtigere.

Voir plus de résumés vidéo

Accédez instantanément à des résumés vidéo gratuits sur YouTube grâce à l'IA !
Sous-titres
en
Défilement automatique:
  • 00:00:00
    - [Will] That's hot. (bright upbeat music)
  • 00:00:02
    - All right, so this is simultaneously really impressive
  • 00:00:07
    and really frightening at the same time,
  • 00:00:10
    and it's hitting me in ways that I didn't really expect.
  • 00:00:12
    So do you remember Will Smith eating spaghetti?
  • 00:00:15
    Do you remember when this
  • 00:00:16
    was what AI generated videos looked like?
  • 00:00:18
    Remember when we said, "Okay, this AI stuff is cool and all
  • 00:00:20
    "but clearly there's a long way to go
  • 00:00:22
    "before there's any need for concern."
  • 00:00:25
    Well, welcome to the future people
  • 00:00:27
    because this is also an AI generated video.
  • 00:00:32
    And so is this, completely synthesized out of thin air
  • 00:00:35
    by computers.
  • 00:00:36
    This one too, this is not real.
  • 00:00:39
    Absolutely ridiculous
  • 00:00:40
    how far we've come in literally one year.
  • 00:00:43
    This does feel like another ChatGPT, DALL.E moment for AI.
  • 00:00:49
    And maybe I'm overreacting
  • 00:00:50
    because, okay, I am a video creator,
  • 00:00:53
    so an AI that's actually doing my job,
  • 00:00:56
    maybe that feels a little more threatening
  • 00:00:58
    so I'm particularly impressed by it.
  • 00:00:59
    But also this stuff is really good.
  • 00:01:02
    So today, Sam Altman and OpenAI announced a new model
  • 00:01:06
    called Sora and it can generate
  • 00:01:08
    full up to one minute video clips from just text input.
  • 00:01:13
    So the same way DALL.E was able to understand our text input
  • 00:01:17
    and turn it into a photorealistic
  • 00:01:19
    or stylized image or whatever you want,
  • 00:01:22
    same thing with Sora but now since it's videos,
  • 00:01:25
    it also needs to understand
  • 00:01:28
    how all these things like reflections and textures
  • 00:01:31
    and materials and physics all interact with each other
  • 00:01:35
    over time to make a reasonable looking video.
  • 00:01:37
    And of course, right away, there's a bunch of examples
  • 00:01:39
    on their website that are crazy.
  • 00:01:41
    Now, before I show you these,
  • 00:01:42
    I just need you to keep this in mind,
  • 00:01:44
    you're about to watch a bunch of AI generated videos
  • 00:01:47
    and you know that you're about to watch
  • 00:01:49
    a bunch of AI generated content.
  • 00:01:51
    So your brain, you're already looking for this stuff
  • 00:01:53
    and it's not perfect, you will find imperfections,
  • 00:01:57
    but not everybody who sees AI generated content
  • 00:02:00
    on the internet knows to be looking for that.
  • 00:02:04
    So also keep that in mind.
  • 00:02:06
    This is also the worst that this technology
  • 00:02:08
    is going to be from here on out.
  • 00:02:10
    So, okay, here's one of the videos.
  • 00:02:12
    There's no audio to any of these clips,
  • 00:02:13
    but the prompt for this one
  • 00:02:15
    is a stylish woman walks down a Tokyo street
  • 00:02:18
    filled with warm, glowing, neon and animated city signage.
  • 00:02:22
    She wears a black leather jacket,
  • 00:02:23
    a long red dress and black boots.
  • 00:02:26
    This video is already miles ahead of where we were.
  • 00:02:30
    It has accurate lighting, it has materials,
  • 00:02:33
    it has skin tones, movements,
  • 00:02:36
    even has reflections all over the place.
  • 00:02:38
    Now, of course, if you look at it
  • 00:02:39
    for more than about 10 seconds, very closely,
  • 00:02:42
    there are lots of giveaways.
  • 00:02:43
    Like this dude in the background
  • 00:02:45
    kinda looks like he's gliding in a weird way.
  • 00:02:48
    The frame rates and the reflections in the water
  • 00:02:50
    are for some reason lower than the rest of the video.
  • 00:02:52
    The camera movement overall is just a bit inconsistent
  • 00:02:54
    and it just, I don't know,
  • 00:02:56
    it just kinda feels a little bit off.
  • 00:02:58
    But then again, this is where we were one year ago.
  • 00:03:02
    So just keep that in the back of your head for all this.
  • 00:03:05
    Okay, how about this one?
  • 00:03:06
    This is another one which has a long prompt
  • 00:03:08
    about a camera following behind a white vintage SUV
  • 00:03:12
    with a black roof rack as it's speeds up a steep dirt road.
  • 00:03:16
    This is also, again, really good.
  • 00:03:20
    It kinda looks a little more video gamey
  • 00:03:21
    because of how rock solid the drone footage is,
  • 00:03:24
    but clearly very usable.
  • 00:03:27
    Here's another one,
  • 00:03:28
    a litter of golden retriever puppies playing in the snow.
  • 00:03:31
    Their heads pop in and out of the snow covered in it,
  • 00:03:34
    it's so good.
  • 00:03:35
    It feels like the physics of the fur and the ears
  • 00:03:37
    and everything with the snow flying around in slow motion
  • 00:03:41
    is incredible.
  • 00:03:42
    I've looked through all of the sample videos
  • 00:03:43
    on OpenAI's website,
  • 00:03:45
    and clearly these are the handpicked best ones
  • 00:03:48
    that they chose to share
  • 00:03:49
    where they just put in some text
  • 00:03:50
    and then get a video and don't modify it.
  • 00:03:52
    But there's really impressive stuff in there.
  • 00:03:54
    Some of it has humans, some of it doesn't.
  • 00:03:56
    Some of it is more realistic feeling
  • 00:03:58
    like the truck driving one,
  • 00:03:59
    but some of them are more video gamey or more stylized.
  • 00:04:02
    A lot of it is slow motion,
  • 00:04:03
    I just have to say how insanely fast
  • 00:04:06
    these models are improving is genuinely,
  • 00:04:09
    like that's the shocking part.
  • 00:04:11
    Like I remember not even that many months ago, DALL-E 3,
  • 00:04:15
    really, really high end,
  • 00:04:16
    and you could always still find something off about it.
  • 00:04:19
    Like especially if you ask it
  • 00:04:20
    for something like a photorealistic image of a human,
  • 00:04:25
    something about like the hands or the ears
  • 00:04:28
    would always just be a little bit off,
  • 00:04:30
    nevermind the physics.
  • 00:04:31
    But even this video here is crazy at first glance.
  • 00:04:36
    The prompt for this AI generated video
  • 00:04:37
    is a young man in his 20s
  • 00:04:39
    is sitting on a piece of a cloud in the sky reading a book.
  • 00:04:43
    This one feels like 90% of the way there for me.
  • 00:04:47
    Like it's beyond the uncanny valley
  • 00:04:49
    of like apple's personas,
  • 00:04:51
    which are actually based on humans.
  • 00:04:53
    This is a made up person.
  • 00:04:54
    I mean, his eyes are kinda weird,
  • 00:04:56
    and the motion of the pages in the book are kinda odd.
  • 00:04:59
    And yeah, obviously, he's in a cloud and that's a giveaway
  • 00:05:01
    but like, the lighting and the shadows and the skin tones
  • 00:05:05
    and then all the realism of the textures on the shirt
  • 00:05:07
    and the way the shirt and the pants move and the hair,
  • 00:05:09
    they're all really impressive.
  • 00:05:11
    And then for this one,
  • 00:05:12
    they typed in a movie trailer
  • 00:05:15
    featuring the adventurers of the 30-year old spaceman
  • 00:05:19
    wearing a red wool knitted motorcycle helmet, blue sky,
  • 00:05:23
    salt desert, cinematic style, shot on 35 millimeter film.
  • 00:05:28
    And the closeups of his face, the fabrics on the helmet,
  • 00:05:33
    the film grain through every shot and the cinematic style,
  • 00:05:36
    this is one of the most convincing AI generated videos
  • 00:05:40
    I've ever seen, minus maybe the weird physics
  • 00:05:42
    of that dude walking kind of in fast motion.
  • 00:05:45
    So Sam Altman, if you follow him on Twitter,
  • 00:05:46
    he's going through a whole bunch more
  • 00:05:47
    of like people's requests
  • 00:05:49
    and posting a bunch more generated videos.
  • 00:05:51
    And so if you wanna check out his profile,
  • 00:05:53
    you can see those.
  • 00:05:53
    But here's the thing about these AI generated videos now,
  • 00:05:58
    as good as they've gotten to this point,
  • 00:06:01
    they can and will pass as real videos
  • 00:06:06
    to people who are not looking for AI generated videos.
  • 00:06:09
    Now that is obviously insanely sketchy
  • 00:06:13
    during an election year in the U.S.
  • 00:06:14
    and also terrifying
  • 00:06:15
    for a bunch of other internet related reasons
  • 00:06:17
    but it's also perfect for stock footage.
  • 00:06:22
    Like there are already all kinds of presentations
  • 00:06:25
    and advertisements and then PowerPoints
  • 00:06:28
    that are in need of oddly specific stock videos.
  • 00:06:33
    And these AI generated videos are already good enough
  • 00:06:38
    to 100% pass for that purpose.
  • 00:06:40
    Like look at this one, this one with the waves
  • 00:06:42
    at Big Sur, this drone shot.
  • 00:06:45
    Honestly, if I saw this on Twitter,
  • 00:06:47
    I wouldn't even think twice.
  • 00:06:48
    I'd be like, "Oh, nice drone shot, dude."
  • 00:06:50
    Wouldn't even think about AI if I wasn't pixel peeping
  • 00:06:53
    at like the way the water was moving.
  • 00:06:55
    Like this is a totally usable video in an ad
  • 00:06:59
    for some California based product.
  • 00:07:01
    And that has all sorts of implications for the drone pilot
  • 00:07:05
    that no longer needs to be hired,
  • 00:07:07
    for all the photographers and videographers
  • 00:07:09
    whose footage no longer needs to be licensed
  • 00:07:12
    to show up in that ad that's being made.
  • 00:07:14
    It's already that good.
  • 00:07:16
    There's other stuff like this wall of TVs,
  • 00:07:19
    which would be a totally expensive and difficult thing
  • 00:07:21
    to shoot with a camera and all these old expensive props,
  • 00:07:25
    but if you can just generate it this well with reflections
  • 00:07:29
    and the environment and everything else around it,
  • 00:07:32
    I mean, why do it any other way?
  • 00:07:33
    It's also very capable of historical themed footage.
  • 00:07:36
    So this is supposed to be California during the Gold Rush.
  • 00:07:40
    It's AI generated but it could totally pass
  • 00:07:43
    for the opening scene in an old Western
  • 00:07:45
    with the right music over it.
  • 00:07:46
    How long until an entire ad,
  • 00:07:47
    every single shot is completely generated with AI?
  • 00:07:51
    Or what about an entire YouTube video or an entire movie?
  • 00:07:54
    I'm tempted to say like we're a long way away from that
  • 00:07:56
    because you know, this still has flaws clearly
  • 00:07:59
    and there's no sound,
  • 00:08:00
    and there's a long way to go with the prompt engineering
  • 00:08:02
    to iron these things out.
  • 00:08:04
    But then again, the spaghetti was like a year ago.
  • 00:08:09
    Now actually like that OpenAI, on their website,
  • 00:08:11
    they show some of the downfalls too
  • 00:08:13
    of this particular model.
  • 00:08:15
    And because who would know better
  • 00:08:16
    than the people who have been using it?
  • 00:08:18
    This is a very private tool, by the way, right now.
  • 00:08:20
    It's in super limited access,
  • 00:08:21
    so it's in the hands of red teamers,
  • 00:08:23
    which basically means people testing it, pushing the limits,
  • 00:08:26
    trying to break it, and a few trusted creators.
  • 00:08:29
    But they have found plenty of weird edge stuff.
  • 00:08:33
    Like this clip here of a bunch of gray wolf pups
  • 00:08:36
    looks normal at first
  • 00:08:37
    but then it's pretty clear that something's kinda off
  • 00:08:39
    with the way they're just kinda appearing out of nowhere
  • 00:08:42
    and walking through each other.
  • 00:08:44
    That's kinda weird.
  • 00:08:45
    Or this clip of a guy running on a treadmill,
  • 00:08:47
    which I mean, I don't really have to say much more
  • 00:08:49
    about why this one is weird.
  • 00:08:50
    But this is my favorite one, again,
  • 00:08:52
    so again, just try to put yourself in the mind of someone
  • 00:08:54
    who's not expecting AI.
  • 00:08:56
    You're just scrolling through Facebook
  • 00:08:59
    or Twitter or something, right?
  • 00:09:01
    So you just see this video.
  • 00:09:02
    So first I just want you to watch this clip
  • 00:09:03
    as if it's just a stock video you found
  • 00:09:05
    of a grandma celebrating her birthday.
  • 00:09:07
    And just try to think like,
  • 00:09:09
    I wonder what birthday she's celebrating, right?
  • 00:09:11
    I don't know, how old do you think she is?
  • 00:09:12
    60?
  • 00:09:13
    65?
  • 00:09:14
    Maybe it's the big 70.
  • 00:09:15
    She seems to really like that cake.
  • 00:09:17
    Now, did you see it?
  • 00:09:19
    Did you catch that?
  • 00:09:20
    I'm gonna play it again,
  • 00:09:22
    but this time, watch the video
  • 00:09:24
    knowing that AI generated photos and videos
  • 00:09:27
    have trouble accurately doing hands.
  • 00:09:32
    I'll play it again.
  • 00:09:33
    And now it feels super obvious like every time you watch it,
  • 00:09:37
    watch a different set of hands, it gets weirder and weirder.
  • 00:09:40
    You can watch it like five times and there's dead giveaway
  • 00:09:42
    after dead giveaway,
  • 00:09:43
    not even mentioning the weird inconsistencies
  • 00:09:45
    with the direction of the wind on the candles.
  • 00:09:48
    But even as I'm saying all that,
  • 00:09:50
    even as it's coming outta my mouth,
  • 00:09:51
    I can't help but remember that 12 months ago
  • 00:09:54
    we were critiquing this.
  • 00:09:56
    (Will laughing)
  • 00:09:58
    So what does this all mean?
  • 00:10:00
    Well, I mean, there's what it means now
  • 00:10:02
    and there's what it means for the future.
  • 00:10:05
    Now, Sora, this thing that they've made
  • 00:10:08
    is clearly a really impressive video generation AI tool
  • 00:10:12
    that is both going to fool people and also be very useful.
  • 00:10:18
    There's also a watermark in the bottom corner
  • 00:10:21
    of every video generated by it.
  • 00:10:22
    So if you see one of those videos
  • 00:10:23
    and ideally it hasn't been cropped out,
  • 00:10:26
    then that's at least a pretty clear indicator
  • 00:10:27
    that it's AI generated.
  • 00:10:28
    It's a Sora video.
  • 00:10:29
    But also, I do think they're gonna have to be very careful
  • 00:10:33
    with this, they're gonna have a whole bunch of safety stuff
  • 00:10:35
    to keep in mind.
  • 00:10:36
    I think they'll probably have to be even more safe
  • 00:10:37
    than DALL.E.
  • 00:10:38
    Like you shouldn't be able to generate people's likenesses.
  • 00:10:42
    Like you shouldn't be able to make a politician
  • 00:10:43
    look like they're doing something on video,
  • 00:10:45
    especially this year.
  • 00:10:47
    You probably won't be able to make
  • 00:10:48
    Will Smith eating spaghetti,
  • 00:10:50
    but it also definitely means stock video generation
  • 00:10:56
    is absolutely going to take a dent out of video licensing.
  • 00:11:01
    Like I can basically guarantee that.
  • 00:11:02
    Like logistically, why would anyone making something
  • 00:11:05
    pay for footage of a house in the cliffs
  • 00:11:07
    when they can generate one for free
  • 00:11:10
    or for a small subscription price?
  • 00:11:11
    Like that is the real scary part of what this tool implies.
  • 00:11:15
    But in the future, it gets pretty existential, man.
  • 00:11:19
    I mean, okay, if this is trained on all videos
  • 00:11:23
    that have ever been made by humans,
  • 00:11:25
    then surely it can't be innovative or creative
  • 00:11:27
    in ways that humans haven't already been, right?
  • 00:11:33
    I don't know.
  • 00:11:34
    Either way, I'll have all the links below
  • 00:11:35
    for all the Sora stuff, for OpenAI stuff,
  • 00:11:38
    and I guess I'll talk to you next year
  • 00:11:40
    when we look back and go,
  • 00:11:41
    "Remember that first version of Sora
  • 00:11:44
    "and how bad those wolf pups looked
  • 00:11:46
    "when they spawned out of nowhere?"
  • 00:11:48
    Just remember, this is the worst that this technology
  • 00:11:50
    is going to be from here on out.
  • 00:11:53
    Thanks for watching.
  • 00:11:55
    Catch you the next one, peace.
  • 00:11:57
    (bright upbeat music)
Tags
  • AI
  • Video Generation
  • OpenAI
  • Sora
  • AI Ethics
  • Media Production
  • Video Technology
  • Stock Footage