AI Unscripted #003: 🚨 AI’s Biggest Flaw? Why Machine Unlearning is the Future
Résumé
TLDRThis episode delves into AI unlearning with guest Ben Lura, CEO of Herondo. The conversation highlights the necessity for AI models to not only learn but also unlearn data, especially outdated or biased information. Ben explains how his startup develops solutions for enterprises to remove harmful data from their AI systems, thereby enhancing their accuracy and compliance. The episode explores various use cases, challenges associated with bias in AI, and the potential impact of robust unlearning practices on improving organizational AI performance. The discussion emphasizes the ongoing transformation in AI technology and its future implications for industries.
A retenir
- 🤖 AI can 'unlearn' by removing outdated or inaccurate data.
- 📊 Unlearning improves the accuracy and compliance of AI models.
- 🧠 Bias in AI is a critical issue that needs addressing.
- 🏢 Herondo targets enterprises to help them manage AI data.
- 💡 Startups can lead the charge in developing unlearning technologies.
- 📈 Unlearning can save organizations time and resources on retraining.
- 🚗 Use cases include improving autonomous driving AI performance.
- 🥇 Organizations are beginning to recognize the importance of unlearning.
- ⚖️ Ethical concerns in AI rely on effective bias mitigation strategies.
- 📅 The future of AI may heavily involve unlearning techniques.
Chronologie
- 00:00:00 - 00:05:00
The host welcomes viewers back to AI Unscripted, introducing the concept of 'unlearning' in AI, which contrasts with the familiar notion of AI learning.
- 00:05:00 - 00:10:00
Special guest Ben Lura, CEO of Hundo, explains his startup's focus on AI unlearning, allowing models to forget incorrect or outdated data without needing retraining.
- 00:10:00 - 00:15:00
Ben gives an overview of his background and the formation of Hundo, highlighting the importance of correcting inaccuracies in AI data processing for enterprises.
- 00:15:00 - 00:20:00
The conversation shifts to examples of biased AI models, including risks encountered by companies like Amazon and Facebook, leading to the need for AI unlearning solutions.
- 00:20:00 - 00:25:00
They discuss how Hundo targets business-to-business services for data science teams, emphasizing the growing necessity of unlearning in future AI applications and compliance.
- 00:25:00 - 00:30:00
Ben details the technical aspects of Hundo's processes, agreeing that while startups face high pressure to deliver advanced solutions, they must also maintain a focus on existing models.
- 00:30:00 - 00:35:00
The discussion shifts to AI in various industries, explaining how unlearning applies to self-driving models and generative AI in mitigating bias and misinformation.
- 00:35:00 - 00:40:00
Ben shares his perspective on the evolving landscape of AI, asserting that while generative models are gaining attention, necessary improvements in classical AI remain crucial.
- 00:40:00 - 00:45:00
The guests weigh in on the implications of AI on jobs and future employment landscapes, considering how technological advancement can both displace and create new roles.
- 00:45:00 - 00:53:25
The conversation wraps up with reflections on the progress in AI, the acknowledgement of existing biases, and the necessity of 'unlearning' for responsible AI development.
Carte mentale
Vidéo Q&R
What is AI unlearning?
AI unlearning is the process of selectively removing unwanted, outdated, or biased data from AI models, allowing them to 'forget' without needing to be retrained.
Why is unlearning important for AI?
Unlearning is crucial for improving the accuracy and compliance of AI models, helping them avoid biases and inaccuracies that can arise from bad data.
What are some use cases for AI unlearning?
Use cases include improving autonomous driving models by removing mislabeled data, and eliminating sensitive personal information from AI outputs in compliance with regulations.
How does Ben's startup approach unlearning?
Herondo provides a platform that allows organizations to easily identify and remove harmful data from their AI models, improving their performance with minimal friction.
Is AI unlearning currently being practiced in the industry?
Yes, although it is still a developing field, organizations are starting to recognize the value of unlearning, and Bench's startup is pioneering this area.
What challenges does AI unlearning face?
Challenges include ensuring that unlearning is thorough enough that it effectively removes the unwanted data while not causing unintended consequences on the model's performance.
What impact does bias in AI have?
Bias in AI can lead to inaccurate outcomes, reinforce stereotypes, and create ethical concerns, making bias mitigation an essential part of AI development.
Can AI technically 'forget' things?
Yes, through unlearning, AI can be designed to remove specific data points or biases from its memory, but it requires careful execution and validation.
How can businesses benefit from unlearning technologies?
Businesses can improve their AI's compliance with regulations, enhance product safety and accuracy, and reduce the need for extensive data retraining efforts.
What are the future implications for AI unlearning?
As AI continues to evolve, unlearning will become increasingly important for maintaining ethical standards, improving AI reliability, and supporting business needs.
Voir plus de résumés vidéo
Perseverance Mars Rover Landing- Inside Story
The Story of James Simons - Renaissance Technologies & Medallion Fund
7 Study Habits Of Successful & Effective Students 📝
Ultimate REFUTATION of Sola Scriptura
A Literary Analysis of God of War Ragnarok
The Future of AI in B2B SaaS: Insights from Synthesia and Theory Ventures. Hosted by Jason Lemkin
- 00:00:001 2
- 00:00:043 hello and welcome back to AI
- 00:00:07unscripted the show where we dive deep
- 00:00:09into the world of AI unscripted with no
- 00:00:12fluff no BS and with the people that are
- 00:00:14shaping AI today we have a very special
- 00:00:16guest joining the usual Squad of Effie
- 00:00:19and Roy say hi guys hey hey guys how's
- 00:00:22it going wait where's my
- 00:00:24camera I want to look at the camera I
- 00:00:26don't see a camera hi hit the link
- 00:00:30subscribe so today guys we're talking
- 00:00:32about something a little bit
- 00:00:33counterintuitive we all talk about AI
- 00:00:35learning models but what about AI
- 00:00:38unlearning That's crazy that's actually
- 00:00:41crazy that is crazy we're going to talk
- 00:00:43about it because it's interesting I
- 00:00:44actually made some notes
- 00:00:48first so today's guest is Ben Lura the
- 00:00:51CEO of hundu am I saying that correctly
- 00:00:54almost okay hero herondo sorry yep and
- 00:00:57uh basically Ben if you'd like to tell
- 00:01:00was a little bit about your cool startup
- 00:01:02and yourself of course yeah okay so my
- 00:01:04name is Ben I'm the CEO of rundo We're
- 00:01:06the first machine and learning startup
- 00:01:08so basically if you think about AI
- 00:01:09models something that's very obvious and
- 00:01:12inherent to them they learn on data and
- 00:01:15regardless of how much you try to
- 00:01:16perfect the data that they being trained
- 00:01:18on or fine tuned on there's always going
- 00:01:20to be something there that shouldn't be
- 00:01:21it could be inaccurate data
- 00:01:22non-compliant data outdated information
- 00:01:25a non-compliant confidential and similar
- 00:01:28to human beings uh you can can't just
- 00:01:30tell a model to forget right so in
- 00:01:32lectures I present a picture of a pink
- 00:01:34elephant I tell the audience please
- 00:01:36don't remember this pink elephant and
- 00:01:38obviously later on in the conference
- 00:01:40they're like hey Pink Elephant guy
- 00:01:41remember what the
- 00:01:45pink basically that leads to
- 00:01:48non-compliant models inaccurate models
- 00:01:49and so forth we developed a way to undo
- 00:01:52that so to selectively remove the bad
- 00:01:55unwanted data from a models make them
- 00:01:56forget without having to rebuild them
- 00:01:58retrain them and so forth wow that's
- 00:02:01very cool I wonder um would it be more
- 00:02:05of a business use case or could it be
- 00:02:07also if I want the computer to forget
- 00:02:09what I searched yesterday afternoon
- 00:02:11perhaps so I think like any other
- 00:02:13Israeli startup we B2B so we're
- 00:02:15targeting like Enterprises those that
- 00:02:17are you know very wary of adopting these
- 00:02:20new things because of the risk involved
- 00:02:21whether it be a regulatory risk or just
- 00:02:24like the risk of providing inaccurate
- 00:02:25answers biased answers ER revealing some
- 00:02:28information that should be a proprietary
- 00:02:31and disclos only to those who are
- 00:02:33eligible for it yeah um so it's a B2B
- 00:02:36startup targeting data science teams in
- 00:02:38big organizations across multiple V and
- 00:02:41where did you come up with the idea for
- 00:02:42this I mean there's a few famous cases
- 00:02:44right like Amazon for example with the
- 00:02:46HR debacle where it was hiring guys I
- 00:02:49think and the target uh we know you're
- 00:02:51pregnant before you do yeah that was
- 00:02:54Facebook but that was AI yeah there's
- 00:02:57also um what's the house there the house
- 00:03:00property Market I think they start
- 00:03:02valuating houses at like $500 million
- 00:03:05regular houses just their data went wild
- 00:03:08so it's obviously a big problem I should
- 00:03:10get on that platform yeah you need to
- 00:03:12how did you come up with the idea so I
- 00:03:15didn't explain also like about me and
- 00:03:17the team and we'll get to that but
- 00:03:18basically I think like um a lot of
- 00:03:20different startups it's all like a both
- 00:03:23a moment of realization but also like a
- 00:03:24very deep process of looking for the
- 00:03:26right thing the thing that the market
- 00:03:27needs now and even more than that the
- 00:03:30thing that the market will need you know
- 00:03:33as a must in the years to come so a bit
- 00:03:37about my background which is usually
- 00:03:38surprising given this is a deep Tech
- 00:03:40startup so like at the trenches of AI
- 00:03:42infrastructure I'm a non-technical
- 00:03:44person so I come from background both of
- 00:03:46a little bit entrepreneurship I had a
- 00:03:48fintech startup a I co-founded a
- 00:03:50nonprofit that helps Israel is switch
- 00:03:52International higher education but my
- 00:03:54background is more in strategy and
- 00:03:55policy so I was a visiting fellow at
- 00:03:57Oxford for public policy and innovation
- 00:04:01it's alongside the likes of Bill Clinton
- 00:04:03I think who was also roads right yeah
- 00:04:05yeah so one of the first roads callers
- 00:04:07from Israel and before that I was a
- 00:04:08captain rank officer dealing with Israel
- 00:04:10us relations my co-founders or my
- 00:04:12partners in crime are much more from
- 00:04:14like the Deep technical expertise that
- 00:04:17you need for such a thing so Misha or
- 00:04:19Michael R CTO was an award-winning R&D
- 00:04:21officer at the Israeli air forceit
- 00:04:23called off um and he was before that a
- 00:04:26friend and maybe even more importantly
- 00:04:28he was a research secher under the
- 00:04:30supervision of a professor called ODI
- 00:04:33and ODed is our chief
- 00:04:36scientist and back then they worked on
- 00:04:38some a research that has actually
- 00:04:42nothing to do with what we do today but
- 00:04:44they thought about commercializing it
- 00:04:46and uh the story is like they sent me a
- 00:04:48blurb about what uh they're working on
- 00:04:51and they looked for a business lead so
- 00:04:53like someone to take it from the
- 00:04:54Academia into the market I read it and I
- 00:04:57was like okay no clue you guys are
- 00:05:00talking
- 00:05:01about like read it again read it again
- 00:05:03like still can't understand but clearly
- 00:05:06you need someone like me who's more of a
- 00:05:07story teller this before chat GPT could
- 00:05:10have summarized it for you before um and
- 00:05:14then we partnered as a team and and
- 00:05:15started like initially thinking about
- 00:05:18how can we potentially commercialize it
- 00:05:19and started interviewing data teams
- 00:05:21across different
- 00:05:23organizations um and while we found that
- 00:05:26their previous research so like
- 00:05:28interesting and
- 00:05:30the you know people care about it but
- 00:05:32it's more of a nice to have from the
- 00:05:34commercial set of things but we kept
- 00:05:36hearing about these deep deep problems
- 00:05:38in data science te that just kept
- 00:05:40reappearing again and again and it's
- 00:05:42basically the process that I've
- 00:05:43described but when you reach it at the
- 00:05:45end of the cycle right so when when they
- 00:05:47tell us like look we spent all this time
- 00:05:49collecting data curating it validating
- 00:05:52it we built a model it's just not good
- 00:05:55enough to reach production and we don't
- 00:05:57know what to do we're spending like it's
- 00:05:58the most highly paid organ like I wish I
- 00:06:02had aary of PhD a scientist in a
- 00:06:06corporate but then you understanding
- 00:06:08that like they're spending so much time
- 00:06:09just like looking for the needle in the
- 00:06:11H stack both to find the data that got
- 00:06:14the model to underperform and also once
- 00:06:17you found it are you going to like
- 00:06:18restart the process again and you
- 00:06:20mentioned cha PT cha PT so like came to
- 00:06:23our lives pretty much at the same time
- 00:06:25like a month or two after we started
- 00:06:27working together and when we saw saw
- 00:06:30like how this you know drastically
- 00:06:32changes how people perceive AI how
- 00:06:34businesses adopt Ai and also our
- 00:06:38Regulators look at this evolving thing H
- 00:06:40we understood that there is first of all
- 00:06:42that we need to change a concept so we
- 00:06:44got we let go of all of like the techan
- 00:06:47related research we started at Clean
- 00:06:50Slate then formed the company and we
- 00:06:52wanted to solve these deep challenges
- 00:06:54that usually just like go a people skip
- 00:06:58them because it seems
- 00:07:00sometimes even too difficult to solve
- 00:07:02and we're like okay we have some bright
- 00:07:04Minds we're going to dig deeper we're
- 00:07:06going to solve something that's very
- 00:07:08inherent to the challenges of data
- 00:07:09science teams um and Vis A that the
- 00:07:13market is going through this
- 00:07:14transformation so we see it as the
- 00:07:16perfect Stone amazing that's crazy and
- 00:07:19you see it like on a layer above the the
- 00:07:22the in internal systems of the company
- 00:07:25yeah so I will say right now we're
- 00:07:27working with open source uh models I
- 00:07:29didn't mention but it's not only for
- 00:07:30large language models it's also for
- 00:07:32classical AI so non-generative models
- 00:07:34Vision radar spech to text as long as we
- 00:07:37have access to the weights in some use
- 00:07:39cases if we have access to the data
- 00:07:41itself but in llms it's not necessary we
- 00:07:46can do this process and then it's
- 00:07:47basically part of the infrastructure
- 00:07:49layer with organizations that are
- 00:07:51working internally can you give me a use
- 00:07:54case something like dumb it down yes get
- 00:07:57it to my language um okay so two very
- 00:08:00different examples one for
- 00:08:01non-generative AI and one for generative
- 00:08:04AI so let's say that you're working on
- 00:08:06an autonomous driving model right like
- 00:08:09cars are progressing it's a very shaky
- 00:08:11market I'll say but eventually I believe
- 00:08:13we'll get there yeah yeah how this
- 00:08:14process looks like is a let's say that
- 00:08:17you roll out a new kind of camera or a
- 00:08:18new kind of liar you spend a lot of time
- 00:08:21and money on collecting data using this
- 00:08:23all like new uh devices or new
- 00:08:26technologies from different sceneries so
- 00:08:28like you wanted to have a road signs
- 00:08:30with different
- 00:08:31languages different kind of seasons and
- 00:08:33so forth different terrains then in
- 00:08:36these cases you send it to labeling a
- 00:08:38lot of times it's just like happening
- 00:08:39offshore in developing countries
- 00:08:41sometimes by computers themselves and
- 00:08:44then you train models on top of that
- 00:08:45that could understand when is it the
- 00:08:47right time to take a a left turn when do
- 00:08:50you see an object that a car should
- 00:08:51avoid or stop and so forth now during
- 00:08:55the labeling of this process think about
- 00:08:58it it's such a tedious job you have
- 00:08:59millions of objects yeah H the people
- 00:09:02are really really underpaid and they
- 00:09:04don't have much time so it's optimizing
- 00:09:06for quantity over quality and you
- 00:09:09understand that maybe the model fails in
- 00:09:12so like a different specific kind of
- 00:09:14scenario right like it can recognize
- 00:09:16motorbikes but if it's a yellow
- 00:09:17motorbike it just never understands like
- 00:09:20what is the object and then you read in
- 00:09:21the news that a car from Brand X crashed
- 00:09:24into a wall and uh us regulatory
- 00:09:28agencies are in investigating until
- 00:09:30there's a change of of government um a
- 00:09:33lot of times these problems happen
- 00:09:34because of labeling issues so we have
- 00:09:37some public case studies that you can
- 00:09:38check out on our website where we
- 00:09:39detected just like that 10% of one of
- 00:09:43the most trusted data sets for
- 00:09:44autonomous driving included M labels so
- 00:09:47which is a crazy amount right and some
- 00:09:49ridiculous examples where you just see
- 00:09:5120 people each one has a box that says
- 00:09:53car around it instead of pedestrian yeah
- 00:09:56so the first challenge is to find this
- 00:09:58bad data the Second Challenge is are you
- 00:10:01going to retrain them all which is an
- 00:10:03option or what happens if you can
- 00:10:04actually remove it in an instant and
- 00:10:06then improve the accuracy of the model
- 00:10:08from an end to- endend process which
- 00:10:10really just like makes everyone lives
- 00:10:12easier and safer yeah so that's the
- 00:10:15non-generative use case where we use
- 00:10:16unlearning without educating the market
- 00:10:18too much just for the value offering of
- 00:10:21let's increase the accuracy of your
- 00:10:22models instantly in llms we're talking
- 00:10:25now with a big Fortune fire corporate we
- 00:10:28presented them a case that we thought
- 00:10:30will interest them which is removing
- 00:10:31personal sensitive information so let's
- 00:10:33say that they find tun the model on top
- 00:10:35of a database that they had that
- 00:10:37included Social Security numbers right
- 00:10:40that's a major issue both from like what
- 00:10:42happens it's a PR a nightmare if you
- 00:10:45know an LM discloses Regulators will eat
- 00:10:47you alive and it's also a regulatory
- 00:10:50issue H they said great H three weeks
- 00:10:53from now can you present it to our VPS
- 00:10:55but instead of unlearning this sensitive
- 00:10:58information was some you un learned bias
- 00:11:01and we're like okay very different
- 00:11:03challenge very different but we made it
- 00:11:06and in that case a you know you can see
- 00:11:09bias is part of a fairi responsible AI
- 00:11:13maybe some Regulatory Affairs but if I
- 00:11:16put myself in their shoes they're
- 00:11:18thinking about the fact that they just
- 00:11:19want to drisk the adoption of once you
- 00:11:22have a customer facing
- 00:11:23application you must make sure that the
- 00:11:25models act as expected act as the best
- 00:11:28and maybe even better than the best
- 00:11:30customer service agent yeah and bias is
- 00:11:32a big thing I mean I remember even
- 00:11:33watching a Netflix show about Ai and
- 00:11:36bias in AI even before CH GPT came into
- 00:11:39our life the LMS and everything and it
- 00:11:41spoke about how you portray in on the
- 00:11:44internet something like doctors usually
- 00:11:45portray it with a man a white man right
- 00:11:48it create B it creates bias and and it's
- 00:11:52crazy it's not just with corporat it's
- 00:11:53everywhere like it's all over the LMS
- 00:11:55right yeah absolutely and differently to
- 00:11:58the a classic use case that we had in
- 00:12:01mind it's not about a specific data
- 00:12:03point that went wrong right because
- 00:12:05biases exist in our real
- 00:12:07lives can I can I jump in I think I'm
- 00:12:10representing this is for the listeners
- 00:12:11now uh I'm a bit confused or still I
- 00:12:15mean it's it seems highly Technical and
- 00:12:17it is uh and I open conversation let me
- 00:12:20share with you how I saw it and what I
- 00:12:22thought it was and maybe you know maybe
- 00:12:26I got it wrong maybe it's a good example
- 00:12:28to clarify but when you when we talked
- 00:12:29about we also had the preall we talked
- 00:12:31about how I love the term machine
- 00:12:33unlearning uh but you know when you say
- 00:12:35unlearning I felt it's like forgetting
- 00:12:38so you know the classic case that I had
- 00:12:40for machine unlearning is oh sorry sorry
- 00:12:44Joe the case that I have for machine and
- 00:12:46learning is if I'm a publicly traded
- 00:12:48company and one of my sales my senior
- 00:12:51salesman did something or has something
- 00:12:52I want to make it forget now that's on
- 00:12:54the let's call it gry line but there are
- 00:12:56other things you can forget that it's
- 00:12:58wrongful data and if you know for
- 00:13:00example C GPT or the most popular llms
- 00:13:02people query them like they query Google
- 00:13:04so I want to make sure I have the right
- 00:13:06information access by Google you know
- 00:13:09change it to to the chat coming to
- 00:13:11information and that's like crucial
- 00:13:13information about my company about my
- 00:13:15product about my text spe that I'm
- 00:13:16making sure it's right what you said
- 00:13:18about visual computer vision and
- 00:13:19analyzing and self driving this sounds
- 00:13:21like data correction and anomaly
- 00:13:23detection and when you also mentioned
- 00:13:26about doing it when people do it
- 00:13:27overseas to me me personally this seems
- 00:13:30like you're 2016 yeah uh 2025 you know I
- 00:13:34would dump it on an llm sorry I would
- 00:13:36dump it on some sort of a model train it
- 00:13:39a thousand times and then another
- 00:13:41thousand times go public and then train
- 00:13:43it again and again and again but that's
- 00:13:45not what I had in mind when I heard
- 00:13:46unlearning yeah it's like unlearn things
- 00:13:49that can make bad decisions later that's
- 00:13:52how I saw it and so so I think like you
- 00:13:54bring a valid point and like I think
- 00:13:56like I talked twice about uh this topic
- 00:14:00today for us decision not here I'm not
- 00:14:03cheting
- 00:14:06withc I think like for us a company in
- 00:14:102025 it seems peculiar sometimes to side
- 00:14:14viewers that were're also targeting
- 00:14:16classic AI it seems like the world has
- 00:14:18forgotten about them like right what
- 00:14:20year is this um it's a different
- 00:14:22conversation that un learning
- 00:14:24specifically but I think like one thing
- 00:14:26that the bubble bursting in 2021 and
- 00:14:29it's aftermath has t us is so like a lot
- 00:14:32of startups are building just for the
- 00:14:34future that those are a based on a lot
- 00:14:37of underlying assumptions and sometimes
- 00:14:39like they fail to deliver before that
- 00:14:41future comes to materialized that's
- 00:14:42that's a great sentence that really
- 00:14:44makes sense when also when I think about
- 00:14:46data it's like yeah you can build stuff
- 00:14:49going forward so don't interrupt you it
- 00:14:50just clicked uh but you have to take it
- 00:14:53to you know fix the past so look at the
- 00:14:55past data also yeah and so both like in
- 00:14:58Ai and like as a business decision we
- 00:15:00felt like we want to be there where all
- 00:15:02of the hype is where it's clear that the
- 00:15:04future of the industry is relying on
- 00:15:06which is generative Ai and like I
- 00:15:08definitely don't uh you know like object
- 00:15:11what you said in this but for us if you
- 00:15:14look at a lot of different Industries we
- 00:15:16talked about Healthcare so medical
- 00:15:18diagnosis a autonomous driving industry
- 00:15:204.0 defense like something that's very
- 00:15:23present in our lives in Israel all the
- 00:15:25are classic models classic meaning like
- 00:15:28post 2014 is that are still very very
- 00:15:32much relevant to the industries a lot of
- 00:15:34cash is being burnt on them and the same
- 00:15:38old problems since 2014 now we're bring
- 00:15:40this new approach where from one end as
- 00:15:43a quite a small startup at this point we
- 00:15:45have like this futuristic stuff
- 00:15:47educating the market coming to these
- 00:15:49sort of events to tell people about by
- 00:15:51the way AI models need to forget and now
- 00:15:53they can forget but at the same time
- 00:15:55talking to a data scientist that has
- 00:15:56been working on maybe like a model that
- 00:15:59has to do with National Defense or with
- 00:16:00autonomous driving and not educating
- 00:16:02them about what is UN learning but just
- 00:16:04telling them look we have a new approach
- 00:16:06that can find the anomalies resolve them
- 00:16:09instantly in the model itself and you
- 00:16:11don't need to care about anything
- 00:16:12besides clicking a button and your
- 00:16:14accuracy could come up 10% more I I just
- 00:16:17had a thought you know it's like you
- 00:16:18have the it's like The Wizard of Oz
- 00:16:20right the sexiness is hidden behind this
- 00:16:22big curtain but then when you go behind
- 00:16:24it there's like uh people working
- 00:16:2624-hour shifts to label things and you
- 00:16:29know the voice of O is actually IID
- 00:16:32about
- 00:16:34start I I actually saw it as therapy for
- 00:16:38data it's like you know psychology
- 00:16:40something like that like it is something
- 00:16:42buried in your past that you know that
- 00:16:45you know like ego or a bad experience or
- 00:16:47bad trauma that can let you move forward
- 00:16:49let's fix that correct that and then
- 00:16:51build a better version of it reminds me
- 00:16:55so so we talked briefly about like me
- 00:16:56being a non-technical person and a
- 00:16:58Technical startup
- 00:16:59I remember one of my conversations with
- 00:17:01a dad he started talking and then like
- 00:17:03you need to go through a process of data
- 00:17:05massaging and I could not tell like
- 00:17:08laughing about me or if this is a
- 00:17:11serious term that people use in
- 00:17:14theformation yeah like if if someone
- 00:17:16deserves a massage here not a DAT are
- 00:17:19you the are you the first company doing
- 00:17:21that uh as as a company did did did
- 00:17:24companies had departments that were
- 00:17:26doing that like how's how the market
- 00:17:28looking okay so so a few basic facts
- 00:17:31maybe about unlearning first of all we
- 00:17:33didn't invent a term since I think like
- 00:17:352015 it's been an academic term a until
- 00:17:38three years ago a lot of data science
- 00:17:40leaders thought it's just an impossible
- 00:17:42challenge that this is science fiction I
- 00:17:43think like Visa our realization Theory
- 00:17:45wasn't in practice the only a real
- 00:17:50serious uh papers about it talked about
- 00:17:52just a different way of training models
- 00:17:54in a much more distributed Manner and
- 00:17:56then you can basically remove a mini
- 00:17:59models so everyone was looking at the
- 00:18:01future but no one was looking at the
- 00:18:02past everybody was say let's let's
- 00:18:04create it better to to work better at
- 00:18:06the future but not correct the fact but
- 00:18:07then the question is is it better or is
- 00:18:09it just more enabling of this thing that
- 00:18:11you have a fixation on which is
- 00:18:12unlearning and I think when you think
- 00:18:15business you think minimum friction like
- 00:18:17I'm not going to tell data science seems
- 00:18:19to change anything about I currently
- 00:18:22build their models otherwise it will
- 00:18:24hurt your feelings yeah no clients
- 00:18:27anymore exactly and LM Rose and I think
- 00:18:29like there was suddenly a a push like
- 00:18:33the It Rose
- 00:18:35back and I think like in 2024 Gartner
- 00:18:39put on learning as one of the four hot
- 00:18:41topics to look at in technology oh G had
- 00:18:45made it and it's mainstream already
- 00:18:46you're too late okay um if you look at
- 00:18:49the research published there's research
- 00:18:51by Microsoft Amazon meta IBM Google
- 00:18:55Google even had like a public data
- 00:18:57science competition around offer us
- 00:18:59solutions for un learning a moment
- 00:19:01before the startup was born and yet
- 00:19:03there's zero products in the market so
- 00:19:05we are the first un learning startup and
- 00:19:08uh
- 00:19:09products but in Academy it's already a
- 00:19:12thing or and you're the only one who
- 00:19:14commercialized it or the first one it's
- 00:19:15a thing existing research is very flawed
- 00:19:18I think like it's a work in progress and
- 00:19:19we are looking at everything that is
- 00:19:21being published you know taking notes
- 00:19:24taking inspiration understanding like
- 00:19:25what do they do better what do we do
- 00:19:27better and how can we a improve but so
- 00:19:30far there's been a lot of challenges and
- 00:19:32also when you think about this act of
- 00:19:33forgetting about the depth of removing
- 00:19:36things like what happens if you a tell
- 00:19:39an Enterprise we remove this social
- 00:19:41security number or this inaccurate
- 00:19:44information and then it's not there
- 00:19:46Microsoft's I think like the most
- 00:19:48published piece around un learning is a
- 00:19:50paper called forgetting har reporter by
- 00:19:53Microsoft research and about two days
- 00:19:56after they released it on hugging face a
- 00:19:59people just got the model to remember
- 00:20:01Harry Potter that's brilliant right so
- 00:20:03like yeah he never existed yeah but then
- 00:20:06like it it did moft came back yeah and
- 00:20:11sort it was like a cliffhanger and then
- 00:20:13like Harry Potter resurfaced like look
- 00:20:15at the Pink Elephant right exactly so
- 00:20:17it's as if the audience still remembers
- 00:20:19the Pink Elephant after the lecture I'm
- 00:20:21very disappointed because I told them to
- 00:20:23forget a so first of all how do you
- 00:20:25actually make it deep enough and
- 00:20:26thorough enough second of all how do you
- 00:20:28validate
- 00:20:29cuz clearly Microsoft validated but it
- 00:20:31wasn't good enough by the way like they
- 00:20:33the reason that they released it
- 00:20:35publicly was in order for like the
- 00:20:36public domain to Red Team it which they
- 00:20:39have done successfully and I think like
- 00:20:41the last and maybe even more challenging
- 00:20:44is how do you make sure that you didn't
- 00:20:45hurt the rest of them all so if you
- 00:20:47think about it as a brain I was about to
- 00:20:48ask you about what's what's the
- 00:20:49butterfly effect of changing the past
- 00:20:52because it's like you know in the the
- 00:20:54Sci-Fi movies right you change the past
- 00:20:55you have a butterfly effect yeah what's
- 00:20:58the butterfly
- 00:20:59I'm collecting more references usually I
- 00:21:01useal of the Spotless Mind or the these
- 00:21:05are chaotic systems with big or Back to
- 00:21:07the Future the classic right
- 00:21:10exactly um so basically like that has
- 00:21:13been like the major I think like a
- 00:21:15holdback for this field of research and
- 00:21:18and then potential product CU there's
- 00:21:22always been a
- 00:21:23trade-off if you erase too much or uh
- 00:21:27you try to eras
- 00:21:29harder to make sure that it isn't daily
- 00:21:31raced then there's peripheral damage how
- 00:21:34how much of these are just like black
- 00:21:35boxes then when you really get down to
- 00:21:37it right like we talked about Wizard of
- 00:21:38Oz and then the curtain and you go
- 00:21:40behind is it a black box
- 00:21:43like yeah yeah when you're kind of
- 00:21:45looking at it from that level above and
- 00:21:48you know the weights you don't know the
- 00:21:49information inside right it's
- 00:21:51interesting because we we can't look but
- 00:21:53he's looking
- 00:21:54inside but I mean like a lot of people
- 00:21:56are looking have access to the same
- 00:21:59right like but you have access to a
- 00:22:01better mind than
- 00:22:02mine but not my mind so it's a little
- 00:22:06bit both because I think both in our
- 00:22:09capabilities and also things that you
- 00:22:11can find out there there's a lot of
- 00:22:13progress around things that could
- 00:22:15explain the behaviors and also
- 00:22:18regulation so like forces now
- 00:22:20Enterprises or foundational model
- 00:22:22developers to disclose more about how
- 00:22:24they build and some do it not because of
- 00:22:26Regulation just because they bring
- 00:22:27better data scientist
- 00:22:32buop releas research to build really
- 00:22:36great
- 00:22:37models but at the same time I think
- 00:22:40explainability is I'm I'm talking to
- 00:22:43someone who a lot of our Technologies
- 00:22:46could be and somewhat also use as
- 00:22:48explainability techniques explainity
- 00:22:50meaning the ability to explain the
- 00:22:52behaviors of models but at the same time
- 00:22:53I think explainability will never be
- 00:22:56perfect we'll never be able to really
- 00:22:59decipher a a direct correlative so like
- 00:23:03a cause and effect in that sense just
- 00:23:06because the better that we build the
- 00:23:08models and the whole point of AI moving
- 00:23:11from like classic algorithms is things
- 00:23:13need to get more complex like models
- 00:23:15need to make better decisions and they
- 00:23:17need to do these like a correlations
- 00:23:19that sometimes are not intuitive for the
- 00:23:22naked eye what we can do is both try our
- 00:23:25best to explain find different mechan Ms
- 00:23:28that are into play and understand what
- 00:23:31comes in first and how to so like
- 00:23:34resolve the issues that come out later
- 00:23:36so one example is um not something
- 00:23:39that's developed uh By Us by the way in
- 00:23:42Transformers um before that at some
- 00:23:45point it was seen as you know a pretty
- 00:23:47big black box but now there's an
- 00:23:49understanding of there's a specific area
- 00:23:51in the weights of models that are based
- 00:23:52on a Transformer architecture that's
- 00:23:55more correlated with storing information
- 00:23:58so it's almost like a knowledge base now
- 00:24:00of course there's more areas that
- 00:24:02include some information not just
- 00:24:03transforming it's actually storing
- 00:24:05things yeah now again I'm talking here
- 00:24:06as the non-technical leader of tech by
- 00:24:09the way Transformers is the T in GPT
- 00:24:11just you know for General know for those
- 00:24:12who don't not sure so that's when we
- 00:24:15think about un learning one of the
- 00:24:16challenges is finding where this coming
- 00:24:19back to not how to remove Behavior but
- 00:24:20how to remove specific data the first
- 00:24:24challenge is okay where is this data
- 00:24:25stored we're not going to spray and pre
- 00:24:28yeah right across the model so that
- 00:24:32understanding that comes from again
- 00:24:33public research that was not done done
- 00:24:35by us gave us like the first clue into
- 00:24:38okay we know this is the area where data
- 00:24:39is stored then the next challenge is
- 00:24:41isolating the specific area which this
- 00:24:44specific information is stored and so
- 00:24:45forth and I think like another cool
- 00:24:47thing in our Technologies is um we
- 00:24:51basically a lot of
- 00:24:53our work is reliant on idea that we call
- 00:24:56data influence which is Tak inspiration
- 00:24:59from a mathematical field called
- 00:25:00influence functions and basically uh we
- 00:25:04use it to explain how each sample that's
- 00:25:07introduced to a model during training
- 00:25:09affects the learning of this model how
- 00:25:12it perceives this sample how this how
- 00:25:14does the model perceive other samples
- 00:25:16after it learns this sample and later on
- 00:25:19where the model makes predictions which
- 00:25:22training samples that it learned in the
- 00:25:23past were the most important ones that
- 00:25:26made them mod decide can youtune then
- 00:25:29all these micro models inside the big
- 00:25:32model to make it more accurate like the
- 00:25:35like the actual it's not the data
- 00:25:37scientist right but the architect of the
- 00:25:40the the system I guess okay so if I'm
- 00:25:44rephrasing so like we talked about
- 00:25:46Transformers and so like how they're
- 00:25:47built yeah and the question is like can
- 00:25:49we build a better
- 00:25:51architecture so that's one of our
- 00:25:53long-term ideas got it again I think
- 00:25:55like we talked about having too many
- 00:25:57things on the play of your room though
- 00:25:59so talking about the next step is is
- 00:26:00quite a lot but so I I I sorry I just
- 00:26:03got a little bit freaked out just night
- 00:26:05because you
- 00:26:06know my work here is done yeah that's it
- 00:26:10okay so robots are going to take over
- 00:26:11one day right are you like at the
- 00:26:13Forefront of protecting us from the AI
- 00:26:16takeover right I actually thought about
- 00:26:18that and also is there a dark side for
- 00:26:21our learning right okay because then you
- 00:26:23can influence data let's talk about an
- 00:26:25interesting and and then it connects to
- 00:26:27the robots because you know yeah you
- 00:26:29know forgetting history is is
- 00:26:32yeah you're never here this is one of
- 00:26:34the like hot topics right like what
- 00:26:37influence it becomes sentient like all
- 00:26:39the rest of it you know and there's
- 00:26:43obvious like very practical like cause
- 00:26:48of a use case for this which is you have
- 00:26:50a car you have maybe governments you
- 00:26:53have like life and death things that
- 00:26:54make a difference to everyone's life
- 00:26:57going on in the background you know like
- 00:26:59these are very clear and it's not
- 00:27:01something that we think about today
- 00:27:03necessarily like we just kind of laugh
- 00:27:05sometimes at chat GPT with the the
- 00:27:08things that gets wrong but like are do
- 00:27:11you see that as like a serious use case
- 00:27:12are you like the frontier Guardian of
- 00:27:15the
- 00:27:16Galaxy bridging
- 00:27:18AI he just Mak things for this is my God
- 00:27:22complex this this real by the way is
- 00:27:24going to get like 500,000 views his deck
- 00:27:27his dech in a physical device is the man
- 00:27:29in Black
- 00:27:31Gadget someone from the future is going
- 00:27:33to come now and
- 00:27:35try sh so they don't have the codes for
- 00:27:38the door yeah so are you the guardian of
- 00:27:40the galaxy for the AI takeover
- 00:27:43world I was not prepared for that
- 00:27:45question I think that's good that's why
- 00:27:48unscripted yeah that the real answer is
- 00:27:50I don't know I think like one day I wake
- 00:27:53up optimistic about like the future of
- 00:27:55Mankind versus the future of AI one day
- 00:27:58coming up waking up a bit more I don't
- 00:28:01know concerned is the right word not
- 00:28:03pessimistic yeah and I feel like in
- 00:28:07Hebrew we say like the prophecy was
- 00:28:08given to fools M right in the case of
- 00:28:12like a Skynet Revolution I think like
- 00:28:14I'll try to be on the good side will un
- 00:28:16learning be solution to everything yeah
- 00:28:19maybe we'll play a a small part are you
- 00:28:21polite when you speak to chat GPT do you
- 00:28:24say please you yes I am no okay I trust
- 00:28:27this guy he knows some of control we
- 00:28:29don't know about I I I'm actually polite
- 00:28:31until they do something like with
- 00:28:32lovable when it gets wrong and deletes
- 00:28:35everything then I'm I'm British I get
- 00:28:37passive
- 00:28:38aggressive that's fine try it's an
- 00:28:41American model so it it passes over do
- 00:28:45emotion it's like the US version of the
- 00:28:46office you know but you know what I was
- 00:28:48thinking about what uh in in um for
- 00:28:52example in in a time where AI will start
- 00:28:54making decisions of its own and he's
- 00:28:56going to start un learning stuff on its
- 00:28:59own then it can go to the dark side
- 00:29:01right is because the AI has the ability
- 00:29:04to change our minds like Google today
- 00:29:06Google can affect the search result
- 00:29:08results and change the whole country's
- 00:29:11Minds like with the companies and
- 00:29:13politicians Twitter uh Facebook so what
- 00:29:15happens if AI does that so so I feel
- 00:29:18like it's it's a nice like full circle
- 00:29:21because when Joe asked so like about
- 00:29:24again like in my own words about the
- 00:29:25architectures I think the next step is
- 00:29:27an architecture allows this process of
- 00:29:29both finding the best data and
- 00:29:31unlearning constantly forgetting the bad
- 00:29:34data um is the next step and uh we have
- 00:29:37some things like in store future plans H
- 00:29:40for this that we call Dynamic Ai and
- 00:29:43recently Google also published like a
- 00:29:45research about an architecture that
- 00:29:46they're bring to the table called Titans
- 00:29:48which in the long term they see it as a
- 00:29:51potential replac Transformers and one of
- 00:29:54the things that characterizes it is the
- 00:29:55differentiation between a long-term
- 00:29:58memory and short-term memory and that
- 00:30:00will you know that's a step that towards
- 00:30:02allowing you to forget stuff that are
- 00:30:04not part of like your DNA and that's
- 00:30:06crazy because that's the part A lot of
- 00:30:08people are not looking when they're
- 00:30:10talking about AI they're talking about
- 00:30:11like me oppus clip you know uh building
- 00:30:14uh creating scripts for videos on chpt
- 00:30:17versus dips you're not thinking about
- 00:30:19you know the biases and and and how how
- 00:30:23how to optimize the results not just on
- 00:30:27new models but also perfecting the old
- 00:30:30models right yeah and and it's something
- 00:30:32that I think everybody should start
- 00:30:35thinking about you know so I'm glad
- 00:30:37you're here and another thing this is
- 00:30:40this is you know we we're talking like
- 00:30:42it sounds all very academic but you guys
- 00:30:44have paying customers you're seeing like
- 00:30:47what kind of like improvements are you
- 00:30:48seeing on data sets do you have like
- 00:30:50statistics around that H so first of all
- 00:30:53like anything that we can release
- 00:30:54publicly we release publicly in the
- 00:30:56sense of there's a lot of case studies
- 00:30:58on public data sets and models that are
- 00:31:00available on our blog when we're working
- 00:31:01with
- 00:31:02customers we we're not really allowed to
- 00:31:05talk about like the improvements that we
- 00:31:07see with them and a lot of times like we
- 00:31:09don't see what they're doing with a
- 00:31:11platform because it also works on Prem
- 00:31:14but again like I gave one example of
- 00:31:16like the less sexy stuff of like classic
- 00:31:19AI like autonomous driving and again
- 00:31:22like I I mentioned a number which is
- 00:31:24crazy but I found like satellite imagery
- 00:31:2717% of frames like in a benchmark
- 00:31:29worldwi data set 17% of the frames
- 00:31:32included like inaccuracies our end to
- 00:31:34end process improved that reduce the
- 00:31:36errors of the model by
- 00:31:3817% 177% and if you think okay forget
- 00:31:41forget but this is me being a CEO so we
- 00:31:44improve M accuracy by 5.1% the reduction
- 00:31:47in Errors By the model was 17% okay but
- 00:31:50then take always double check what
- 00:31:52the someone's fact checking right now as
- 00:31:55we speak we the other room doing it
- 00:31:57right now yeah but think how much it
- 00:31:59cost to train the latest GPT model right
- 00:32:04from but 5% of that budget you can just
- 00:32:08fix like by plugging into this platform
- 00:32:10that's a yeah and with DM like right now
- 00:32:14we I talked about like the so like crazy
- 00:32:16tasks to unlearn bias right which we're
- 00:32:18like okay let's just think about how we
- 00:32:20soling that um we took the liberty of
- 00:32:23like riding the hype wave everyone is
- 00:32:24talking about DPS we took a DPS R1 is
- 00:32:27still llama which is basically released
- 00:32:29by deepik but so like a variation of
- 00:32:32llama that they fine tuned in their own
- 00:32:35mechanisms and their own mechanisms
- 00:32:37being API
- 00:32:40we'll and the first thing is like we
- 00:32:42noticed that there's double like almost
- 00:32:45twice the bias that existed in
- 00:32:48Lama like exists in EP in deep seek in
- 00:32:51in this specific deeps variation so
- 00:32:53think now we're not measuring it using
- 00:32:55our own method we send it to like a
- 00:32:57impart partial so like valuation method
- 00:33:00and we're saying that like the bias
- 00:33:02towards nationality towards race towards
- 00:33:04gender is just crazy higher crazy higher
- 00:33:09wow and then we're using our methods um
- 00:33:12to undo this and again not undo specific
- 00:33:15data points but undo behaviors using
- 00:33:17similar methods and we reduce like the
- 00:33:20bias of dips on if I remember correctly
- 00:33:22nationality 76% and gender and race like
- 00:33:256 5% like reduction wow right so this is
- 00:33:29czy this is something that is not really
- 00:33:31achiev in any method it took us like
- 00:33:33using our platform it takes less than an
- 00:33:34of compute wow that's insane so the
- 00:33:37whole model is improved yeah essentially
- 00:33:4076% around race related yeah exactly so
- 00:33:44basically if you run your model with
- 00:33:46Netflix it will give me better
- 00:33:48recommendations I actually wrote a post
- 00:33:50about hyper personalization now we
- 00:33:52talking about that and I was like H if
- 00:33:54you if you if you fix their bias I'm
- 00:33:56going to get better results right better
- 00:33:59but it won't fix their content it won't
- 00:34:01fix their content no I I I have no place
- 00:34:04in the ecosyst after this podcast I'm
- 00:34:05just talking
- 00:34:07about you should approach him right but
- 00:34:10guys you know even everyday life like I
- 00:34:12think one of the examples you gave to me
- 00:34:14was you know like just recently
- 00:34:16obviously uh president Trump came into
- 00:34:18power and all the models are trained
- 00:34:20around Biden and you remember how
- 00:34:23controversial that was with data across
- 00:34:25the internet and things but just even on
- 00:34:27a dat today basis like using these
- 00:34:30models we don't understand how biased
- 00:34:32they are I guess and 76% I mean do do
- 00:34:36you know how bias alpaka was then or
- 00:34:39yeah I don't know have the stat we can
- 00:34:41we can run it but you know I use deep
- 00:34:42seek right and I don't know if I used a
- 00:34:44specific version but I never
- 00:34:47identified anything that seemed did you
- 00:34:50ask it about
- 00:34:53the the first thing you think about
- 00:34:56using but it's a good example are you
- 00:34:57using dips itself hosted on like another
- 00:35:00server or you using like DPS website
- 00:35:02yeah on dp's website yeah okay yeah good
- 00:35:05luck it's not biased at all at all no no
- 00:35:09they just own you right now so that's it
- 00:35:12I'm it's honestly I'm like in a trance
- 00:35:13when I'm looking it's like chat GPT now
- 00:35:16as well right where you can actually see
- 00:35:17the reasoning and the thinking of it
- 00:35:19like know three amazing yeah yeah I
- 00:35:21really like the way it kind of works
- 00:35:23yeah it is and sometimes you you you you
- 00:35:25watch it while it writes and you're like
- 00:35:28that's that's a new age right you know
- 00:35:30one one thing I'm not impressed by I
- 00:35:32have to say is operator have you tried
- 00:35:34operator no I haven't Tri yet so slow
- 00:35:37yeah but they said in in the in the
- 00:35:39video they released they they said it's
- 00:35:41a very early version of it they were
- 00:35:43just jumping the gun you know to be
- 00:35:45first because of dips and everything I
- 00:35:47think uh but but it's it will be
- 00:35:50something crazy yeah also I love how
- 00:35:52crazy fast things are progressing that's
- 00:35:54true that we're like this new like
- 00:35:58but he's able to replace me doing any
- 00:36:00kind of job just like a bit it's like R
- 00:36:02CK when he's in the plane and the
- 00:36:04internet is not working right it's like
- 00:36:05oh the internet's down it's like you're
- 00:36:08in a bird in the
- 00:36:10sky no but it's a great segue to what I
- 00:36:12wanted to ask you um is you know
- 00:36:15personally and based on what you see you
- 00:36:17know doing what you do and and working
- 00:36:18on what you're working is like how do
- 00:36:20you see you said replacing me so how do
- 00:36:23you see like all of us in this age you
- 00:36:25know we said last time that the new
- 00:36:28titles the titles of the jobs are not
- 00:36:30invented yet but that's okay that's a
- 00:36:31good title would love to hear what you
- 00:36:34think about you know jobs will be if
- 00:36:36jobs will be replaced
- 00:36:37Etc so because I'm going to say
- 00:36:40something similar so I think if I said
- 00:36:43that one day I'm waking up optimistic
- 00:36:45one day I'm waking up pessimistic I I
- 00:36:47think like on that front every major
- 00:36:50technological disruption big or small
- 00:36:52there was always the underlying
- 00:36:54assumption that it's going to take out
- 00:36:56so many jobs but as easily it created
- 00:36:58just like whole new jobs that no one
- 00:37:00anticipated the classic example is like
- 00:37:03the introduction of ATMs didn't remove
- 00:37:05the Nique for bank tellers right the
- 00:37:08online banking
- 00:37:10de well today kind of did but now they
- 00:37:13hire more Market I see at the beginning
- 00:37:15ATMs if to touch that point at the
- 00:37:17beginning ATMs didn't really replace
- 00:37:19them but today you can also deposit and
- 00:37:21and deposit checks and everything with
- 00:37:23an ATM and then with that and the online
- 00:37:25tellers are obsolete so it technology
- 00:37:28did take their job but it took took a
- 00:37:31while longer but I think that Ai and all
- 00:37:33this revolution that we talk about is
- 00:37:34the same as going online you know going
- 00:37:37online took it took a lot but
- 00:37:38essentially it started in ' 93 it it
- 00:37:41took a while to get where it is today
- 00:37:43but I think that we are somewhere there
- 00:37:45like you know before the do bubble
- 00:37:47somewhere but the growth of technology
- 00:37:48is much faster than it was before guys
- 00:37:51guys I really want to hear what is it
- 00:37:53utopian is it dystopian like what is
- 00:37:56that future I think it would be like
- 00:37:58like today but just like more flashy and
- 00:38:00cool right so more flashy and cool more
- 00:38:04cuz I mean you know we don't know what
- 00:38:07we don't know right um and I think in
- 00:38:10that sense um like as much as it's bad
- 00:38:15for a tech CEO to say this like at hard
- 00:38:17I'm a Social
- 00:38:18Democrat if we reach a reality where
- 00:38:21univers Universal basic income is
- 00:38:23available because AI is doing all the
- 00:38:25job amazing yeah is ch going to be the
- 00:38:28thing to create it I I doubt it I doubt
- 00:38:29that we're heading there I think like
- 00:38:31we'll still be running the meal and like
- 00:38:33having to wake up in the morning to go
- 00:38:35to work hopefully it will be a bit more
- 00:38:36flexible I will probably work even
- 00:38:39harder that's but just like we'll have
- 00:38:42different tools maybe we'll use our
- 00:38:44brain more maybe like low tech jobs will
- 00:38:46actually be the a more crucial human
- 00:38:49delivered things or maybe like people
- 00:38:51that are now ER using tools to assist
- 00:38:54them will be the human evaluator to
- 00:38:56assist those tools I'm thinking who's
- 00:38:57going to be the Netflix of this era you
- 00:39:00know the the maybe it's maybe it's open
- 00:39:03AI I don't know maybe it's list James
- 00:39:04Cameron's company yeah he he said you
- 00:39:07know it's it's going to get to the stage
- 00:39:09where production is not production it's
- 00:39:11like just generation right so so let's
- 00:39:13say Roy you have a podcast in 10 minutes
- 00:39:16and you want to watch a thriller it's
- 00:39:18going to create something for you that's
- 00:39:1910 minutes long yeah exactly exactly
- 00:39:22what you want to watch just now with
- 00:39:23your favorite actors and actresses and
- 00:39:25you're going to watch it and that's
- 00:39:26going to be there's there's there are uh
- 00:39:29two guys named the door brothers and
- 00:39:32they actually create with V2 the door
- 00:39:35door d o r d o r brothers and they
- 00:39:39create uh animations online they are
- 00:39:41very famous for they got viral for a lot
- 00:39:43of things they did like politically
- 00:39:44stuff but they now created um the first
- 00:39:47their first AI show right so there's
- 00:39:50like an episode and it's it's something
- 00:39:53sci-fi about an astronaut and he's going
- 00:39:55to to space with aliens so it's not not
- 00:39:57something you would watch you replace
- 00:39:59Netflix yet but it's also already
- 00:40:00started these are faceless YouTube
- 00:40:02videos right no no there's there's a
- 00:40:04face there's an astronaut and then he
- 00:40:06takes his helmet off and there's a face
- 00:40:08but yes it's not there yet I mean like
- 00:40:10you don't have a a human being as the
- 00:40:12face of the YouTube channel it's like
- 00:40:14animation created with AI storyline
- 00:40:17created by AI scenes transitions all the
- 00:40:20no but it'll take a year or two years to
- 00:40:22start creating something was which more
- 00:40:24it's more serious yeah something you
- 00:40:26would watch we want well the whole
- 00:40:28Nvidia thing where they're like
- 00:40:30generating on the Fly Right to get super
- 00:40:33high FPS on games oh I saw yesterday a
- 00:40:36comparison of the
- 00:40:371590 what was that it's like he he
- 00:40:40actually calculates every tray every
- 00:40:42weight every trace of ray of light
- 00:40:45everywhere I mean no no everywhere but
- 00:40:48in so many bounces it's crazy and it's
- 00:40:50running like on steady 250 FPS the
- 00:40:53hardest uh cyber Punk game
- 00:40:57understand anything he says you're not
- 00:40:59alone yeah yeah but one question wearing
- 00:41:01a Super Mario br's shirt
- 00:41:04so yeah yeah no you're right safe
- 00:41:08assumption uh but the question that I
- 00:41:10had in mind it's kind of connected
- 00:41:11everything I lost in mind so lost my
- 00:41:13train thoughts you take it sorry I'll
- 00:41:15edit all this out into this question no
- 00:41:18single me
- 00:41:22out oh that's crazy it was the we're
- 00:41:25talking about the Nvidia and was before
- 00:41:27that was before that but never mind
- 00:41:29we'll get to there don't worry no no
- 00:41:32we'll get there it's going to go I Echo
- 00:41:33what you said about so like hyper
- 00:41:35personalization I think like that's and
- 00:41:37again that has nothing to do I think
- 00:41:38like without learning at the moment but
- 00:41:40just like the fact that content could be
- 00:41:42really adjusted per so it's not
- 00:41:43recommending the right content it's
- 00:41:45actually creating the right content for
- 00:41:47you based on what you've watched and I
- 00:41:48think like there was a Black Mirror
- 00:41:50episode with a similar concept right I
- 00:41:53can definitely see it going there
- 00:41:54hopefully a bit dark yeah oh hopefully I
- 00:41:58remember the question so maybe you know
- 00:42:00connecting it all if tell me if this is
- 00:42:02true about what hu if I said it
- 00:42:06right enter Japanese subtitles um so if
- 00:42:11Joe said that James K uh uh claims that
- 00:42:14okay we're GNA I'm going to tell you
- 00:42:16what I I say James CL Cameron claims
- 00:42:18claims oh sorry actually did yes you
- 00:42:20said I just literally said that two
- 00:42:22minutes ago you're right anyway can you
- 00:42:26make can you make everyone
- 00:42:27forget ex it's called editing I have 10
- 00:42:30minutes show me show me a skateboarding
- 00:42:32videos in the city of Las Vegas skating
- 00:42:35on top of the Las Vegas sphere so it's
- 00:42:38going to generate it but we need physics
- 00:42:40right real life physics to generate
- 00:42:43so if the physics engine is wrong and
- 00:42:46this is what we see when you generate
- 00:42:48skate videos you see them like looping
- 00:42:50all around maybe that's something we
- 00:42:52need to unlearn like get the physics
- 00:42:54right could be but again I feel like I I
- 00:42:57would be arrogant to say like unlearning
- 00:42:58is the solution for the problem of AI
- 00:43:00with physics and I think there's even
- 00:43:03brighter Minds working on so like this
- 00:43:05intersection both in foundational
- 00:43:07physics models and also in another
- 00:43:10approach to generative AI like if you
- 00:43:12see what f f Lee is doing or Yan leun
- 00:43:16they're talking about um world models if
- 00:43:19I remember correctly yes it's one of the
- 00:43:21top four problems identified yeah so so
- 00:43:25basically like as much as we're talking
- 00:43:27now about llms all the
- 00:43:30time again like obviously I'm
- 00:43:32paraphrasing but like they say this is a
- 00:43:33temporary setback like and the fact that
- 00:43:36all the industry is focusing on this is
- 00:43:37actually preventing us from reaching
- 00:43:39real artificial general intelligence
- 00:43:42which is models that understand how the
- 00:43:43world works and then you apply them to
- 00:43:45different missions instead of predicting
- 00:43:47the next probable token for the next war
- 00:43:50that you want in their response so I I
- 00:43:52heard that about about about the
- 00:43:54physical aspect of uh of AI I heard a
- 00:43:57podcast a while ago I forgot who it was
- 00:43:59but it was someone who knew what he was
- 00:44:01talking about and and he said that so
- 00:44:03not this one what's missing not this one
- 00:44:06it was
- 00:44:07mine it was better um no but I heard
- 00:44:11that right now all that's missing to to
- 00:44:13teach physical uh AI is data we're there
- 00:44:17as as far you know as far as the models
- 00:44:19but we need data we need real life data
- 00:44:21like what happens when you roll a bottle
- 00:44:23of a of a table then what happens to the
- 00:44:25bottle when you get reach the end so
- 00:44:28it's all about the data right like we
- 00:44:30had have enough data for textual with a
- 00:44:32internet and everything but we still
- 00:44:34don't have we have that in blender and
- 00:44:353D software you know for visual effects
- 00:44:38yeah we have phys physics data but you
- 00:44:40don't have enough that's what I that's
- 00:44:41why he said um assuming we don't have
- 00:44:43enough of in still Marvel in in released
- 00:44:46Marvel videos you can see wrong physics
- 00:44:49in visual effects I'm guessing I'm
- 00:44:51guessing um you know cars for example
- 00:44:54like Tesla and electric cars they also
- 00:44:56provide data even when they fail when
- 00:44:58they do when they create an accident
- 00:45:00when they make an accident right it's
- 00:45:02data what happens when you crash a wall
- 00:45:03it's something the AI needs to to learn
- 00:45:06hopefully without everybody anybody
- 00:45:07injuring I'm never going to buy a yellow
- 00:45:09motorbike by the way just so you
- 00:45:13know well it's fixed now you fixed okay
- 00:45:17fix you can buy it I'm color blind by
- 00:45:20the way so maybe so that's how wasn't
- 00:45:24fixing I thought you were going to say
- 00:45:26I'm never going toy this yeah I get
- 00:45:28scared for a second was was your
- 00:45:30categorization the 10% that was
- 00:45:32wrong what's the no the the yellow
- 00:45:35motorbike yeah yeah so I gave a yellow
- 00:45:37motorbike as an example but it's not
- 00:45:39that there was a particular issue with
- 00:45:40yellow motores it's everything you
- 00:45:42shouldn't get on the world right right
- 00:45:44so do you use AI at home not at
- 00:45:47work um I think like everyone now uses
- 00:45:50the it's a very like by the way like
- 00:45:52steep R I feel like we were heading
- 00:45:55somewhere else with a question so I feel
- 00:45:57like everyone is using AI now like right
- 00:46:00I'm I'm asking Chachi PT for advice when
- 00:46:03I need to troll a friend I'm asking it
- 00:46:05to sorry to write a song to it it's not
- 00:46:09just like for for job stuff and also we
- 00:46:12talked about Google can Google Now
- 00:46:14really control governments and public
- 00:46:17perception if you end up using GPT
- 00:46:19search or perplexity exactly so I think
- 00:46:22like AI was very present in our lives
- 00:46:24even before just like less a in its own
- 00:46:27face yeah and now it just like became a
- 00:46:30part of our day-to-day life the
- 00:46:31interface updated to we talked about it
- 00:46:34in the first episode we said it's it was
- 00:46:36the phase where we talk to the product
- 00:46:38and the product talk to the AI and then
- 00:46:40when we started talking to the AI That's
- 00:46:43that was what it was for me what are
- 00:46:44your kind of go-to applications and like
- 00:46:47what are the things that you're playing
- 00:46:49about with at home like in terms of AI
- 00:46:51um first of all like I I work way too
- 00:46:54much yeah there's no I home is if my
- 00:46:57investors are hearing
- 00:46:59me so I think like one of my goals for
- 00:47:02this year is maybe
- 00:47:03to get some work life balance like I I
- 00:47:06have a little little font the word like
- 00:47:08life in in this sense a yeah but I think
- 00:47:11like when it first came out I remember
- 00:47:13like I used like even before Chach PT
- 00:47:16there was like everyone were all about
- 00:47:17do right like image generation I
- 00:47:19remember like I used it for just like
- 00:47:21creative stuff I was like I had like a
- 00:47:23dream journal with like sometimes I
- 00:47:25would dream full Netflix like movies and
- 00:47:28then I was like okay I want now given
- 00:47:30that my poor like drawing skills will
- 00:47:32lead me nowhere I can just like describe
- 00:47:35some images some scenes from there and
- 00:47:37see what happens right this is when it
- 00:47:39was just on Discord right I think they
- 00:47:43start yeah yeah that's
- 00:47:46right Journey now now I now I use
- 00:47:49ideogram it's it's it's crazy it's crazy
- 00:47:51good yeah so by the way it's funny that
- 00:47:53like it's been what like a bit more than
- 00:47:55two years but now I'm like oh I missed
- 00:47:57the day creative about it cuz now I'm
- 00:48:00like it's crazy remember Netscape we're
- 00:48:02going to get there remember Netscape no
- 00:48:03one uses it now I don't think even CH
- 00:48:06knows
- 00:48:07what is because of some things you don't
- 00:48:10need to force it to forget right
- 00:48:14yeah but again just like for my dayto
- 00:48:16day I think like you know doing so like
- 00:48:19I like we're all done it like you're
- 00:48:20going into a new place you instead of
- 00:48:23like now Googling for like top 10
- 00:48:24recommended spots in Singapore I'm like
- 00:48:26okay chat GPT what can I do there in
- 00:48:29like two days uh I'll have like a
- 00:48:31layover in Abu Dhabi how can I both like
- 00:48:34maximize 24 hours both for business and
- 00:48:36like for which is what you would do on
- 00:48:38Google sorry yeah but with Google it
- 00:48:41would just like take longer it would
- 00:48:42take longer I now I feel like we're in a
- 00:48:46gap where in Google already like the
- 00:48:49results are Mastered by SEO and then the
- 00:48:52itself it's like paid for within the
- 00:48:55websites again like there a problem
- 00:48:57because then these models are trained on
- 00:48:59the same public data right so they have
- 00:49:01their the same biases or inaccuracies
- 00:49:04but I still think that it's like a
- 00:49:05moment before um like
- 00:49:10AI gets cannibalized by these like
- 00:49:13marketing uh techniques and right now I
- 00:49:15would trust more like the results that
- 00:49:17chat GPT is giving me for like
- 00:49:18recommendations for things to do in
- 00:49:20Singapore for two days then like uh
- 00:49:22first page on Google results and you
- 00:49:25know it's funny because I feel it was
- 00:49:27just yesterday everybody was talking
- 00:49:28that oh people are moving to search
- 00:49:31things from Google in Tik Tok right when
- 00:49:33you go to Singapore you would go to Tik
- 00:49:35Tok and Instagram and what what's the
- 00:49:37best restaurants to go and yeah know I'm
- 00:49:39saying oh my God we got there yeah and
- 00:49:41then and then it's kind of like it's
- 00:49:43forgotten now you talk about searching
- 00:49:45those stuff in in in AI it was very
- 00:49:47shortlived by the way I think people
- 00:49:49talked about it like for a year and then
- 00:49:51chpt came and and that's it everybody is
- 00:49:53like oh cgp has a search function let's
- 00:49:55go there oh yeah
- 00:49:57I don't want to say anything about like
- 00:49:58three adult like white men sitting in
- 00:50:00one room talking about you know my
- 00:50:02younger sisters or like
- 00:50:04siblings they are using Tik Tok I don't
- 00:50:06know like talking about bias you can fix
- 00:50:09that guys let me tell you how to do it
- 00:50:12yeah H so we're coming to the end let's
- 00:50:15H go around the room like we normally do
- 00:50:17it flew by this time didn't it flew by
- 00:50:19not this time that makes it sound like
- 00:50:20every other
- 00:50:22episode
- 00:50:24promise um let's uh
- 00:50:27let's maybe just uh summarize what our
- 00:50:30men thoughts have been from the the cast
- 00:50:33no me again this is uh this episode and
- 00:50:37talking to Ben about you know seeing we
- 00:50:39talk about shovels and stuff and but you
- 00:50:41know I'm trying to always get a macro
- 00:50:43view of where are we going is this the
- 00:50:45internet or is this Netscape and this
- 00:50:47episode is like we still have a lot in
- 00:50:50front of us there's a lot to do there's
- 00:50:52a lot of solutions a lot of problems to
- 00:50:54solve a lot of opportunities there going
- 00:50:56to get excited and scary and exciting
- 00:50:58that's the great balance of of
- 00:51:01Life yeah I get what you're saying for
- 00:51:03me is the realization of we're still in
- 00:51:06the early stage we're the ear early
- 00:51:08adopters we still have a lot of things
- 00:51:09to do and you know I I didn't hear about
- 00:51:12um the unlearning and uh last episode I
- 00:51:16didn't even think about an an analytics
- 00:51:18between Ai and consumers so every time
- 00:51:21we we're we're doing this episode I'm
- 00:51:23thinking oh man there's so many layers
- 00:51:25to this thing and it feels like it's
- 00:51:27just the beginning right and it's super
- 00:51:30exciting like I'm ready I'm ready for
- 00:51:32the next episode I don't know what's
- 00:51:34gonna happen what am I going to learn
- 00:51:36but unlearning and biases is something
- 00:51:38that I I watched in Netflix but just now
- 00:51:41I understand that it's just started and
- 00:51:43you have the competitive Advantage
- 00:51:44because you're doing it in real life
- 00:51:46scenarios not in Academy yeah you know
- 00:51:48so it's crazy yeah um yeah I feel like I
- 00:51:53mostly like talked about what I'm doing
- 00:51:54so I I don't want to talk about this
- 00:51:57again but just like it's for me I'm
- 00:52:00talking constantly to data scientists
- 00:52:02right like and I feel like there's been
- 00:52:03a mutual osis process where my very
- 00:52:06technical team and Technical Partners
- 00:52:09have become a bit more like me
- 00:52:10storytellers and for me now I'm a bit
- 00:52:13more like not that scientist I can't
- 00:52:15code for [ __ ] sorry my French but I can
- 00:52:18carry a conversation and understand the
- 00:52:19concepts but it's been just like very
- 00:52:21fun for me to take a step back talk
- 00:52:23about things like in bird's eye view get
- 00:52:26Outsider perspective on on like what we
- 00:52:28do and the challenges ahead and I think
- 00:52:30like coming out of it if I need to I
- 00:52:33said like each day I'm waking up on a
- 00:52:34different
- 00:52:35side I want to say like I'm an
- 00:52:37optimistic person at heart and I'm
- 00:52:39coming out optimistic from this
- 00:52:40conversation I think for me coming out
- 00:52:42of this conversation as well and having
- 00:52:44spoken to you about this I mean this was
- 00:52:45something that a nightmare I hadn't even
- 00:52:48had like what and you guys are already
- 00:52:51building a very very strong platform
- 00:52:54around it to save it so you know
- 00:52:56just the thought that we have like great
- 00:52:58minds out there kind of thinking about
- 00:53:00the future because it's moving so quick
- 00:53:02you know it's h it's actually very
- 00:53:04reassuring I have to say but Ben thanks
- 00:53:06for coming along it's been really great
- 00:53:08having you here and uh if you enjoyed
- 00:53:11the episode like subscribe let us know
- 00:53:14your comments in the comment section
- 00:53:16below and we'll catch you next week for
- 00:53:18the next episode of AI unscripted thanks
- 00:53:20guys thank you
- AI
- Unlearning
- Machine Learning
- Bias in AI
- Data Compliance
- Generative AI
- Autonomous Vehicles
- Data Science
- Technology Innovation
- Startup