What is Project Euphonia?

Project Euphonia is a Google initiative aimed at improving speech recognition for individuals with speech impairments, and recreating their voices as they sounded before diagnosis.

Tim Shaw is a former NFL player diagnosed with ALS, contributing to Project Euphonia by providing voice samples to help develop better speech recognition for ALS patients.

What impact is AI having on healthcare?

AI is improving diagnostic capabilities, helping predict and manage diseases, and enhancing communication tools for individuals with speech impairments.

How does AI assist in predicting blindness?

AI analyzes eye scans to detect early signs of diabetic retinopathy, helping prevent blindness through timely intervention.

What is diabetic retinopathy?

Diabetic retinopathy is a diabetes complication that affects the eyes and can lead to blindness if not diagnosed and treated early.

How is Google using AI for speech recognition?

Google uses machine learning models to improve speech recognition for people with atypical speech patterns, especially those affected by conditions like ALS.

What are some potential future applications of AI in healthcare?

AI could help in diagnosing mental health disorders, predicting epidemics, and creating personalized medicine regimens.

How does AI-driven voice synthesis work?

Voice synthesis, or voice imitation, involves using machine learning to convert text into sound waveforms, effectively recreating a person's voice.

What challenges do AI healthcare projects face?

Challenges include needing large amounts of data, adapting models to individual needs, and ensuring accuracy across diverse conditions.

How has AI changed communication for ALS patients?

AI provides tools for ALS patients to communicate more effectively by mimicking their natural speech patterns and voice.

Healed through A.I. | The Age of A.I.

00:39:55

https://www.youtube.com/watch?v=V5aZjsWM2wo

Ringkasan

TLDRThe episode explores the impact of AI on healthcare, highlighting projects like Google's Project Euphonia, which aims to improve communication for individuals with speech impairments, such as former NFL player Tim Shaw, who is living with ALS. AI technology in healthcare is also addressing diabetic retinopathy in India through early detection and analysis of eye scans, a critical development in preventing blindness. The potential of AI extends to various medical fields, promising better predictive diagnostics, personalized treatment plans, and possible breakthroughs in understanding complex diseases like cancer and mental health disorders. AI's evolution in healthcare signifies a future where medicine is more proactive than reactive, transforming how diseases are diagnosed and managed, and delivering healthcare solutions across the globe, especially in areas with scarce medical resources.

Takeaways

🔍 Importance of improving diagnostic tools with AI
🧠 AI's potential in mental and physical health
🏥 Role of AI in combatting diseases like diabetic retinopathy
🎤 Giving voice back to ALS patients with AI
🌐 Impact of AI on global healthcare access
💬 Revolutionizing communication for those with speech impairments
📊 Data-driven healthcare solutions
🏃 Tim Shaw's journey and impact on speech recognition
🔬 Future implications of AI in detecting health issues
🩺 AI's role in democratizing healthcare

Garis waktu

00:00:00 - 00:05:00
The introductory discussion reflects on life's certainties of death and taxes and the advancements in healthcare that have increased life expectancy over the decades. It raises questions about the potential for machine learning to predict illnesses before they occur and introduces the episode's focus: AI combating blindness and enabling communication for those with neurological diseases.
00:05:00 - 00:10:00
Tim Shaw's journey from an ambitious football player aiming for the NFL to facing the harsh reality of ALS, a neurodegenerative disease, is portrayed. His dedication to football, evident since childhood, transitions to a battle with a disease that strips him of his physical abilities.
00:10:00 - 00:15:00
Tim's ALS diagnosis leads to challenges in everyday activities, significantly impacting his communication. The emotional struggle for both Tim and his family is highlighted as they try to adapt to these changes, illustrating the critical role of AI in improving communication for those affected by such disorders.
00:15:00 - 00:20:00
The efforts to enhance speech recognition for individuals with conditions like ALS are detailed through Google's Project Euphonia. The project aims to train AI models with unique voice samples to improve speech recognition, especially for those whose voices aren't commonly represented in existing datasets.
00:20:00 - 00:25:00
The complexity of creating personalized speech recognition models for each unique individual is underscored. Partnerships with organizations and data collection through initiatives like the ice bucket challenge are crucial for acquiring the necessary voice samples to train AI models for ALS patients.
00:25:00 - 00:30:00
The application of AI in diagnosing diabetic retinopathy in India showcases its potential in healthcare, where the shortage of medical professionals affects timely diagnosis and treatment. AI's use in screening eye conditions helps bridge the gap, making healthcare more accessible and effective.
00:30:00 - 00:39:55
The narrative shifts to the larger possibilities of AI in healthcare, envisioning a future where AI aids in diagnosing various illnesses, including mental health disorders and cancer. The emotional culmination features the restoration of Tim Shaw's voice through AI, symbolizing hope and the ongoing journey towards integrating AI in improving life quality.

Tampilkan lebih banyak

Peta Pikiran

Video Tanya Jawab

What is Project Euphonia?
Project Euphonia is a Google initiative aimed at improving speech recognition for individuals with speech impairments, and recreating their voices as they sounded before diagnosis.
Who is Tim Shaw?
Tim Shaw is a former NFL player diagnosed with ALS, contributing to Project Euphonia by providing voice samples to help develop better speech recognition for ALS patients.
What impact is AI having on healthcare?
AI is improving diagnostic capabilities, helping predict and manage diseases, and enhancing communication tools for individuals with speech impairments.
How does AI assist in predicting blindness?
AI analyzes eye scans to detect early signs of diabetic retinopathy, helping prevent blindness through timely intervention.
What is diabetic retinopathy?
Diabetic retinopathy is a diabetes complication that affects the eyes and can lead to blindness if not diagnosed and treated early.
How is Google using AI for speech recognition?
Google uses machine learning models to improve speech recognition for people with atypical speech patterns, especially those affected by conditions like ALS.
What are some potential future applications of AI in healthcare?
AI could help in diagnosing mental health disorders, predicting epidemics, and creating personalized medicine regimens.
How does AI-driven voice synthesis work?
Voice synthesis, or voice imitation, involves using machine learning to convert text into sound waveforms, effectively recreating a person's voice.
What challenges do AI healthcare projects face?
Challenges include needing large amounts of data, adapting models to individual needs, and ensuring accuracy across diverse conditions.
How has AI changed communication for ALS patients?
AI provides tools for ALS patients to communicate more effectively by mimicking their natural speech patterns and voice.

Lihat lebih banyak ringkasan video

Dapatkan akses instan ke ringkasan video YouTube gratis yang didukung oleh AI!

Teks

Gulir Otomatis:

00:00:07
You know how they say
00:00:08
there are two certainties in life, right?
00:00:10
Death and taxes.
00:00:12
Can't we get rid of one of those?
00:00:15
See, 100 years ago, life expectancy was only 45,
00:00:17
can you believe that?
00:00:18
Then by the 1950s, it was up to 65,
00:00:20
and today, it's almost 80.
00:00:22
Tomorrow, who knows? Right?
00:00:25
Healthcare has made huge progress.
00:00:27
We've eradicated epidemics that used to kill millions,
00:00:30
but life is fragile.
00:00:32
People still get sick, or pass away
00:00:34
for reasons that maybe
00:00:36
should be, someday curable.
00:00:41
What if we could improve diagnosis?
00:00:44
Innovate to predict illness instead of just react to it?
00:00:47
In this episode,
00:00:49
we'll see how machine learning is combating
00:00:51
one of the leading causes of blindness,
00:00:53
and enabling a son with a neurological disease
00:00:56
to communicate with his family.
00:00:58
AI is changing the way we think
00:01:00
about mind and body,
00:01:01
life and death,
00:01:03
and what we value most,
00:01:05
our human experience.
00:01:07
[fanfare music playing]
00:01:09
[announcer] ...and our other co-captain, Number 8! Tim Shaw!
00:01:14
[crowd cheering]
00:01:18
[John Shaw] We've lived with his football dream,
00:01:20
All the way back to sixth grade
00:01:22
when his coach said,
00:01:23
"This kid is gonna go a long way."
00:01:25
From that point on,
00:01:27
Tim was doing pushups in his bedroom at night,
00:01:29
Tim was the first one at practice.
00:01:32
Tim took it seriously.
00:01:34
[crowd screaming and cheering]
00:01:42
[whistle blows]
00:01:44
[announcer] Number 8, Tim Shaw!
00:01:47
[crowd cheering]
00:01:49
I don't know what they're doing out there,
00:01:51
and I don't know who they comin' to!
00:01:53
[Robert Downey Jr.] For as long as he can remember,
00:01:55
Tim Shaw dreamed of three letters...
00:01:58
N-F-L.
00:02:00
[whistle blows]
00:02:01
He was a natural from the beginning.
00:02:03
As a kid, he was fast and athletic.
00:02:05
He grew into 235 pounds of pure muscle,
00:02:09
and at 23, he was drafted to the pros.
00:02:13
His dream was real.
00:02:16
He was playing professional football.
00:02:19
[reporter] Hello, I'm with Tim Shaw.
00:02:21
You get to start this season right. What's it feel like?
00:02:24
It's that amazing pre-game electricity,
00:02:26
the butterflies are there, and I'm ready to hit somebody. You might wanna look out.
00:02:30
Hey, Titans fans, it's Tim Shaw here,
00:02:32
linebacker and special teams animal.
00:02:35
He loves what he does.
00:02:37
He says, "They pay me to hit people!"
00:02:39
[crowd cheering]
00:02:42
I'm here to bring you some truth,
00:02:44
a little bit of truth,
00:02:46
and so we'll call it T-Shaw's truth,
00:02:47
'cause it's not all the way true, but it's my truth.
00:03:03
[Tim Shaw speaking]
00:03:27
[Tim from 2015 interview] In 2012,
00:03:29
my body started to do things it hadn't done before.
00:03:31
My muscles were twitching, I was stumbling,
00:03:34
or I was not making a play I would have always made.
00:03:39
I just wasn't the same athlete,
00:03:40
I wasn't the same football player that I'd always been.
00:03:45
[Tim speaking]
00:04:03
[Downey] The three letters that had defined Tim's life up to that point
00:04:07
were not the three letters that the doctor told him that day.
00:04:10
A-L-S.
00:04:12
[Tim speaking]
00:04:37
Okay...
00:04:39
[Downey] A-L-S, which stands for "amyotrophic lateral sclerosis,"
00:04:43
is also known as Lou Gehrig's Disease.
00:04:45
It causes the death of neurons controlling voluntary muscles.
00:04:49
[Sharon Shaw] He can't even scratch his head...
00:04:52
Better yet?
00:04:53
...none of those physical things
00:04:55
that were so easy for him before.
00:04:58
He has to think about every step he takes.
00:05:01
So Tim's food comes in this little container.
00:05:03
We're gonna mix it with water.
00:05:06
[Tim speaking]
00:05:17
[Downey] As the disease progresses,
00:05:19
muscles weaken.
00:05:20
Simple everyday actions, like walking, talking, and eating,
00:05:24
take tremendous effort.
00:05:49
Tim used to call me on the phone in the night,
00:05:52
and he had voice recognition,
00:05:54
and he would speak to the phone,
00:05:56
and say, "Call Dad."
00:05:57
His phone didn't recognize the word "Dad."
00:06:02
So, he had said to me...
00:06:05
[voice breaking] "Dad, I've changed your name.
00:06:07
I'm calling... I now call you "Yo-yo."
00:06:11
So he would say into his phone, "Call Yo-yo."
00:06:16
[Sharon] Tim has stopped a lot of his communication.
00:06:20
He just doesn't talk as much as he used to,
00:06:23
and I, I miss that.
00:06:25
I miss it.
00:06:26
-What do you think about my red beard? -No opinion.
00:06:29
[snorts] That means he likes it,
00:06:31
just doesn't wanna say on camera.
00:06:32
Now, my favorite was when you had the handlebar moustache.
00:06:35
[Downey] Language, the ability to communicate with one another.
00:06:39
It's something that makes us uniquely human,
00:06:42
making communication an impactful application for AI.
00:06:47
[Sharon] Yeah, that'll be fun.
00:06:57
[Julie Cattiau] My name is Julie.
00:06:59
I'm a product manager here at Google.
00:07:01
For the past year or so, I've been working on Project Euphonia.
00:07:04
Project Euphonia has two different goals.
00:07:06
One is to improve speech recognition
00:07:09
for people who have a variety of medical conditions.
00:07:12
The second goal is to give people their voice back,
00:07:15
which means actually recreating the way they used to sound
00:07:18
before they were diagnosed.
00:07:20
If you think about communication,
00:07:22
it starts with understanding someone,
00:07:24
and then being understood,
00:07:26
and for a lot of people,
00:07:28
their voice is like their identity.
00:07:31
[Downey] In the US alone, roughly one in ten people
00:07:34
suffer acquired speech impairments,
00:07:36
which can be caused by anything from ALS,
00:07:39
to strokes, to Parkinson's, to brain injuries.
00:07:43
Solving it is a big challenge,
00:07:45
which is why Julie partnered with a big thinker to help.
00:08:07
[Downey] Dimitri is a world-class research scientist and inventor.
00:08:11
He's worked at IBM, Princeton, and now Google,
00:08:14
and holds over 150 patents.
00:08:18
Accomplishments aside,
00:08:19
communication is very personal to him.
00:08:21
Dimitri has a pretty strong Russian accent,
00:08:24
and also he learned English when he was already deaf,
00:08:28
so he never heard himself speak English.
00:08:32
Oh, you do? Oh, okay.
00:08:34
[Downey] Technology can't yet help him hear his own voice.
00:08:37
He uses AI-powered Live Transcribe
00:08:40
to help him communicate.
00:08:42
[Cattiau] Okay, that's awesome.
00:08:43
So we partnered up with Dimitri to train a recognizer
00:08:47
that did a much better job at recognizing his voice.
00:08:50
The model that you're using right now for recognition,
00:08:53
what data did you train it on?
00:09:01
[Downey] So, how does speech recognition work?
00:09:07
First, the sound of our voice is converted into a waveform,
00:09:10
which is really just a picture of the sound.
00:09:13
Waveforms are then matched to transcriptions,
00:09:16
or "labels" for each word.
00:09:18
These maps exist for most words in the English language.
00:09:21
This is where machine learning takes over.
00:09:24
Using millions of voice samples,
00:09:26
a deep learning model is trained
00:09:28
to map input sounds to output words.
00:09:31
Then the algorithm uses rules, such as grammar and syntax,
00:09:35
to predict each word in a sentence.
00:09:37
This is how AI can tell the difference
00:09:39
between "there," "their," and "they're."
00:09:42
[Cattiau] The speech recognition model that Google uses
00:09:45
works very well for people
00:09:47
who have a voice that sounds similar
00:09:50
to the examples that were used to train this model.
00:09:53
In 90% of cases, it will recognize what you want to say.
00:09:56
[Downey] Dimitri's not in that 90%.
00:09:58
For someone like him, it doesn't work at all.
00:10:01
So he created a model based on a sample of one.
00:10:23
[Downey] But making a new unique model
00:10:25
with unique data for every new and unique person
00:10:27
is slow and inefficient.
00:10:29
Tim calls his dad "Yo-yo."
00:10:32
Others with ALS may call their dads something else.
00:10:35
Can we build one machine
00:10:37
that recognizes many different people,
00:10:40
and how can we do it fast?
00:10:42
[Cattiau] So this data doesn't really exist.
00:10:45
We have to actually collect it.
00:10:46
So we started this partnership with ALS TDI in Boston.
00:10:50
They helped us collect voice samples
00:10:52
from people who have ALS.
00:10:54
This is for you, T. Shaw.
00:10:55
[all] One, two, three!
00:11:00
[all cheering]
00:11:02
I hereby accept your ALS ice bucket challenge.
00:11:07
[yelping softly]
00:11:09
[Downey] When the ice bucket challenge went viral,
00:11:11
millions joined the fight, and raised over $220 million for ALS research.
00:11:17
There really is a straight line from the ice bucket challenge
00:11:20
to the Euphonia Project.
00:11:29
ALS Therapy Development Institute is an organization
00:11:31
that's dedicated to finding treatments and cures for ALS.
00:11:35
We are life-focused.
00:11:37
How can we use technologies we have
00:11:40
to help these people right away?
00:11:42
Yeah, they're actually noisier.
00:11:45
That's a good point.
00:11:47
I met Tim a few years ago
00:11:49
shortly after he had been diagnosed.
00:11:51
Very difficult to go public,
00:11:53
but it was made very clear to me
00:11:55
that the time was right.
00:11:57
He was trying to understand what to expect in his life,
00:12:00
but he was also trying to figure out,
00:12:02
"All right, what part can I play?"
00:12:04
All the ice bucket challenges and the awareness
00:12:07
have really inspired me also.
00:12:09
If we can just step back,
00:12:10
and say, "Where can I shine a light?"
00:12:13
or "Where can I give a hand?"
00:12:15
When the ice bucket challenge happened,
00:12:17
we had this huge influx of resources of cash,
00:12:21
and that gave us the ability
00:12:23
to reach out to people with ALS who are in our programs
00:12:26
to share their data with us.
00:12:28
That's what got us the big enough data sets
00:12:31
to really attract Google.
00:12:36
[Downey] Fernando didn't initially set out
00:12:38
to make speech recognition work better,
00:12:40
but in the process of better understanding the disease,
00:12:44
he built a huge database of ALS voices,
00:12:47
which may help Tim and many others.
00:12:54
[John] It automatically uploaded it.
00:12:56
[Tim] Oh.
00:13:59
How many have you done, Tim?
00:14:07
2066?
00:14:08
[Fernando Vieira] Tim, he wants to find every way that he can help.
00:14:12
It's inspiring to see his level of enthusiasm,
00:14:16
and his willingness to record lots and lots of voice samples.
00:14:20
[Downey] To turn all this data into real help,
00:14:23
Fernando partnered with one of the people
00:14:25
who started the Euphonia Project, Michael Brenner...
00:14:28
-Hey, Fernando. -Hey, how are you doing?
00:14:30
[Downey] ...a Google research scientist
00:14:32
and Harvard-trained mathematician
00:14:34
who's using machine learning
00:14:35
to solve scientific Hail Marys, like this one.
00:14:38
Tim Shaw has recorded almost 2,000 utterances,
00:14:42
and so we decided to apply our technology
00:14:44
to see if we could build a recognizer that understood him.
00:14:48
[Tim speaking]
00:14:51
The goal, right, for Tim, is to get it so that it works
00:14:54
outside of the things that he recorded.
00:14:56
The problem is that we have no idea
00:14:58
how big of a set that this will work on.
00:15:00
[Brenner] Dimitri had recorded upwards of 15,000 sentences,
00:15:04
which is just an incredible amount of data.
00:15:07
We couldn't possibly expect anyone else
00:15:09
to record so many sentences,
00:15:11
so we know that we have to be able to do this
00:15:14
with much less recordings from a person.
00:15:16
So it's not clear it will work.
00:15:18
[Tim speaking]
00:15:22
-That didn't work at all. -Not at all.
00:15:24
He said, "I go the opposite way,"
00:15:26
and it says, "I know that was."
00:15:28
[Brenner] When it doesn't recognize,
00:15:29
we jiggle around the parameters of the speech recognizer,
00:15:32
then we give it another sentence,
00:15:34
and the idea is that you'll get it to understand.
00:15:37
[Tim's recording] Can we go to the beach?
00:15:42
-Yes! Got it. -Got it.
00:15:43
That's so cool. Okay, let's try another.
00:15:45
[Downey] If Tim Shaw gets his voice back,
00:15:48
he may no longer feel that he is defined,
00:15:50
or constrained, by three letters,
00:15:53
but that's a big "if."
00:15:55
While Michael and team Euphonia work away,
00:15:58
let's take a moment and imagine what else is possible
00:16:01
in the realm of the senses.
00:16:03
Speech.
00:16:05
Hearing.
00:16:07
Sight.
00:16:08
Can AI predict blindness?
00:16:10
[truck horn beeps]
00:16:12
Or even prevent it?
00:16:50
[Downey] Santhi does not have an easy life.
00:16:53
It's made more difficult because she has diabetes,
00:16:56
which is affecting her vision.
00:17:05
[Downey] If Santhi doesn't get medical help soon, she may go blind.
00:17:12
[Dr. Jessica Mega] Complications of diabetes
00:17:15
include heart disease,
00:17:16
kidney disease,
00:17:17
but one of the really important complications
00:17:20
is diabetic retinopathy.
00:17:21
The reason it's so important is that
00:17:24
it's one of the lead causes of blindness worldwide.
00:17:27
This is particularly true in India.
00:17:31
[giving instructions]
00:17:53
In the early stages, it's symptomless,
00:17:55
but that's when it's treatable,
00:17:57
so you want to screen them early on,
00:17:59
before they actually lose vision.
00:18:05
In the early stages,
00:18:06
if a doctor is examining the eye,
00:18:08
or you take a picture of the back of the eye,
00:18:10
you will see lots of those bleeding spots in the retina.
00:18:21
Today, the doctors are not enough to do the screening.
00:18:25
We are very limited ophthalmologists,
00:18:27
so there should be other ways
00:18:28
where you can screen the diabetic patients
00:18:31
for diabetic complications.
00:18:32
[Downey] In the US,
00:18:33
there are about 74 eye doctors for every million people.
00:18:37
In India, there are only 11.
00:18:39
So just keeping up with the sheer number of patients,
00:18:42
let alone giving them the attention and care they need,
00:18:45
is overwhelming, if not impossible.
00:18:48
[Dr. R. Kim] We probably see about 2,000 to 2,500 patients
00:18:52
every single day.
00:18:53
[Mega] The interesting thing with diabetic retinopathy
00:18:56
is there are ways to screen and get ahead of the problem.
00:18:59
The challenge is that not enough patients undergo screening.
00:19:02
[Downey] Like Tim Shaw's ALS speech recognizer,
00:19:05
this problem is also about data,
00:19:08
or lack of it.
00:19:09
To prevent more people from experiencing vision loss,
00:19:13
Dr. Kim wanted to get ahead of the problem.
00:19:19
So there's a hemorrhage.
00:19:21
All these are exudates.
00:19:24
[Downey] Dr. Kim called up a team at Google.
00:19:27
Made up of doctors and engineers,
00:19:29
they're exploring ways to use machine learning
00:19:31
to solve some of the world's leading healthcare problems.
00:19:35
So we started with could we train an AI model
00:19:39
that can somehow help read these images,
00:19:41
that can decrease the number of doctors required
00:19:45
to do this task.
00:19:47
So this is the normal view.
00:19:48
When you start looking more deeply,
00:19:50
then this can be a microaneurysm, right?
00:19:53
-This one here? -[man] Could be.
00:19:55
[Downey] The team uses the same kind of machine learning
00:19:58
that allows us to organize our photos
00:20:00
or tag friends on social media,
00:20:02
image recognition.
00:20:05
First, models are trained
00:20:07
using tagged images of things like cats or dogs.
00:20:10
After looking at thousands of examples,
00:20:12
the algorithm learns to identify new images
00:20:15
without any human help.
00:20:17
For the retinopathy project,
00:20:19
over 100,000 eye scans
00:20:21
were graded by eye doctors
00:20:23
who rated each eye scan on a scale from one to five,
00:20:26
from healthy to diseased.
00:20:28
These images were then used
00:20:30
to train a machine learning algorithm.
00:20:32
Over time, the AI learned to predict
00:20:35
which eyes showed signs of disease.
00:20:37
[Dr. Lily Peng] This is the assistant's view
00:20:40
where the model's predictions
00:20:41
are actually projected on the original image,
00:20:45
and it's picking up the pathologies very nicely.
00:20:49
[Downey] To get help implementing the technology,
00:20:51
Lily's team reached out to Verily,
00:20:53
the life sciences unit at Alphabet.
00:20:56
[Mega] So, how was India?
00:20:58
[Peng] Oh, amazing!
00:20:59
[Mega] Verily came out of Google X,
00:21:01
and we sit at the intersection of technology, life science, and healthcare.
00:21:05
What we try to do is think about big problems
00:21:07
that are affecting many patients,
00:21:09
and how can we bring the best tools
00:21:11
and best technologies
00:21:12
to get ahead of the problems.
00:21:14
The technical pieces are so important, and so is the methodology.
00:21:17
How do you capture the right image,
00:21:19
and how does the algorithm work,
00:21:21
and how do you deploy these tools not only here,
00:21:24
but in rural conditions?
00:21:25
If we can speed up this diagnosis process
00:21:28
and augment the clinical care,
00:21:30
then we can prevent blindness.
00:21:32
[Downey] There aren't many other bigger problems that affect more patients.
00:21:36
Diabetes affects 400 million worldwide,
00:21:39
70 million in India alone,
00:21:41
which is why Jessica and Lily's teams
00:21:44
began testing AI-enabled eye scanners there,
00:21:47
in its most rural areas,
00:21:49
like Dr. Kim's Aravind Eye Clinics.
00:21:52
-Is the camera on? -Now it's on.
00:21:55
Yeah.
00:21:56
So once the camera is up,
00:21:57
we need to check network connectivity.
00:21:59
[Sunny Virmani] The patient comes in.
00:22:01
They get pictures of the back of the eye.
00:22:03
One for the left eye, and right eye.
00:22:05
The images are uploaded to this algorithm,
00:22:08
and once the algorithm performs its analysis,
00:22:10
it sends the results back to the system,
00:22:13
along with a referral recommendation.
00:22:15
It's good. It's up and running.
00:22:17
Because the algorithm works in real time,
00:22:20
you can get a real-time answer to a doctor,
00:22:23
and that real-time answer comes back to the patient.
00:22:30
[Kim] Once you have the algorithm,
00:22:32
it's like taking your weight measurement.
00:22:34
Within a few seconds,
00:22:36
the system tells you whether you have retinopathy or not.
00:22:40
[Downey] In the past,
00:22:41
Santhi's condition could've taken months to diagnose,
00:22:44
if diagnosed at all.
00:23:20
[Downey] By the time an eye doctor would've been able to see her,
00:23:23
Santhi's diabetes might have caused her to go blind.
00:23:27
Now, with the help of new technology,
00:23:29
it's immediate,
00:23:31
and she can take the hour-long bus ride
00:23:33
to Dr. Kim's clinic in Madurai
00:23:35
for same-day treatment.
00:23:49
[Downey] Now thousands of patients
00:23:51
who may have waited weeks or months to be seen
00:23:53
can get the help they need before it's too late.
00:24:11
Thank you, sir.
00:24:27
[Downey] Retinopathy
00:24:29
is when high blood sugar damages the retina.
00:24:32
Blood leaks, and the laser treatment
00:24:34
basically "welds" the blood vessels
00:24:36
to stop the leakage.
00:24:38
Routine eye exams can spot the problem early.
00:24:41
In rural or remote areas, like here,
00:24:44
AI can step in and be that early detection system.
00:24:57
[Pedro Domingos] I think one of the most important applications of AI.
00:25:01
is in places where doctors are scarce.
00:25:04
In a way, what AI does is make intelligence cheap,
00:25:08
and now imagine what you can do when you make intelligence cheap.
00:25:11
People can go to doctors they couldn't before.
00:25:14
It may not be the impact that catches the most headlines,
00:25:16
but in many ways it'll be the most important impact.
00:25:31
[family chattering happily]
00:25:40
[Mega] AI now is this next generation of tools
00:25:42
that we can apply to clinically meaningful problems,
00:25:45
so AI really starts to democratize healthcare.
00:25:56
[Mega] The work with diabetic retinopathy
00:25:59
is opening our eyes to so much potential.
00:26:03
Even within these images,
00:26:05
we're starting to see some interesting signals
00:26:07
that might tell us about someone's risk factors for heart disease.
00:26:11
And from there, you start to think about
00:26:13
all of the images that we collect in medicine.
00:26:16
Can you use AI or an algorithm
00:26:18
to help patients and doctors
00:26:20
get ahead of a given diagnosis?
00:26:23
Take cancer as an example of how AI can help save lives.
00:26:26
We could take a sample of somebody's blood
00:26:29
and look for the minuscule amounts of cancer DNA
00:26:31
or tumor DNA in that blood. This is a great application for machine learning.
00:26:36
[Downey] And why stop there?
00:26:38
Could AI accomplish
00:26:40
what human researchers have not yet been able to?
00:26:42
Figuring out how cells work well enough
00:26:45
that you can understand why a tumor grows and how to stop it
00:26:48
without hurting the surrounding cells.
00:26:50
[Downey] And if cancer could be cured,
00:26:52
maybe mental health disorders,
00:26:54
like depression, or anxiety.
00:26:56
There are facial and vocal biomarkers
00:26:58
of these mental health disorders.
00:27:00
People check their phones 15 times an hour.
00:27:03
So that's an opportunity
00:27:05
to almost do, like, a well-being checkpoint.
00:27:08
You can flag that to the individual,
00:27:10
to a loved one,
00:27:11
or in some cases even to a doctor.
00:27:14
[Bran Ferren] If you look at the overall field of medicine,
00:27:17
how do you do a great job of diagnosing illness?
00:27:21
Having artificial intelligence,
00:27:24
the world's greatest diagnostician, helps.
00:27:31
[Downey] At Google,
00:27:33
Julie and the Euphonia team have been working for months
00:27:35
trying to find a way for former NFL star Tim Shaw
00:27:38
to get his voice back.
00:27:39
[Dimitri Kanevsky speaking]
00:27:47
Yes! So Zach's team, the DeepMind team,
00:27:50
has built a model that can imitate your voice.
00:27:53
For Tim, we were lucky,
00:27:55
because, you know, Tim has a career of NFL player,
00:27:59
so he did multiple radio interviews and TV interviews,
00:28:02
so he sent us this footage.
00:28:04
Hey, this is Tim Shaw, special teams animal.
00:28:07
Christmas is coming, so we need to find out
00:28:10
what the Titans players are doing.
00:28:12
If you gotta hesitate, that's probably a "no."
00:28:14
[Cattiau] Tim will be able to type what he wants,
00:28:17
and the prototype will say it in Tim's voice.
00:28:21
I've always loved attention.
00:28:23
Don't know if you know that about me.
00:28:24
[laughs] She's gonna shave it for you.
00:28:26
[Downey] Interpreting speech is one thing,
00:28:29
but re-creating the way a real person sounds
00:28:31
is an order of magnitude harder.
00:28:33
Playing Tecmo Bowl, eating Christmas cookies, and turkey.
00:28:36
[Downey] Voice imitation is also known as voice synthesis,
00:28:40
which is basically speech recognition in reverse.
00:28:43
First, machine learning converts text back into waveforms.
00:28:46
These waveforms are then used to create sound.
00:28:50
This is how Alexa and Google Home are able to talk to us.
00:28:53
Now the teams from DeepMind and Google AI
00:28:56
are working to create a model
00:28:58
to imitate the unique sound of Tim's voice.
00:29:01
Looks like it's computing.
00:29:03
But it worked this morning?
00:29:05
We have to set expectations quite low.
00:29:08
[Cattiau] I don't know how our model is going to perform.
00:29:11
I hope that Tim will understand
00:29:13
and actually see the technology for what it is,
00:29:16
which is a work in progress and a research project.
00:29:20
[Downey] After six months of waiting, Tim Shaw is about to find out.
00:29:26
The team working on his speech recognition model
00:29:28
is coming to his house for a practice run.
00:29:33
[doorbell rings]
00:29:34
[dog barks]
00:29:39
[Sharon] Good girl, come say hello.
00:29:41
-Hi! -Oh, hi!
00:29:43
Welcome.
00:29:44
-Hi! -Come in.
00:29:45
Thanks for having us.
00:29:47
[Sharon] He's met some of you before, right?
00:29:49
How are you doing, Tim?
00:29:51
-Hi, Tim. -Good to see you.
00:29:53
-Hello. -Hello.
00:29:55
Hi.
00:29:56
Hi, I'm Julie. We saw each other on the camera.
00:30:01
It's warmer here than it is in Boston.
00:30:03
[Sharon] As it should be.
00:30:05
[all laughing]
00:30:09
Okay.
00:30:11
Lead the way, Tim.
00:30:13
[Cattiau] I'm excited to share with Tim and his parents
00:30:16
what we've been working on.
00:30:18
I'm a little bit nervous. I don't know if the app
00:30:21
is going to behave the way we hope it will behave,
00:30:24
but I'm also very excited, to learn new things
00:30:27
and to hear Tim's feedback.
00:30:29
So I brought two versions with me.
00:30:32
I was supposed to pick, but I decided to just bring both
00:30:34
just in case one is better than the other,
00:30:37
and, just so you know,
00:30:38
this one here was trained
00:30:41
only using recordings of your voice,
00:30:44
and this one here was trained using recordings of your voice,
00:30:47
and also from other participants from ALS TDI
00:30:51
who went through the same exercise of... [laughing]
00:30:54
So, okay.
00:30:57
I was hoping we could give them a try.
00:30:59
Are we ready?
00:31:03
Who are you talking about?
00:31:12
[app chimes]
00:31:15
It got it.
00:31:16
[John] It got it.
00:31:19
[gasps]
00:31:24
[Tim] Is he coming?
00:31:27
[app chimes]
00:31:29
Yes.
00:31:33
Are you working today?
00:31:36
[app chimes]
00:31:44
[chuckling]
00:31:46
It's wonderful.
00:31:48
[Cattiau] Cool.
00:31:49
Thank you for trying this.
00:31:51
-Wow! -It's fabulous.
00:31:53
[John] What I love, it made mistakes,
00:31:55
-and then it corrected itself. -Yeah.
00:31:57
I was watching it like, "That's not it,"
00:31:59
and then it went... [mimics app] Then it does it right.
00:32:02
These were phrases,
00:32:04
part of the 70% that we actually used
00:32:06
to train the model,
00:32:08
but we also set aside 30% of the phrases,
00:32:11
so this might not do as well,
00:32:14
but I was hoping that we could try some of these too.
00:32:18
[John] So what we've already done
00:32:20
is him using phrases that were used to train the app.
00:32:23
That's right.
00:32:24
Now we're trying to see if it can recognize phrases
00:32:27
-that weren't part of that. -[Cattiau] Yes, that's right.
00:32:30
So let's give it a try?
00:32:33
Do you want me to?
00:32:39
Do you have the time to play?
00:32:44
[app chimes]
00:32:47
What happens afterwards?
00:32:52
[app chimes]
00:32:53
Huh. So, on the last one,
00:32:55
this one got it, and this one didn't.
00:32:58
-We'll pause it. So... -I love the first one, where it says,
00:33:02
-"Can you help me take a shower?" -[laughing]
00:33:04
-[Cattiau] That's not at all what he said. -[John] I know,
00:33:07
you've gotta be really careful what you ask for.
00:33:10
[all laughing]
00:33:12
[John] So if, when it's interpreting his voice,
00:33:15
and it makes some errors,
00:33:17
is there a way we can correct it?
00:33:19
Yeah. We want to add the option
00:33:22
for you guys to fix the recordings,
00:33:24
but as of today, because this is the very first time
00:33:27
we actually tried this,
00:33:28
we don't have it yet.
00:33:30
[Cattiau] This is still a work in progress.
00:33:32
We have a speech recognition model
00:33:34
that works for Tim Shaw,
00:33:36
which is, you know, one person,
00:33:38
and we're really hoping that, you know,
00:33:41
this technology can work for many people.
00:33:43
There's something else I want you to try,
00:33:46
if that's okay?
00:33:47
We're working with another team at Google called DeepMind.
00:33:50
They're specialized in voice imitation and synthesis.
00:33:56
[Downey] In 2019,
00:33:57
Tim wrote a letter to his younger self.
00:34:02
They are words written by a 34-year-old man with ALS
00:34:05
who has trouble communicating
00:34:08
sent back in time
00:34:09
to a 22-year-old on the cusp of NFL greatness.
00:34:15
[Cattiau] So let me give this a try.
00:34:18
I just like using this letter because it's just so beautiful,
00:34:21
so let me see if this is gonna work.
00:34:27
[Tim's younger voice] So, I've decided to write you this letter
00:34:30
'cause I have so much to tell you.
00:34:32
I want to explain to you
00:34:33
why it's so difficult for me to speak,
00:34:36
the diagnosis, all of it,
00:34:38
and what my life is like now,
00:34:39
'cause one day, you will be in my shoes,
00:34:41
living with the same struggles.
00:34:57
It's his voice, that I'd forgotten.
00:35:14
We do.
00:35:47
[app chimes]
00:36:03
[app chimes]
00:36:05
We're so happy to be working with you.
00:36:07
It's really an honor.
00:36:12
[John] The thought that one day,
00:36:14
that can be linked with this,
00:36:18
and when you speak as you are now,
00:36:20
it will sound like that, is...
00:36:29
It's okay. We'll wait.
00:36:33
[Cattiau] There is a lot of unknown
00:36:34
and still a lot of research to be conducted.
00:36:37
We're really trying to have a proof of concept first,
00:36:41
and then expand to not only people who have ALS,
00:36:44
but people who had a stroke,
00:36:45
or a traumatic brain injury, multiple sclerosis,
00:36:48
any types of neurologic conditions.
00:36:51
Maybe other languages, too, you know?
00:36:52
I would really like this to work for French, for example.
00:36:56
[Mega] Wouldn't it be a wonderful opportunity
00:36:58
to bring technology to problems that we're solving
00:37:01
in life science and healthcare,
00:37:03
and in fact, it's a missed opportunity
00:37:05
if we don't try to bring the best technologies
00:37:07
to help people.
00:37:09
This is really just the beginning.
00:37:10
[Downey] Just the beginning indeed.
00:37:13
Imagine the possibilities.
00:37:15
I think in the imaginable future for AI and healthcare
00:37:19
is that there is no healthcare anymore,
00:37:22
because nobody needs it.
00:37:24
You could have an AI that is directly talking to your immune system,
00:37:27
and is actually preemptively creating the antibodies
00:37:30
for the epidemics that are coming your way,
00:37:32
and will not be stopped.
00:37:34
This will not happen tomorrow, but it's the long-term goal
00:37:37
that we can point towards.
00:37:43
[Downey] Tim had never heard his own words
00:37:46
read out loud before today.
00:37:48
Neither had his parents.
00:37:51
[Tim] Every single day is a struggle for me.
00:37:54
I can barely move my arms.
00:37:58
[John] Have fun.
00:38:00
I can't walk on my own,
00:38:02
so I recently started using a wheelchair.
00:38:07
I have trouble chewing and swallowing.
00:38:10
I'd kill for a good pork chop.
00:38:12
Yes, my body is failing, but my mind is not giving up.
00:38:18
Find what's most important in your life,
00:38:20
and live for that.
00:38:24
Don't let three letters, NFL,
00:38:26
define you...
00:38:28
[crowd cheering]
00:38:31
...the same way I refuse to let three letters define me.
00:38:43
[John] One of the things Tim has taught us,
00:38:45
and I think it's a lesson for everyone...
00:38:48
Medically speaking, Tim's life has an end to it.
00:38:51
In fact, five years ago we were told he only had two to five years left.
00:38:55
We're already past that.
00:38:58
He has learned very quickly
00:39:01
that today is the day that we have,
00:39:05
and we can ruin today
00:39:08
by thinking about yesterday
00:39:10
and how wonderful it used to be,
00:39:12
and, "Oh, woe is me," and "I wish it was like that."
00:39:16
We can also ruin today
00:39:17
by looking into the future,
00:39:19
and in Tim's case,
00:39:20
how horrible this is going to be.
00:39:22
"This is going to happen,"
00:39:23
"I won't be able to do this anymore."
00:39:26
So if we go either of those directions,
00:39:28
it spoils us from being present today.
00:39:31
That's a lesson for all of us.
00:39:32
Whether we have an ALS diagnosis or not,
00:39:35
try to see the good and the blessing of every day.
00:39:38
You're here with us today.
00:39:40
It's going to be a good day.