00:00:00
We hear sound because our ears can detect
vibrations in the air, which come from sources
00:00:05
like everyday objects, speakers and other
people talking.
00:00:09
These vibrations travel through the air as
areas with higher pressure, known as compressions,
00:00:15
and areas with lower pressure, known as rarefactions,
bounce back and forth, carrying this longitudinal
00:00:21
wave forward.
00:00:22
If we graph the air pressure across the space
that the sound is travelling through, we can
00:00:27
see the wave shape of the sound.
00:00:30
This is Inspecto, and this video will be all
about sound frequencies, wave shapes and the
00:00:34
math behind it all.
00:00:39
You probably know that a sound’s pitch is
dependent on the frequency of its sound wave.
00:00:44
Frequency and pitch hold an exponential relationship,
meaning that a difference of a certain number
00:00:49
of hertz, for example a 200 Hz difference
from 200 Hz to 400 Hz, and from 400 Hz to
00:00:58
600 Hz, will not result in the same perceived
difference in pitch.
00:01:06
Rather, the same ratio between different frequencies,
for example a 1:2 ratio from 200 Hz to 400
00:01:14
Hz, then from 400 Hz to 800 Hz, is what will
result in the same perceived difference in
00:01:24
pitch.
00:01:28
How does this tie into music, and its familiar
12 note scale?
00:01:33
Two notes that are one octave apart are named
the same, and kind of sound like they are
00:01:37
the same note… but one is higher pitch,
and one is lower.
00:01:46
This important gap of the octave is precisely
a double in frequency.
00:01:51
All other note intervals are derived from
this – if each semitone is 1/12 of an octave,
00:01:57
the ratio of a semitone would be the 12th
root of 2, or 2 to the power of 1/12.
00:02:04
This way, to increase a frequency by one semitone,
it can simply be multiplied by 2 to the one
00:02:10
twelfth.
00:02:11
Increasing that frequency by 12 semitones,
or an octave, it would be multiplied 12 times,
00:02:16
which is the 2 to the one twelfth, to the
power of 12 – in other words, 2 to the power
00:02:21
of 12/12, which is just multiplying it by
2.
00:02:26
Combinations of two notes at certain intervals
can sound “consonant”, which means they
00:02:30
sound like they “agree” with each other,
when the two frequencies have a simple whole
00:02:35
number ratio.
00:02:36
For example, the octave has a ratio of 1:2
– really simple ratio.
00:02:41
Other than the octave, the most consonant
interval is arguably the perfect fifth, which
00:02:46
is 7 semitones.
00:02:51
Let’s see what 2 to the power of 7/12 is
– it is approximately 1.498, which is quite
00:03:00
close to 1.5.
00:03:02
1:1.5 simplifies to 2:3 – another really
simple ratio.
00:03:08
The reason a simple whole number ratio sounds
consonant probably has to do with the period
00:03:13
of the resulting wave when the two waves are
added together.
00:03:17
With a 1:2 ratio, the resulting wave’s period
is 1x the lower frequency wave and 2x the
00:03:24
higher frequency wave.
00:03:26
With a 2:3 ratio, the resulting period is
2x the lower frequency wave and 3x the higher
00:03:33
frequency wave.
00:03:35
These simple ratios make it easy for our brains
to notice a regular pattern when the two notes
00:03:40
are added together.
00:03:41
Now let’s hear what a dissonant interval
sounds like – one where the notes sound
00:03:46
like they “disagree” with each other.
00:03:49
The most dissonant interval is arguably the
tritone, which is 6 semitones.
00:03:58
2 to the power of 6/12, or one half, is just
square root of 2, equal to about 1.4142.
00:04:07
This results in an irrational ratio which
isn’t close to any simple ratio, resulting
00:04:12
in a very irregular pattern when the waves
of notes one tritone apart are added together.
00:04:18
A chord consists of three or more notes.
00:04:21
The most consonant chord is arguably the basic
major triad, which is just a root, a perfect
00:04:31
fifth, and a major third added in the middle,
4 semitones above the root.
00:04:39
2 to the power of 4/12 is about 1.26, which
is close to one and a quarter.
00:04:47
This results in, once again, quite a simple
ratio of 4:5.
00:04:51
However, to see that dissonant intervals also
have their roles, we can look at jazz chords
00:04:56
such as the dominant 7th chord
and diminished 7th chord, which both have
00:05:13
dissonant intervals in them, but sound nice.
00:05:22
Sinusoidal waves are the simplest waves that
us humans can hear – we perceive them to
00:05:26
only be one frequency.
00:05:29
Now the Fourier theorem states that periodic
functions that are reasonably continuous can
00:05:34
often be expressed as the sum of different
sinusoidal functions.
00:05:39
For sound, this means that the sounds we hear
can be broken down into a combination of many
00:05:43
simple sinusoidal waves, each with their respective
frequencies.
00:05:48
This information is very important to understanding
the sounds produced by musical instruments,
00:05:53
or when synthesising our own musical sounds.
00:05:56
The ratios of intervals within the 12-note
scale range from 1 to 2, in 2 to the 1/12
00:06:03
geometric intervals.
00:06:05
But if we look at frequencies that are integer
multiples of the fundamental frequency…
00:06:10
like 2x… 3x… 4x… 5x… etc, those form
a whole nother system known as the harmonic
00:06:23
series.
00:06:24
The significance of the harmonic series is
that adding on these harmonics doesn’t change
00:06:29
the period of the fundamental.
00:06:32
As a result, the combined sound sounds tonal
– it’s very clear what pitch or note the
00:06:37
sound is – it’s just the pitch of the
fundamental frequency.
00:06:41
On the flip side of this, when a sound contains
frequencies that are not in the harmonic series
00:06:46
of its fundamental frequency, known as inharmonic
frequencies, it sounds more atonal – in
00:06:53
other words, it’s harder to tell its pitch
or note.
00:06:57
Comparing the frequencies of a piano to a
drum, we can see that the tonal instrument
00:07:07
does have more harmonic frequencies.
00:07:10
In fact, we can theoretically recreate a musical
instrument digitally, if we use a synthesiser
00:07:15
to match all its frequencies, including its
fundamental, harmonic and inharmonic frequencies
00:07:21
– there are of course other factors to match
as well, but those are the most important.
00:07:37
The science of sound waves also applies to
the world of sound design, which is the artificial
00:07:42
creation of instruments and sound effects,
such as in electronic music.
00:07:48
Sound design is all about the combination
and manipulation of wave shapes, which come
00:07:53
in four basic forms: the sine wave, triangle
wave, square wave and saw wave.
00:08:11
The way that these waves each sound can be
explained by how they can be broken down into
00:08:16
simpler sinusoidal waves, each with their
own frequency.
00:08:20
First of all, the sine wave sounds the simplest,
as it only represents one frequency.
00:08:26
However, here’s an equation that represents
an infinite sum of waves.
00:08:31
Since it’s a wave, it’s a function of
time, with t being the time, and f being the
00:08:36
fundamental frequency.
00:08:38
The sigma means that many copies of this expression
are being added up, with increasing values
00:08:43
of k.
00:08:45
This scales up the input of the sine function,
increasing the frequency of successive waves,
00:08:52
but scales down the output of the sine function,
decreasing the amplitude of successive waves.
00:08:59
This specific equation is for the square wave.
00:09:02
As you can see, the expression uses 2k – 1
instead of k, which gives all the odd integers.
00:09:08
This means that all the odd harmonics are
present in a square wave, in addition to the
00:09:13
fundamental – which is why it sounds so
full compared to the sine wave.
00:09:18
Starting as a one sine wave and adding more
and more sine waves following this equation,
00:09:23
we can see that the sum does approach the
shape of a square wave.
00:09:28
The triangle wave can be made by also adding
up the odd harmonics, but this time making
00:09:33
every second one, that is the 1st, 5th, 9th
etc, negative, and the denominator is squared,
00:09:42
making the successive harmonics decrease in
amplitude much faster.
00:09:46
That’s why if we look at an analyser, the
harmonics of a triangle wave are not as rich
00:09:51
as those of a square wave, and you can hear
it too – here’s a triangle wave, and here’s
00:09:58
a square wave.
00:10:02
The saw wave can be made by adding up all
the harmonics, not just the odd ones, and
00:10:08
making every second one negative.
00:10:10
Having all harmonics is what makes the saw
wave the richest sounding wave out of all
00:10:15
the basic wave shapes, and that’s why it
is often used for elements in electronic music
00:10:21
that need to have a big, rich sound.
00:10:23
This could be a supersaw, which is used to
voice thick chords… a pad, which is used
00:10:36
to fill up the atmosphere… or even a saw
bass, which can be used for an aggressive
00:10:45
low end.
00:10:46
Altogether, they sound like this.
00:10:54
The other waves of course also have their
place.
00:10:57
For example, a square wave could be used to
make 8-bit, chip tune sounds.
00:11:04
A triangle wave could also be used to make
some unique lead sounds.
00:11:14
And lastly, a distorted sine wave can be used
to make an 808.
00:11:38
So that was some of the basic math behind
music and sound synthesis, which hopefully
00:11:43
can now give you a fresh perspective when
encountering music and instruments.
00:11:47
Hope you enjoyed the video, until next time.