00:00:00
in this video we continue talking about
00:00:02
transcription with a particular focus on
00:00:05
eukariotic
00:00:06
transcription so transcription as we
00:00:08
have seen before is the process whereby
00:00:11
the information in the DNA is converted
00:00:14
into RNA molecules and when we think
00:00:17
about it in terms of genes we have a
00:00:19
piece of DNA where we have a coding
00:00:22
region the start of the transcription in
00:00:24
this genene is called the transcription
00:00:26
start site which is the plus one site
00:00:29
all things on the right side of the TSS
00:00:32
are referred to as Downstream and all
00:00:35
things left side of the TSS are referred
00:00:38
to be on the Upstream we saw in the
00:00:41
procaryotic transcription case the
00:00:44
promoters of the genes were strictly
00:00:46
present Upstream of the genes but in
00:00:49
eukariotic genes the promoter can be
00:00:51
present Downstream of the TSS as well
00:00:54
and they often also overlap with the
00:00:56
coding region so in this video we will
00:00:59
explore exp the promoter structure of
00:01:01
eukaryotic genes so let's begin by
00:01:04
defining promoters it is a DNA sequence
00:01:07
that defines where the transcription of
00:01:10
a gene begins via the RNA polymerise and
00:01:13
its Associated factors so let's dive
00:01:16
into the
00:01:17
promoters let's begin by drawing a
00:01:19
cartoon of the DNA where we have the two
00:01:21
strands running anti- parallel the TSS
00:01:25
will be the plus one site for the Gene
00:01:28
and we will have plus two plus3 and so
00:01:30
on towards the downstream and -1 -2
00:01:34
positions and so on on the Upstream
00:01:37
notice that there is no site zero so the
00:01:40
most common promoter element that we
00:01:42
hear about in new curiot is the Tata box
00:01:46
flanking the upstream and the downstream
00:01:48
location of the Tara box we sometimes
00:01:51
find Bru and b d elements if we go
00:01:56
further Upstream we find something known
00:01:59
as the C cat box and even further we can
00:02:03
sometimes find a GC box and we will
00:02:06
discuss these guys in details in a bit
00:02:09
so these are the promoters which are
00:02:11
strictly Upstream of the TSS now there
00:02:14
is a promoter called The initiator Motif
00:02:17
which overlaps the TSS and therefore
00:02:20
flanks both the upstream and the
00:02:22
downstream
00:02:23
regions and this is the overlapping
00:02:25
promoter type that I mentioned
00:02:28
previously often Downstream of initiator
00:02:31
we can find an Associated promoter
00:02:33
element called mte and in some other
00:02:36
cases we can also find DPE and in rare
00:02:40
cases we also have some specific ones
00:02:43
known as the
00:02:45
xcp1 in this video however we won't be
00:02:48
discussing
00:02:50
xcp1 okay so let's now discuss these
00:02:53
guys in a slightly more
00:02:55
detail let's start with the Tata box
00:02:58
this is present between -31 to -25 and
00:03:03
it has a consensus sequence t a t a w a
00:03:07
a r the W and R are the iopac symbols
00:03:11
and if you want to learn more about them
00:03:13
I have some links in the description
00:03:15
that you can check the sequence binds
00:03:18
tf2d which is a transcription factor and
00:03:21
more specifically it is the tbp subunit
00:03:24
of the tf2d that is involved in The
00:03:27
Binding the tbp refers to the binding
00:03:30
protein tf2d or transcription Factor 2D
00:03:34
is involved in recruiting RNA polymerase
00:03:36
to the promoter and in the procaryotic
00:03:39
promoter video we said that the
00:03:41
procaryotic homologue of tatab box could
00:03:44
be the -10 element which is also known
00:03:46
as the pnau Box flanking the tarab box
00:03:50
we have the BR Elements which stand for
00:03:54
B recognition elements the B here refers
00:03:57
to the fact that these elements are
00:03:59
recognized by tf2b transcription Factor
00:04:02
the U in the B refers to the Upstream
00:04:05
location of this element relative to the
00:04:07
Tata box its consensus is SSR cgc CG so
00:04:14
it makes this a very GC Rich region the
00:04:17
Upstream location of BR is between -37
00:04:21
to -32 the other side of the B is
00:04:25
Downstream which refers to its
00:04:28
Downstream location and this is present
00:04:30
between -26 to -20 the consensus of p d
00:04:35
is fairly at Rich which is in contrast
00:04:38
to the GC Rich consensus of BR U
00:04:42
functionally BR elements are considered
00:04:44
regulatory in nature as they can both
00:04:47
positively or negatively impact the
00:04:49
promoter capacity of adjacent motifs in
00:04:52
this case especially the Tata box now we
00:04:56
go on a bit further upstream and we can
00:04:58
find that there is a CCAP at box and
00:05:00
this is usually seen between 00 to -60
00:05:04
location the consensus for this is ggcc
00:05:09
a a
00:05:10
TCT this box usually binds non-canonical
00:05:13
transcription factors like nucleo Factor
00:05:16
proteins from family A B and C it also
00:05:20
binds some other class of proteins known
00:05:23
as the ccat enhancer binding proteins
00:05:27
cbps for short and for its ability to
00:05:30
bind these non-canonical transcription
00:05:32
factors the ccat box is also called an
00:05:36
enhancer like promoter a bit Upstream of
00:05:39
ccat box is another promoter element
00:05:42
known as the GC box and this is present
00:05:45
between 1110
00:05:47
to15 position and its consensus as the
00:05:51
name implies is quite GC rich like The
00:05:55
ccat Box it also binds non-canonical
00:05:58
transcription factors such as the sp1
00:06:01
and migy and for this reason GC box
00:06:04
similar to ccat box is also known as an
00:06:07
enhancer like promoter now let's talk
00:06:10
about the promoters on the other side
00:06:12
and let's start with the initiator which
00:06:15
as we said previously overlaps the
00:06:17
downstream and the Upstream parts of the
00:06:20
TSS the exact location of initiator
00:06:22
motif is highly variable but you can
00:06:25
expect to find it somewhere between -6
00:06:28
and + 45 the consensus of initiator is
00:06:31
also highly variable in humans for
00:06:34
instance it has a consensus yya and Wy y
00:06:40
if you look at initiator motive
00:06:41
consensus in other eukaryotic organisms
00:06:44
for example mouse or flies the consensus
00:06:47
is very different the initiator motive
00:06:50
is usually Bound by tf2d just like the T
00:06:54
box a neat fact about initiator among
00:06:56
the few that we will talk about later is
00:06:58
that it was was first discovered in a
00:07:00
tdt gene in lymphocytes the promoter of
00:07:04
this Gene does not have Tata box which
00:07:06
led to speculate the discovery of other
00:07:09
promoter elements and it led to the
00:07:11
discovery of initiator
00:07:13
motif finally we have promoters which
00:07:16
are Downstream of the TSS and these are
00:07:19
the motif 10 elements mte and the
00:07:22
downstream promoter element DPE they
00:07:25
were originally discovered in dropa but
00:07:28
since they're discovered they have also
00:07:30
been found in human genes the mte you
00:07:33
can find between + 18 to plus 27
00:07:37
position which is strictly Downstream of
00:07:39
the TSS and likewise DP which is also
00:07:42
Downstream of TSS is present somewhere
00:07:45
between -28 to + 33 and just like the
00:07:49
initiator element these are also Bound
00:07:52
by tf2d transcription Factor the
00:07:55
functional capacity of mte and DP is
00:07:58
considered to to be cooperative and it
00:08:01
has been shown that their function AIDS
00:08:03
in the potential of initiator motif to
00:08:05
begin
00:08:06
transcription thus they are largely
00:08:09
involved in regulatory function and
00:08:11
specifically it is the spacing and
00:08:13
distance of mte or DP from the initiator
00:08:17
which determines how strongly they will
00:08:19
impact the process of transcription
00:08:21
initiation with this knowledge of
00:08:24
promoters we can now think about a
00:08:26
couple of things the first idea is of
00:08:28
core promoters or basil promoters this
00:08:32
refers to a minimal stretch of DNA which
00:08:34
is sufficient to accurately begin
00:08:37
transcription when we talk about core
00:08:39
promoters we think about the ones that
00:08:42
bind canonical transcription factors and
00:08:45
these are usually the ones that span
00:08:47
from -35 to+
00:08:49
35 this also suggests that not all
00:08:52
promoter elements that we talk about
00:08:54
here are required to begin the
00:08:56
transcription so what is more profound
00:08:58
to realize I I is that you can have a
00:09:00
gene which only has the Tata box as its
00:09:03
promoter and nothing else and you can
00:09:05
have a gene which only has initiator
00:09:07
Motif in its promoter side and you can
00:09:10
also have another Gene where you find
00:09:12
both Tata boxes and initiator Motif at
00:09:15
the promoters in other cases you may
00:09:17
have only the Tata box and the BR
00:09:20
Downstream element so the point is that
00:09:23
you can have any combination of these
00:09:25
promoters in a gene now the more
00:09:27
interesting bit about promoter is the
00:09:29
misconception about t boxes which if you
00:09:32
look at the data in human genes is only
00:09:35
present in about 15% of the genes and
00:09:38
the misconception is that the T box
00:09:41
which is bound by tbp is somehow the
00:09:43
universal promoter which is not the case
00:09:46
tbp not only binds Tata box it is also
00:09:49
involved in The Binding of initiator
00:09:51
motifs if you look at the statistics
00:09:54
you'll see that almost 40% of the genes
00:09:57
in humans have initiator motif in their
00:09:59
promoters and 30% of these 45% don't
00:10:03
even have Tata boxes so this idea of tab
00:10:07
box being Universal promoter is not true
00:10:10
and this idea of universal promoter
00:10:13
itself is very incorrect because that
00:10:15
would make the process of transcription
00:10:17
initiation very
00:10:19
inflexible the final thing that I wanted
00:10:21
to add to this video is the idea of
00:10:24
types of transcription starts which can
00:10:26
be either focused or dispersed
00:10:29
so let's say you have a gene where you
00:10:31
only have one promoter element you will
00:10:33
always initiate transcription at a
00:10:35
single spot and that's called a focused
00:10:38
transcription start but in genes where
00:10:40
you have multiple promoters in tandem
00:10:43
you have different choices to begin the
00:10:45
transcription and this will result in
00:10:48
dispersed transcription start all right
00:10:51
so let's quickly summarize what we have
00:10:54
learned here we have some promoter
00:10:56
Elements which are located strictly
00:10:58
Downstream of of the TSS there are some
00:11:01
which overlap the TSS and there are some
00:11:04
promoters which are strictly Upstream of
00:11:06
the TSS a majority of these from -35 to
00:11:11
+ 35 are referred to as core promoter
00:11:14
elements and they bind canonical
00:11:17
transcription factors like tf2d and tf2b
00:11:20
the distal promoter elements Beyond Nega
00:11:23
35 buying the non-canonical
00:11:25
transcription factors like sp1 migy
00:11:29
CBP and Korea and finally we said that
00:11:33
tab box is not a universal promoter but
00:11:36
I would like you to know that the Tata
00:11:38
box is perhaps one of the oldest
00:11:39
promoter elements that we know of
00:11:42
because you can find it in virtually all
00:11:43
ukots spanning both plant and animals
00:11:47
and you can even find it in some archa
00:11:49
bacteria again this does not imply that
00:11:52
tarab box is a universal promoter
00:11:54
element and with that we have covered
00:11:57
the Crux of eukariotic prom elements