the deadly nightshade (amanita phalloides), one of the most

26
The deadly nightshade (Amanita phalloides), one of the most poisonous mushrooms known, contains a-amanitin, which blocks eukaryotic messenger RNA synthesis because of its potent inhibition of RNA polymerase II. 24

Upload: vuonghanh

Post on 01-Jan-2017

217 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: The deadly nightshade (Amanita phalloides), one of the most

The deadly nightshade (Amanita phalloides), one of the most poisonous mushrooms known, contains a-amanitin, which blocks eukaryotic messenger RNA synthesis because of its potent inhibition of RNA polymerase II.

24M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 730 30/10/14 2:01 PM

Page 2: The deadly nightshade (Amanita phalloides), one of the most

731

Transcription and Post-transcriptional Processing

We turn now to transcription, the process by which information

stored in the nucleotide sequence of DNA is read out through

the template-dependent synthesis of polyribonucleotides.

Mechanistically, transcription is similar to DNA replication,

particularly in the use of nucleoside triphosphate substrates and

the template-directed growth of nucleic acid chains in a 5′ S 3′

direction. There are, however, two major differences. First, with

a few exceptions, only one strand of the DNA duplex is used as

a template and transcribed for any particular gene, and second,

only part of the entire genome is transcribed in any one cell. The

importance of understanding transcription lies in regulation—the

processes used to select particular genes and template strands

for transcription. This selection, in large part, governs the

metabolic capabilities of a cell. These regulatory mechanisms

chiefly operate at the levels of initiation and

termination of transcription, through the

actions of proteins that contact DNA in a highly

site-specific manner. Figure 24.1 gives a preview

of what we have learned about transcriptional

regulation through the analysis of DNA–protein

interactions. We discuss this topic both here and in

Chapter 26.

Chapter 24

24.1 DNA as the Template for RNA Synthesis

24.2 Enzymology of RNA Synthesis: RNA Polymerase

24.3 Mechanism of Transcription in Bacteria

24.4 Transcription in Eukaryotic Cells

24.5 Post-transcriptional Processing

Tools of Biochemistry

24.A DNA Microarrays

24.B Chromatin Immunoprecipitation

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 731 30/10/14 2:01 PM

Page 3: The deadly nightshade (Amanita phalloides), one of the most

Subunits of thehomodimericrepressor

a-helical region (red)designed to makespecific contacts with DNA bases.

Figure 24.1Regulation of transcription by DNA–protein interactions. Gene expression is controlled by proteins that recognize particular DNA sequences and bind at those sites. Shown here is a regulatory protein (a repressor) of bacteriophage l, which binds to a specific operator site on l phage DNA and regulates the viral reproductive cycle by controlling the transcription of genes adjacent to this site. Yellow and blue depict subunits of the homodimeric repressor, and red depicts a-helical regions of the repressor, designed to make specific contacts with DNA bases. PBD ID 1LMB.

732 | Chapter 24 Transcription and Post-transcriptional Processing

The products of transcriptive RNA synthesis are rarely used directly. In eukaryotic cells transcription products are subject to further processing, including trimming, cutting, splicing, and modifications at the 3′ and 5′ ends. Comparable processing occurs for prokaryotic ribosomal and transfer RNAs. Therefore, we also discuss post-transcriptional processing in this chapter. We are concerned primarily with the synthesis and processing of messenger RNA, ribosomal RNA, and transfer RNA, the species most directly involved in gene expression. However, some RNA transcripts serve additional functions, including catalysis (Chapters 8 and 25) and the telomerase reaction (Chapter 22). In Chapter 26, we will discuss other RNA species and functions—RNA editing, riboswitches, small interfering RNAs, microRNAs, and noncoding RNAs.

24.1 DNA as the Template for rNA SynthesisThe idea that RNA molecules are complementary copies of DNA tem-plates originated from genetic analysis. In France’s Pasteur Institute, François Jacob, Jacques Monod, and Andre Lwoff were studying genes controlling the utilization of lactose in E. coli and the control of lysog-eny in bacteriophage l. Recall from Chapter 23 that lysogeny is the

state of gene repression after insertion of a circular bac-teriophage genome into the chromosome of a host bac-terium. Both the lactose and

phage l systems are described more fully in Chapter 26. Here we focus upon lactose utilization.

The Predicted existence of Messenger rNALactose utilization in E. coli was known to be controlled by three en-zymes, whose genes are adjacent on the chromosome. One of these enzymes is b-galactosidase, which hydrolyzes lactose and other b-galactosides. When bacteria are grown in a medium containing both glucose and lactose, the genes controlling lactose utilization are silent, and glucose is preferentially metabolized. Once the glu-cose supply is exhausted, the three enzymes of lactose utilization are rapidly synthesized, allowing the bacteria to take up and metabolize lactose. This enzyme induction is rapidly reversed upon adding glu-cose back to the medium, whereupon the b-galactosidase activity reverts to a low level. The rapid changes in b-galactosidase-forming capacity suggested that the template for synthesizing this enzyme is metabolically unstable—synthesized rapidly on demand and de-graded when the stimulus to induction is removed. Ribosomal and transfer RNAs are metabolically stable. Hence, these species were

unlikely to be intermediates in information transfer. What might these observations tell us about RNA synthesis?

Jacob and Monod analyzed E. coli mutants that displayed faulty control over induction of the lactose-utilizing enzymes. Some ex-pressed all three genes at high levels even when lactose or a similar inducer was absent, and others could not produce any of the en-zymes, even after the addition of lactose to a bacterial culture. Based on these studies, as well as parallel work by Lwoff with phage l, in 1961 Jacob and Monod proposed a unifying hypothesis of gene reg-ulation in which transcription, or copying information encoded in

ConCept Our original concepts of RNA synthesis came from genetic studies that predicted the existence of messenger RNA.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 732 30/10/14 2:01 PM

Page 4: The deadly nightshade (Amanita phalloides), one of the most

24.1 DNA as the Template for RNA Synthesis | 733

Regulator gene

Inducer

Messenger RNA

Proteins

Repressor–inducercomplex

Repressor

Operator Structural genesR O SG1 SG2 SG3

SG1 SG2 SG3

2

1 4

5

3Loosening facilitates transcription of the structural genes, resulting in production of mRNA — an RNA copy of the structural genes.

The mRNA sequence is translated into proteins.

The repressor–inducer complex binds less tightly to the operator.

A small-molecule inducer complexes with the repressor, altering the equilibrium between conformational states of the repressor.

The regulator gene R encodes a repressor molecule, which can bind to the operator (O) and thereby inhibit transcription of the adjacent structural genes SG1,2,3.

were labeled for short periods (a 3- or 4-minute “pulse”) with [32P]orthophosphate, about 2% of the total RNA became radio-active. This radiolabeled RNA had two properties that led to its eventual identification as viral mRNA. First, it was metaboli-cally unstable; after termina-tion of the labeling interval, this RNA rapidly lost radioactivity, as if the labeled species were be-ing degraded. Second, the RNA seemed to be a product of viral DNA metabolism because its nucleotide composition resem-bled that of T2 DNA; the radi-olabeled RNA was rich in ad-

enine and uracil and low in guanine and cytosine; T2 DNA has a high A + T content.

Additional evidence came from sucrose gradient centrifugation of the rapidly labeled phage RNA, which showed that the labeled material sediments heterogeneously and distinctly from any of the known rRNA or tRNA species. In the experiment of Figure 24.3(a), an ultracentrifuge tube contains buffer with a linear gradient of sucrose concentration. The RNA solution being analyzed is carefully layered onto the top of the gradient. The sucrose minimizes convection, so that in a centrifu-gal field molecules sediment as discrete bands, at rates related to their size and shape. This differs from equilibrium gradient centrifugation, as in the Meselson–Stahl experiment (Chapter 4), where molecules are separated on the basis of density differences (mass per unit volume).

In the experiment of Figure 24.3(a), fractions obtained after cen-trifugation were analyzed for their ultraviolet light absorption, which measures RNA concentration, and radioactivity, which measures the relative amount of RNA synthesized during the pulse. The UV ab-sorbance profile shows two large peaks at 16S and 23S, and a smaller peak at 4S. (S stands for “Svedberg units,” a measure of sedimenta-tion rate; it is related to molecular size and shape.) The 16S and 23S peaks represent rRNA, and the 4S peak is mostly tRNA, which is a much smaller molecule. The radioactivity was distributed heteroge-neously throughout the gradient, suggesting that the labeled RNA is a collection of molecules of varying sizes, none of which was present in great abundance.

Benjamin Hall and Sol Spiegelman established that this phage RNA is a viral gene product when they carried out the first DNA–RNA hy-bridization experiment, show-ing that the labeled RNA was complementary in sequence to phage DNA. DNA and RNA were heated and slow-cooled together, so that RNA could base-pair with DNA of complementary nucleotide sequence. The earliest experiments were based on the fact that RNA is denser than DNA (why is this so?). Thus, a DNA–RNA hybrid could be detected in equilibrium gradi-ent centrifugation as a species of intermediate density containing label derived from both DNA and RNA. Such a hybrid was formed when

DNA, was regulated specifically at the level of initiation. Hypotheti-cal regulatory elements called repressors and operators controlled the synthesis of other hypothetical entities called messenger RNAs (mRNAs). mRNA was postulated to be a complementary copy of the DNA that encompassed a set of structural genes, which encode proteins, as schematized in Figure 24.2. A set of contiguous genes plus adjacent regulatory elements that control their expression was termed an operon, and the Jacob–Monod hypothesis thus became known as the operon model.

Jacob and Monod predicted several characteristics of the hypo-thetical mRNA. First, they predicted a high rate of mRNA synthesis followed by rapid degradation, which would explain the fast turn-on

of the genes after induction by lactose and turn-off in the presence of another preferen-tially used sugar. Second, be-cause of rapid synthesis and degradation, they expected mRNA to accumulate rapidly but not to high steady-state

levels. Third, because they thought that the messenger was a copy of two or more contiguous genes, they expected it to be fairly large and part of a heterogeneous size class of RNA molecules. Finally, if the mRNA was a complementary copy of DNA, its nucleotide sequence should be identical to a portion of one of the DNA strands.

T2 Bacteriophage and the Demonstration of Messenger rNAThe first physical demonstration of mRNA came from work with the closely related T2 and T4 bacteriophages. Infection by these large DNA viruses arrests all expression of host-cell genes, and no significant accumulation of RNA can be detected after infec-tion. However, in 1956 the use of radioisotopes led to detection of a distinctive RNA in T2-infected E. coli. When infected cultures

ConCept Bacterial genetics predicted messenger RNA to be a col-lection of metabolically active RNAs that are present in low abundance, heterogeneous in size, and comple-mentary in sequence to portions of the DNA genome.

Figure 24.2The operon model, as proposed in 1961 by Jacob and Monod.

ConCept The ability of T2 phage RNA to hybridize with T2 DNA and to associate with ribosomes made before infection was evidence for the existence of mRNA.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 733 30/10/14 2:01 PM

Page 5: The deadly nightshade (Amanita phalloides), one of the most

734 | Chapter 24 Transcription and Post-transcriptional Processing

not turn over, label accumulates, and the fraction of total label in the stable RNA species continues to increase. Consistent with this idea, Spiegelman also showed that highly labeled ribosomal and transfer RNAs hybridize to E. coli DNA, demonstrating that all three major classes of RNA are synthesized from template DNA strands.

As noted previously, the earliest DNA–RNA hybridization ex-periments involved gradient centrifugation, a laborious technique. Spiegelman and his col-leagues made the important discovery that single-strand-ed DNA binds irreversibly to membrane filters made of material such as nitrocel-lulose. This technique al-lowed rapid hybridization analysis of a large number of samples because a radiolabeled RNA could hybridize to denatured DNA immobilized on a nitrocellulose filter. After washing unbound

T2 RNA was heated and slowly cooled along with T2 DNA, but not when the DNA came from E. coli. Additional density- labeling experiments established that phage proteins were syn-thesized on ribosomes that had been formed before infection. These experiments showed the ribosome to be a nonspecific workbench on which any protein could be assembled, depending on which template became asso-ciated with that workbench.

rNA Dynamics in uninfected CellsThe experiments described in the previous section supported the existence of mRNA in phage-infected E. coli. What about uninfected bacteria? Spiegelman and his colleagues showed that pulse-labeled RNA from uninfected E. coli hybridized to E. coli DNA. At very short labeling intervals, the sedi-mentation pattern showed incorporation into both rRNA and tRNA species and a heterogeneously sedimenting species (Figure 24.3(c)). After a “chase”, when the radiolabeled phosphate was removed, the radioactivity profile followed the absorbance profile, showing that all RNA species had become labeled to equivalent specific radioactivi-

ties. This finding is consist-ent with the postulated short lifetime of mRNA (t1/2 = 2-3 minutes). mRNA would reach its maximal radioactiv-ity within just a few minutes, but during the chase, mRNA turnover would release nu-cleotides that could flow

into stable RNA molecules. Because those stable RNA molecules do

23S 16S 4S

(b) Sedimentation profile of total and pulse-labeled RNAs in T2 phage-infected E. coli. Total concentration of RNA in each fraction (black) was determined by ultraviolet absorbance (A260). The radioactivity profile (red) shows the heterogeneous size distribution of RNA molecules synthesized during the pulse.

(a) A centrifuge tube containing a linear 5% to 20% gradient of sucrose with an RNA preparation layered on top. After centrifugation, fractions are collected through a hole punched in the bottom of the tube.

(c) Pulse-labeled RNA species in uninfected bacteria and their fate in a chase. The orange line represents RNA molecules labeled in a 3-minute pulse. The blue line depicts an identical culture with the label chased by 0.7 generation of growth in nonradioactive medium, and the black line is the A260 profile, representing ribosomal and transfer RNA species as in panel (b).

Distance from tube bottom Distance from tube bottom

A26

0

205% sucrose

Beforecentrifugation

Aftercentrifugation

Total RNA4S RNA

16S RNA23S RNAD

ista

nce

from

top

cpm

A26

0

cpm

23S 16S 4S

cpm

Pulsecpm

Chasecpm

A260

Figure 24.3Demonstration of mRNA by pulse labeling and sedimentation. Panel a demonstrates the technique of gradient centrifugation, while panels b and c depict the results of analysis of pulse-labeled RNA preparations.

ConCept The demonstration of mRNA in E. coli involved a classic “pulse-chase” experiment, in which incorporation of a radioisotope into a metabolic species takes place during a short time interval; then the fate of that species is followed after removal of the radioisotope.

ConneCtion The discovery that single-stranded DNA, immobilized on a membrane filter, can hybridize with complementary-sequence RNA or DNA underlies forensic DNA analysis and gene-expression analysis by microar-ray technology.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 734 30/10/14 2:01 PM

Page 6: The deadly nightshade (Amanita phalloides), one of the most

24.2 Enzymology of RNA Synthesis: RNA Polymerase | 735

radioactivity off the filter, the extent of hybridization could be deter-mined simply by counting the radioactivity of the filter. The same principle— immobilization of DNA on nitrocellulose followed by the analysis of bound radioactivity—underlies Southern blotting (de-scribed in Chapter 21), which is now more widely used to analyze gene organization and expression. These developments also led to microarray technology, in which thousands of DNA–RNA hybrid-ization reactions are analyzed on a single DNA chip (see Tools of Biochemistry 24A; see also Figure 1.11).

24.2 enzymology of rNA Synthesis: rNA PolymeraseAs discussed earlier, RNA synthesis involves copying a template DNA strand. However, the earliest known enzyme capable of RNA synthesis in vitro did not require a template. This RNA-synthesizing enzyme, called polynucleotide phosphorylase, was discovered in the 1950s. The enzyme was quite different from DNA polymerase. The enzyme required no template, and it used ribonucleoside diphosphates (rNDPs) as substrates to produce a random-sequence polynucleotide whose base composition matched the nucleotide composition of the reaction medium.

n rNDP ∆ 1rNMP2n + n Pi

Initially, it was thought that polynucleotide phosphorylase might be the major RNA-synthesizing enzyme, but the lack of a template requirement was puzzling, as was the apparent absence of the enzyme

in eukaryotic cells. Ulti-mately, polynucleotide phos-phorylase turned out to play no role in RNA synthesis in vivo but instead was found to participate in the degradation

of bacterial mRNAs. The enzyme was of great value, however, in the synthesis of polynucleotides used as templates for in vitro protein syn-thesis, when the genetic code was being elucidated (see Chapter 25).

Investigators continued to search for an enzyme that would copy a DNA template in vitro. In 1960, such an enzyme was discovered almost simultaneously in four different laboratories. The enzyme, DNA-directed RNA polymerase, resembled DNA polymerases in the nature of the reaction catalyzed.

n (ATP + GTP + CTP + UTP) ERRRFMg2+, DNA

(AMP - GMP - CMP - UMP)n + n PPi

The reaction product is a complementary RNA copy of the DNA template.

Biological role of rNA PolymeraseIn bacteria a single RNA polymerase catalyzes the synthesis of mRNA, rRNA, and tRNA. This was shown in experiments with ri-fampicin, an antibiotic that inhibits RNA polymerase in vitro and blocks the synthesis of mRNA, rRNA, and tRNA in vivo. Rifampicin-resistant mutants of E. coli were found both to contain a rifampicin-resistant form of RNA polymerase and to be capable of synthesizing

all three RNA classes in vivo in the presence of rifampicin. Because a single mutation af-fects both the RNA polymer-ase and the synthesis of all RNA types in vivo, RNA polymerase must be the one enzyme catalyzing all forms of transcription in bacteria.

ConneCtion Rifampicin is a widely used antibiotic that specifically targets bacterial RNA polymerase.

ConCept Polynucleotide phosphorylase catalyzes the reversible, template-independent synthesis of random-sequence polyribonucleotides.

A-Amanitin

N

H

HO

CHOC

CH

OC

HN

C

NH

CO

NH CO CH NH CO CH2 NH

CH CO NH CO NH

CH2OH

CH2 CO

HOHN

H

Rifampicin

NOH

OH O

O

N

N

CH3

CH3

NH

OCH3

CH3 CH3

OHOH

CH

HCHCH

OH

H3C

C

S

H2C

CH2

H3C CONH2

CH3O

CH3COOCH3

CH3

HO

CH3

O

CH2CH3

CH3

ConneCtion a-Amanitin, the toxic agent in one of the most deadly poisonous mushrooms, acts by inhib-iting RNA synthesis and, hence, by blocking all protein synthesis.

In contrast, eukaryotes contain three distinct RNA polymerases, one each for synthesis of rRNA, mRNA, and small RNAs (tRNA plus the 5S species of rRNA)—RNA polymerases I, II, and III, re-spectively. The existence of separate enzymes was revealed partly because they differ in their sensitivity to inhibition by A-amanitin, a toxin from the poisonous Amanita mushroom. RNA polymerase II is inhibited at low concentrations of a- amanitin, RNA polymerase III is inhibited only at high concentrations, and RNA poly-merase I is quite resistant.

Because DNA polymerases and RNA polymerases catalyze similar reactions, it is worthwhile to compare some of their kinetic features.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 735 30/10/14 2:01 PM

Page 7: The deadly nightshade (Amanita phalloides), one of the most

736 | Chapter 24 Transcription and Post-transcriptional Processing

to a different set of promoters, thereby activating transcription of a block of genes called heat-shock genes. The most abundant s in E. coli, and the one that will frame our discussions, is called s70 because of its 70-kDa molecular weight. Subunit s70 is one of seven different s factors known in E. coli, each designed to direct RNA polymerase to a functionally related set of genes.

Eukaryotic RNA polymerases show similarity with the bacterial enzymes, but they have a more complex subunit structure. RNA poly-merase II (pol II) from yeast has 12 subunits. The common ancestry among RNA polymerases is evident, however, as shown in Figure 24.4.

The kcat for E. coli DNA polymerase III holoenzyme, at about 500 to 1000 nucleotides per second, is much higher than kcat for purified RNA

polymerase—50 nucleotides per second, which is about the same as the rate of tran-scription in vivo. Although there are only about 10 mol-ecules of DNA polymerase III

per E. coli cell, there are some 2000 molecules of RNA polymerase, of which half might be involved in transcription at any instant. Why is this significant? As seen in Chapter 22, replicative DNA chain growth is rapid but takes place at few sites, whereas transcription is much slower but occurs at many sites. The result is that far more RNA ac-cumulates in the cell than DNA. Like the DNA polymerase III ho-loenzyme, the action of RNA polymerase is highly processive. Once past the initial stages of transcription, RNA polymerase rarely, if ever, dissociates from the template until the specific signal to terminate has been reached.

Another important difference between DNA and RNA polymer-ases is the accuracy with which a template is copied. With an error rate of about 10-5, RNA polymerase is less accurate than replicative DNA polymerase holoenzymes, although RNA polymerase is much more accurate than expected if base-pairing with template nucleotides were the sole determinant of the base sequence of the transcript. Given that RNA does not carry information from one cell generation to the next, an ultrahigh-fidelity template-copying mechanism is evidently not needed.

Structure of rNA PolymeraseWhen highly purified E. coli RNA polymerase is analyzed in denaturing electrophoretic gels, five dis-tinct polypeptide subunits are observed. Their proper-ties are summarized in TABle 24.1. Two copies of the α subunit are present, along with one each of b, b′, s, and v, giving Mr of about 450,000 for the holoenzyme.

The s subunit is easily dissociated from RNA poly-merase. The s-free enzyme, called core polymerase, is still catalytically active, but it binds to DNA at far more sites than does the RNA polymerase holoenzyme, and it shows no strand or sequence specificity. The s subunit thus plays an essential role in directing RNA polymerase to bind to the duplex DNA at the proper site for initiation—the promoter site—and to select the correct strand as the template for transcription. The addition of s to core polymerase reduces the af-finity of the enzyme for nonpromoter sites by about 104, thereby increasing the enzyme’s specificity for binding to promoters.

These discoveries about s suggested that gene expression could be regulated by having core polymerase interact with different forms of s, which would in turn direct the holoenzyme to different promoters. In fact, this does occur. In one example an E. coli culture is stressed by a sudden temperature increase. In these heat-shocked cells, a new form of s appears and directs the modified RNA polymerase

ConCept DNA replication involves rapid chain growth occurring at few intracellular sites, and transcription involves slower growth at many sites. More RNA accumulates than DNA.

table 24.1 Subunit composition of E. coli RNA polymerase

Subunit Mr

Number per Enzyme Molecule Function

a 36.5 kDa 2 Chain initiation, interaction with regulatory proteins and upstream promoter elements

b 151.0 kDa 1 Chain initiation and elongation

b′ 155.0 kDa 1 DNA binding

s 70.0 kDaa 1 Promoter recognition

v 11.0 kDa 1 Promotion of enzyme assembly

a The 70-kDa s subunit is one of several alternative s subunits.

Catalyticcore

Assemblyplatform

Auxiliaryspecialized functions

Subunitcomplexity

RNApolymerasefunction:

Bacteria

Archaea

Eukarya

b9

A9

B9

B0

D N L P H K F ETF

S/X

RPB

1

RPB

2

RPB

3R

PB10

RPB

11R

PB12

RPB

5R

PB6

RPB

4R

PB7

RPB

9R

PB8

C82

C53

C37

C34

C31

A19

0

A13

5

AC

40R

PB10

AC

19R

PB12

RPB

5R

PB6

A14

A43

A12

A49

A34

.5

RPB

8

pol II

pol I

pol III

C16

0

C12

8

AC

40R

PB10

AC

19R

PB12

RPB

5R

PB6

C17

C25

C11

RPB

8

A0

b a a v

Conserved among the three eukaryotic RNA polymerases

Figure 24.4RNA polymerase subunit structures in the three domains of life. The subunits are arranged by function rather than by size. Homologous subunits are color-coded. Subunits marked with an asterisk are conserved among the three eukaryotic RNA polymerases. Only core enzyme subunits are shown. Data from F. Werner (2007) Structure and function of archaeal RNA polymerases, Molecular Microbiology 65:1395–1404.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 736 30/10/14 2:01 PM

Page 8: The deadly nightshade (Amanita phalloides), one of the most

24.3 Mechanism of Transcription in Bacteria | 737

in vertebrates. Plant chloroplasts also contain an organelle-specific RNA polymerase as well as two additional RNA polymerases, IV and V. These enzymes are involved in RNA-directed DNA methylation, a process in gene silencing (Chapter 26).

The crystal structures of eukaryotic and bacterial RNA polymer-ases were first determined in 2001 and 2002 in the laboratories of Roger Kornberg and Seth Darst, respectively. Figure 24.5(a) shows the structure of E. coli RNA polymerase holoenzyme (containing sigma), reported in 2013, and Figure 24.5(b) shows a cutaway view of yeast RNA polymerase II, illustrating structural features common to all of the multisubunit RNA polymerases. We shall describe the rela-tionship between these structures and the RNA polymerase reaction presently.

24.3 Mechanism of Transcription in BacteriaLike DNA replication and protein synthesis, transcription occurs in three distinct phases—initiation, elongation, and termination. Initiation and termination signals in the DNA sequence punctu-ate the genetic message by directing RNA polymerase to particu-lar genes and by specifying where transcription will start, where it will stop, and which DNA strand will be transcribed. The signals involve both instructions encoded in DNA base sequences and in-teractions between DNA and proteins other than RNA polymerase. We focus first on bacterial RNA polymerases, exemplified by the widely studied E. coli enzyme, but the basic mechanics of transcrip-tion are similar in all organisms. Actions of the eukaryotic enzymes are discussed later.

initiation of Transcription: interactions with PromotersPromoters, the DNA sites at which RNA polymerase binds and initi-ates transcription, were originally identified from DNA base sequence analysis upstream (on the 5′ side) from sites at which transcription begins. This analysis for several E. coli genes identified two conserved adenine- and thymine-rich sequences centered at about 10 and 35 nu-cleotides to the 5′ side of the transcriptional start site, or, at positions -10 and -35, where +1 represents the first DNA nucleotide to be

For example, the two largest subunits in the three eukaryotic multisub-unit polymerases are related to b and b′ of the bacterial enzyme. The figure also shows the subunit composition of archaeal RNA polymer-ases, which reveal them to be much more closely related to eukaryotic than to bacterial polymerases. Eukaryotic RNA polymerases have no direct counterpart to bacterial s. Instead, a series of proteins called transcription factors functions in comparable fashion, helping to di-rect RNA polymerase to promoter sites, forming an initiation complex (page 713).

Although the multisubunit motif for RNA polymerases is the dominant structural theme, it is not universal. Exceptions include single-subunit polymerases encoded by some bacteriophage genomes, including T7 and SP6, and a mitochondrial RNA polymerase found

a subunits

Active site

b9 subunit

b9 clamp

b subunit

v subunits factor

(a) E. coli holoenzyme. Some functional domains are indicated. For other views, see http://www.jbc.org/content/suppl/2013/02/06/M112.430900.DC1/ jbc.M112.430900-1.mov. PDB ID 4IGC.

(b) Cutaway view of the yeast RNA polymerase II elongation complex. Cut surfaces are in light gray. Nontemplate strand (blue) is not shown where it is disordered. The 39 end of the growing RNA chain is adjacent to one of two catalytically essential Mg2+ ions.

Transcribed DNA(upstream)

Clamp

Rudder

Lid

Exit59

Wall Bridge

PoreFunnel

NTPs

Transcription

Jaw

Entering DNA(downstream)

Mg21

RNA hybridized tothe template strand.

TemplateDNA strand

Figure 24.5Crystal structure of RNA polymerase. (a) E. coli holoenzyme. a subunits in yellow and green; b, cyan; b1, pink; v, gray; s70, orange. Some functional domains are indicated. PDB ID 4IGC. Data from K. S. Murakami (2013), X-ray Crystal Structure of Escherichia coli RNA Polymerase s70 Holoenzyme, J. Biol. Chem. 288:9126–9134; (b) Cutaway view of the yeast RNA polymerase II elongation complex. Labeled parts of the structure will be described in the text. Cut surfaces are in light gray. Template DNA strand is in purple. Nontemplate strand (blue) is not shown where it is disordered. RNA that is hybridized to the template strand is in orange. The 3′ end of the growing RNA chain is adjacent to one of two catalytically essential Mg2+ ions. PDB ID 116H.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 737 30/10/14 2:01 PM

Page 9: The deadly nightshade (Amanita phalloides), one of the most

738 | Chapter 24 Transcription and Post-transcriptional Processing

transcribed (Figure 24.6). There was some sequence variation among the promoters analyzed, but a consensus sequence emerged—meaning those nucleotides found most often at each position in each site—within this conserved region.

Biochemical analysis by a technique called footprinting confirmed that RNA polymerase binds tightly to DNA sequences immediately upstream from the tran-scription start site and that the -10 and -35 “boxes” are in close contact with the bound enzyme. In footprinting, a cloned segment of DNA is made radioactive by use of a kinase enzyme and [32P]ATP to phosphorylate the DNA molecule at its 5′ end. The 5′ end-labeled DNA is mixed with RNA polymerase or other site-specific DNA-binding protein of interest. As shown in Figure 24.7, the end-labeled DNA by itself and the DNA–protein complex are digested separately with pancreatic deoxyribonuclease (DNase I), an enzyme that cleaves DNA at random sites. Conditions are chosen so that most DNA molecules are cleaved once. The reaction mixtures are then subjected to gel electrophoresis and autoradiography. DNA sites in close contact with the pro-tein are protected from cleavage, and bands representing those sites are not seen.

The search process by which RNA polymerase finds a promoter site embedded in many kilobases of DNA is not well understood. s is required, for core polymerase binds weakly to DNA with no specificity whatsoever. The initial encounter between RNA poly-merase holoenzyme and a promoter generates a closed-promoter complex. Whereas DNA strands unwind later in transcription, no unwinding is detectable in a closed-pro-moter complex. Binding is primarily electrostatic and of low affinity. Footprinting stud-

ies show that polymerase is in contact with DNA from about nucleotide -55 to -5.

Next, RNA polymerase unwinds several base pairs of DNA, from about -12 to -1, giv-ing an open-promoter complex, so-called be-cause it binds DNA whose strands are open, or

–35 region Spacer –10 region Spacer TranscribedPromoterfor:

trp operon T T G A C A N17 T T A A C T N7 A

tRNATyr T T T A C A N16 T A T G A T N7 A

lP2 T T G A C A N17 G A T A C T N6 G

lac operon T T T A C A N17 T A G T T N6 A

recA T T G A T A N16 T A A A T N7 A

lexA T T C C A A N17 T A T A C T N6 A

T7A3 T

G

C

G

C

C

G

G T G A C A N17 T A C G A T N7 A

CONSENSUS T T G A C A T A T A A T

T

T

Transcription start sites

Figure 24.6Conserved sequences in promoters recognized by E. coli RNA polymerase. Lengths of spacer regions between the two conserved segments are shown.

ConCept Transcription begins with sequence-specific interaction between RNA polymerase and a pro-moter site, where duplex unwinding and template strand selection occur.

*

*

* * * * * * * * ** * * * * * *

*DNase Icleaves DNAat random sites.

Missingsegments

Electrophoresisand radioautography

1

2 29

3

DNA-bindingprotein

Radioactive labelat 59 end of DNA

Bands representing DNA sites in close contact with the protein are not seen.

Cleavage sites

Figure 24.7Footprinting to identify binding sites on DNA for site-specific DNA-binding proteins.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 738 30/10/14 2:01 PM

Page 10: The deadly nightshade (Amanita phalloides), one of the most

24.3 Mechanism of Transcription in Bacteria | 739

Figure 24.9The transcription bubble. This idealized view was originally determined by indirect means, such as the reactivity of transcription complexes with reagents specific for nucleotides in single-stranded conformation. Nt = nucleotide.

it rewinds the template behind the 3′ end of the growing RNA chain, as suggested in Figure 24.9. In this model, about 18 base pairs of DNA are unwound at any given time, forming a moving “transcription bub-ble.” As one base pair becomes unwound in advance of the 3′ end of the nascent RNA strand, one base pair becomes rewound near the trailing end of the RNA polymerase molecule. About nine base pairs of the 3′ end of the nascent transcript are hybridized to the template DNA strand. During elongation, RNA polymerase functions as a true molecular motor. Techniques for analyzing single complexes show that RNA polymerase generates forces exceeding those of well-studied cytoskeletal motor proteins, such as myosin and kinesin.

Refer back to Figure 24.5(b) for a stylized picture of an elongation complex. Although this model was derived from the structure of yeast RNA polymerase II, it shows generally applicable features of tran-scriptional elongation, including the large “wall”, which forces DNA to bend, almost at right angles. In this schematic illustration, the di-rection of polymerase motion is left to right as shown. DNA entering the enzyme is gripped by protein “jaws” (upper jaw not shown in this cutaway model). The 3′ end of the growing RNA is adjacent to one of the catalytically essential Mg2+ ions. The wall forces the DNA to turn. rNTPs probably enter the active site, as shown, through a funnel struc-ture and pore. The 5′ end of the growing RNA chain is diverted from the DNA template by a protein loop called the rudder, which limits the length of RNA hybridized to template DNA. The rudder and lid, which guide the exit of RNA, emanate from a large clamp that swings from back to front, as shown, over the catalytic site and contributes to bind-ing nucleic acids, and hence, to the high processivity of transcription.

The concept of a transcription bubble as a central intermediate in transcription, as depicted in Figure 24.9, suggests that the enzyme moves along the DNA template in register with the growing RNA transcript, with the footprint advancing by one base pair for each ri-bonucleotide incorporated into the transcript. However, footprinting of numerous initiation and elongation complexes has shown that the enzyme often advances discontinuously, holding its position for sev-eral cycles of nucleotide addition and then jumping forward by several base pairs along the template. These and other observations suggest that the means of RNA polymerase translocation is fundamentally dif-ferent from the continuous movement implied by a picture of the tran-scription bubble. It has long been known that some DNA sequences are difficult to transcribe, and RNA polymerase “pauses” when it reaches

unwound. The open-promoter complex is extremely stable; it forms with a Ka as high as 1012 M-1. Structural analysis of DNA complexed with s shows specific interactions between the protein and DNA bases in the -10 box. DNA melting is an essential feature of promoter recognition.

Next, an Mg2+ -dependent isomerization occurs, giving a modi-fied form of the open-promoter complex with the unwound DNA region now extending from -12 to +2. DNA is bent in this structure, as shown in Figure 24.8. Note also the positions of the major promoter elements (-10 and -35) and the bound Mg2+ at the catalytic site.

initiation and elongation: incorporation of ribonucleotidesRNA chain growth begins with binding of the template-specified rNTP, followed by binding of the second nucleotide and its fitting to the template. Nucleophilic attack by the 3′ hydroxyl of the first nu-cleotide on the a (innermost) phosphorus of the second nucleotide generates the first phosphodiester bond and leaves an intact triphos-phate moiety at the 5′ position of the first nucleotide. Nucleotide in-corporation occurs via the two-metal mechanism described for DNA polymerases in Chapter 22. The transcript is bound unstably during the first several phosphodiester bond-forming reactions, as shown by the fact that most initiations are abortive, with release of oligonucleo-tides two to nine residues long. The basis for this low efficiency of initiation is not fully understood.

During incorporation of the first 10 nucleotides, the s subunit dissociates from the transcription complex, and the remainder of the transcription process is catalyzed by the core polymerase. Once s has dissociated, the elongation complex becomes quite stable. Transcrip-tion, as studied in vitro, can no longer be inhibited by adding rifampi-cin, and virtually all transcription events proceed to completion.

During elongation, the core enzyme moves along the duplex DNA template. As it moves, it simultaneously unwinds the DNA, exposing a single-stranded template for base pairing with incoming nucleotides and with the nascent transcript (the most recently synthesized RNA);

NontemplateDNA strand

235 region 210 region

TemplateDNA strandMg21

Figure 24.8DNA bending in an open-promoter complex. This image is taken from the crystal structure of Thermus aquaticus RNA polymerase. Parts of b have been removed to show interior structure. Note the sharp bend in the template strand. The first two ribonucleotides to be incorporated (i and i + 1) are shown close to a catalytically essential Mg2+ . Much of the a and b′ subunits are rendered transparent. PDB ID 1L9U. Data from Science 296:1285–1290, K. S. Murakami, S. et al (2002) Taq RNA polymerase.

8–9 nt6–7 nt

18 bp

13–15 nt 17–19 nt

5′

3′Catalytic site

“Bubble”

RNA polymerase

3′

5′

Non-templatestrand

Templatestrand

mRNA 5′

3′

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 739 30/10/14 2:01 PM

Page 11: The deadly nightshade (Amanita phalloides), one of the most

740 | Chapter 24 Transcription and Post-transcriptional Processing

process. In bacteria we recognize two distinct types of termination events—those that depend on the action of a protein termination factor called r (rho), and those that are r factor–independent.

Factor-independent Termination

Sequencing the 3′ ends of genes that terminate in a factor- independent manner reveals two structural features shared by many such genes and is illustrated in Figure 24.11: (1) two symmetrical GC-rich segments

such a site in vitro, often sitting at the same site for several seconds before transcription is resumed. At such sites, RNA polymerase often translocates backward, and in the process the 3′ end of the nascent transcript is displaced from the catalytic site of the enzyme. This gives a 3′ “tail,” which may be several nucleotides long and is not base-paired to the template, protruding downstream of the enzyme (Figure 24.10). For transcription to resume, an RNA 3′ end must be positioned in the ac-tive site. At this point the enzyme may either cleave the single-stranded RNA end or move forward without new nucleotide incorporation, thereby repositioning the RNA 3′ nucleotide in duplex conformation. These pausing, backtracking, and cleavage reactions are thought to help control the transcription rate and to enhance transcriptional fi-delity by removing mismatched or damaged ribonucleotides.

Punctuation of Transcription: TerminationBecause of the great stability of transcription complexes, termination of transcription, with release of the nascent transcript, is an involved

39 tail

BacktrackedWhen RNA polymerase reachesa site that is difficult to transcribe,it may pause or shift backwards.

Catalytic site

PolymeraseActive

Transcription can resume eitherby polymerase sliding forward orby cleavage of the non-base-paired part of the transcript, creating anew base-paired 39 terminus.

RNA

G

C C C A G CC CGC C U A A U CA G

3 9-

5 9-

GG T CG G G C G GA T T A

T

C

G

T

A

C

G

G

C

C

G

C

G

C

G

G

C GGGC

C T T T T T T T T

A A A A A A A A

U U U U U U U U

C

OH3'

G

T

A

T

A

G T T T T-5 9

C A A A A-3 9C C C A G C C C G C C T A A

39-

5 9- -3 9

-5 9G

C

G

C

G

C

T

A

C

G

G

C

G

C

G

C

C

G

G

C

G

C

A

T

T

A

T

A

A

T

C

G

T

A

C

C C C AGCCCGC

CUA

A UGA

GCGGCCU

G

G

C

C

G

C

G

C

G

G A

T

A

U

T

A

U

T

A

U

T

A

U

T

A

U

T

A

U

T

A

U

C

C

T

A

T

AT

C

G

C

T

A

T

A

T

A

T

AOH3'

Stem-loop structure(RNA–RNA duplex)

-3 9

-5 93 9-

5 9-

G

C

G

C

G

C

T

A

C

G

G

C

G

C

G

C

A

T

T

A

T

A

T

A

A

T

C

G

C

G

G

C

C

G

C

G

C

G

G

C

A

T

A

T

A

T

A

T

A

T

A

T

A

T

A

T

C

–OH3'

G

G

C

T

A

T

A

T

A

T

A

T

A

T

A

G

C

G

C

C

G

GCCCGC

CC C C A U U U U UU U U

GGGCG

C AU GA A U

(a) An A-rich segment of the template (orange segment on right) has just been transcribed into a U-rich mRNA segment.

(b) The RNA–RNA duplex, stabilized by G–C base pairs (yellow), eliminates some of the base pairing between template and transcript.

(c) The unstable A–U bonds linking transcript to template hybrid dissociate, releasing the transcript.

Complementary GC-rich parts of the transcript base-pair with one another, displacing this part of the transcript from its template or from its enzyme-binding site.

Figure 24.11A model for factor-independent termination of transcription.

Figure 24.10Backtracking in an elongation complex. Above, the 3′ terminus of the transcript is in the catalytic site. Below, the enzyme has slipped backward, leaving the 3′ transcript terminus at the end of a non–base-paired RNA tail, several nucleotides long. Data from E. Nudler (2012) RNA polymerase backtracking in gene regulation and genome instability. Cell 149;1438–1445.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 740 30/10/14 2:01 PM

Page 12: The deadly nightshade (Amanita phalloides), one of the most

24.4 Transcription in Eukaryotic Cells | 741

59

RNA polymerase

mRNABinding of r toDNA–RNA–polymerasecomplex

r recognitionsite

r protein

r moves toward39 end, displacingthe DNA templatestrand

r protein

RNA polymerase

mRNA

This weakens theinteraction betweentemplate andtranscript, causingthem to dissociate;r and polymerasealso dissociate

r recognition site

ATP

ADP + Pi

59

59

59

39

39

39

39

that in the transcript have the potential to form a stem-loop structure, and (2) a down-stream run of four to eight A residues. These features sug-gest the following as elements of the termination mechanism. First, RNA polymerase slows down, or pauses, when it reaches the first GC-rich segment, because the stabili-ty of G-C base pairs makes the template hard to unwind. In vitro, RNA polymerase does pause, sometimes for as long as several minutes, at a GC-rich segment. Second, pausing gives time for the complementary GC-rich parts of the nascent transcript to base-pair with one another, thereby displacing this part of the transcript from its template or from its enzyme binding site. Hence, the ternary complex of RNA polymer-ase, DNA template, and RNA is weakened. Further weakening, lead-ing to dissociation, occurs when the A-rich segment is transcribed to give a series of A–U bonds (very weak) linking transcript to template.

The actual mechanism of termination is more complex than just described, in part because DNA sequences both upstream and down-stream from the regions shown in Figure 24.11 also influence termi-nation efficiency.

Factor-Dependent Termination

Factor-dependent termination sites are less frequent, and the process requires an additional protein factor, called r. The r protein, a hexamer composed of identical subunits, was originally discovered in studies on termination of l phage DNA transcription in vitro. This protein, which has been characterized as an RNA–DNA helicase, contains a nucleo-side triphosphatase activity that is activated by binding to polynucleo-tides. Apparently, r acts by binding to the nascent transcript at a C-rich site near the 3′ end, when RNA polymerase has paused (Figure 24.12). Then r moves along the transcript toward the 3′ end, with the helicase activ-ity unwinding the 3′ end of the transcript from the template (and/or the RNA polymerase molecule) and causing it to be released. Release involves an additional protein, NusA (not shown).

24.4 Transcription in eukaryotic CellsTranscription in eukaryotes is more complex than in bacteria and their phages. Not only is there much more discrimination in what is to be transcribed and what is not, but this transcription must be precisely programmed during development and tissue differentia-tion. Furthermore, the transcription machinery must deal with the complex levels of structure in eukaryotic chromatin. Reflecting this complexity is the fact that eukaryotic cells have three different RNA polymerases, each with a specialized function (we leave aside the two additional polymerases in plants). For each polymerase several proteins must assemble at promoters and other upstream sites on the template DNA, along with RNA polymerase, in order to form a functional transcription complex. None of the three polymeras-es has a direct counterpart to the s factor of bacterial complexes. However, all three require a set of transcription factors that play

ConCept DNA sequences that promote factor-independent termina-tion include a GC-rich region that forms a stem-loop followed by a run of 4 to 8 A residues.

ConCept In factor-dependent termination, R protein acts as an RNA–DNA helicase, unwinding the template–transcript duplex and facili-tating release of the transcript.

Figure 24.12Rho (R) factor–dependent termination. Rho (r) binds to a site on the nascent transcript and unwinds the RNA–DNA duplex.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 741 30/10/14 2:01 PM

Page 13: The deadly nightshade (Amanita phalloides), one of the most

742 | Chapter 24 Transcription and Post-transcriptional Processing

metal-binding zinc fingers make contact with and identify DNA se-quences (Figure 24.14). This class of proteins contains conserved his-tidine and cysteine residues, which complex with Zn2+ . This DNA-binding protein motif was mentioned in Chapter 20 as a structural element in steroid hormone receptors.

rNA Polymerase ii: Transcription of Structural genesAll of the structural genes (those encoding proteins) in the eukary-otic cell are transcribed by RNA polymerase II (pol II). This enzyme also transcribes some of the small nuclear RNAs involved in splicing (discussed in Section 24.5). Like other RNA polymerases, pol II is a complex multisubunit enzyme. However, not even its 12 subunits are sufficient to allow pol II to initiate transcription on a eukaryotic promoter. Because the expression of many eukaryotic genes is either tissue-specific or developmental stage-specific or both, eukaryotic promoter structure is far more complex than that of bacteria. Protein factors in addition to RNA polymerase are required for promoter rec-ognition, recruitment of RNA polymerase to a promoter, and genera-tion of an active elongation complex. A typical initiation complex con-tains 60 proteins in addition to RNA polymerase. Of these, about half form a preinitiation complex that assists the enzyme in recognizing and binding to a promoter. The remainder are involved in regulation.

A typical eukaryotic promoter contains an initiator region (Inr) with the sequence YYANWYY, where N is any nucleotide, Y is a pyrim-idine (C or T), W is either A or T, and N represents the +1 initiation site.

roles comparable to that of s, in addition to proteins that might be specialized for the transcription of a par-ticular gene. By convention,

transcription factors are named TFI, TFII, or TFIII, depending on whether they function with RNA polymerase I, II, or III, respective-ly. Within one class of transcription factors, each individual factor is identified with a letter; thus, TFIIA is one of several transcrip-tion factors functioning with RNA polymerase II. The ensemble of proteins required to form an initiation complex is conserved among polymerases I, II, and III.

Other differences from bacteria pertain to the fact that bacterial ge-nomes are organized into blocks of functionally related genes—operons, such as the lactose operon mentioned earlier in this chapter—which are co-transcribed to give multicistronic mRNAs, while eukaryotic genes are almost always transcribed singly and processed as templates for indi-vidual proteins. Post-transcriptional processing, as discussed later in this chapter, is far more complex for eukaryotic than for bacterial transcripts.

rNA Polymerase i: Transcription of the Major ribosomal rNA genesThe eukaryotic ribosome contains four rRNA molecules (see Chapter 25). The small subunit has an 18S rRNA, whereas the large subunit contains 28S, 5.8S, and 5S components. Of these, the 28S, 18S, and 5.8S subunits are all produced from an initial 45S pre-rRNA transcript, and it is the special function of RNA polymerase I (pol I) to carry out this transcription. rRNA transcription and processing, along with ri-bosome assembly, occur in the nucleolus. Transcription occurs from multiple, tandemly arranged copies of the 45S rRNA gene, as shown in Figure 24.13. After transcription, the 45S pre-rRNA is processed to yield 18S, 5.8S, and 28S rRNA molecules. About 6800 nucleotides are

discarded in this process. The rRNAs are then combined with 5S rRNA from the nu-cleus and ribosomal proteins synthesized in the endoplas-mic reticulum. The resulting

ribosomal subunits are exported from the nucleolus back into the cytosol.

rNA Polymerase iii: Transcription of Small rNA genesRNA polymerase III (pol III) is the largest and most complex of the eukaryotic RNA polymerases (see Figure 24.4). The major targets for pol III are the genes for all the tRNAs and for the ribosomal 5S rRNA. Like the major ribosomal genes described in the previous section, these small genes are present in multiple copies, but they are usually not grouped together in tandem arrays, nor are they localized in one region of the nucleus. Rather, they are scattered over the genome and throughout the nucleus.

Of all the genes transcribed by pol III, the most thoroughly studied are those for 5S rRNA. At least three transcription factors— TFIIIA, –B, and –C, are needed for the expression of these genes. TFIIIB and TFIIIC participate in transcribing tRNA genes as well, but TFIIIA is specific only for the 5S genes. TFIIIA is an example of an abundant class of sequence-specific DNA-binding proteins, in which

ConCept Eukaryotic cells have three RNA polymerases, each requiring additional protein factors to initiate transcription.

Nontranscribedspacer

5.8S

5.8S

DNA with tandemlyarranged genes formammalian 45S rRNA

Transcriptionby RNApolymerase I

Mammalian 45S pre-rRNA (13,000 nucleotides)

Multistep rRNAprocessing (discard6800 nucleotides)

18S

18S

~30 proteins

40S 60S

~50 proteins

Ribosomal subunits

5S rRNA

28S

18S28S 28S

5.8S

5.8S

18S

~30 proteins

40S 60S

~50 proteins5S rRNA

28S

ConCept Pol I transcribes the major ribosomal RNA genes; pol III transcribes small RNA genes; and pol II transcribes protein-encoding genes and a few small RNA genes.

Figure 24.13

Transcription and processing of the major ribosomal RNAs in eukaryotes. The genes exist in tandem copies, separated by nontranscribed spacers. The 45S transcripts first produced are processed by removing the portions shown in tan, thus yielding the 18S, 5.8S, and 28S products. These are then assembled into ribosomal subunits by associating with ribosomal proteins.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 742 30/10/14 2:01 PM

Page 14: The deadly nightshade (Amanita phalloides), one of the most

24.4 Transcription in Eukaryotic Cells | 743

RNA polymerase II interacts with several general transcription factors, including TBP (TATA box-binding protein) and TFIIA, -B, -E, -F, and -H. The formation of initiation and elongation complexes has been studied in detail with yeast RNA polymerase II, by use of crystallographic and other biophysical methods. A remarkable feature of the preinitiation complex, as shown recently by Roger Kornberg’s laboratory, is the fact that DNA is bound in this complex but is not in direct contact with RNA polymerase.

TFIIB plays an important role in converting the initial closed-promoter complex to an open-promoter complex. The process, sum-marized below, is illustrated in a movie produced in Patrick Cramer’s laboratory. (See reference to Cheung and Cramer at the end of this chapter.) Figure 24.16 shows scenes from the movie. Panel (a) shows a front view of the core RNA polymerase, while the other panels present side views at different stages of the process.

In forming the closed-promoter complex, TBP first binds to DNA and bends it by 90 degrees (Figure 24.16(b)). The C-terminal domain of TFIIB binds to TBP and flanking DNA regions. The N-terminal domain recruits RNA polymerase to promoter DNA near the tran-scription start site, forming the closed-promoter complex, also shown in panel (b). Next, a TFIIB element called the B-linker opens DNA before the transcription start site, leading to an open-promoter com-plex (Figure 24.16(c)). ATP is required for this process. TFIIB threads the template DNA strand into the active center. For this TFIIB uses another structural element, the B-reader, which consists of a helix fol-lowed by a mobile loop. Next, DNA is scanned for an initiator (Inr) motif near the start site; DNA “scrunching,” movements of the non-template strand, facilitate this process. Following this, the first two ribonucleotide substrates are positioned opposite Inr, and the first phosphodiester bond is formed (Figure 24.16(d)). Just as seen with bacterial RNA polymerase, most of the early chain initiation events are abortive. Finally, growth of RNA chains beyond seven nucleotides triggers release of TFIIB, and this completes the process of promoter escape. The process as described is actually similar to transcription initiation as studied with bacterial RNA polymerases, even though the proteins other than RNA polymerase are quite different.

Essential to the formation of an elongation complex is phosphory-lation of the carboxyl terminus of the largest subunit (Rpb I) of RNA pol II. This protein contains several dozen repeats of the heptapeptide sequence –YSPTSPS–, as many as 52 repeats in the mammalian en-zyme. Many or most of the serine (S) residues in this sequence must

A counterpart to the bacterial -10 region, called the TATA box, posi-tioned between -20 and -30, has the sequence TATAAAA. Upstream from that box are arrayed additional control elements, includ-ing the CAAT box (GGCCAATCT), the GC box (GGGCGG), and Octamer (ATTTGCAT). Additional regulatory sites may exist several kbp upstream from the initiation site; these are called enhancer regions. Although these far upstream acti-vator sites are involved in transcriptional regulation, they are not considered part of the promoter itself. Figure 24.15 shows the locations of these elements in several well-studied eukaryotic promoters.

ZnZn

97 6 5 4 3

2

1

8

97 6 5 4 3

2

1

8

C block

80 90 NH250 60 70

A block

HOOC25

His 23

His 19

a-helix

b-sheet

Cys 6

NH2

1

Cys 3

(b) Structure of a synthetic polypeptide that contains the zinc finger motif. The a-helix lies within the DNA major groove, as shown in panel (a). The two histidine and two cysteine residues that coordinate the zinc are shown in detail.

COOHa-helix withinmajor groove

(a) The transcription factor TFIIIA binds to the 5S RNA gene via zinc fingers inserted into the major groove. The two major recognition regions, A block and C block, are contacted by fingers 7–9 and 1–3, respectively.

Figure 24.14Zinc fingers. Data from M. S. Lee et al (1989) Science 245:635–637.

–160 –140 –120 –100 –80 –60 –40 –20 0 +20

Transcriptionalstart site

b globin

SV40, early

Thymidinekinase

Histone H2B

CAAT

GC

Octamer

TATA

Nucleotide number

Figure 24.15Structures of some eukaryotic promoters. The colored boxes represent different regulatory elements: orange, TATA box; blue, GC box; yellow, CAAT box; purple, Octamer. Based in part upon Genes IV, B. Lewin, Oxford: Oxford University Press, 1990.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 743 30/10/14 2:02 PM

Page 15: The deadly nightshade (Amanita phalloides), one of the most

744 | Chapter 24 Transcription and Post-transcriptional Processing

Pol IIcore

Cleft

DownstreamDNA

TBPTFIIF

TFIIB

TFIIBreader/linker

Front view Side view Active site

Active site

DNA ‘scrunching’

InitialRNA

UpstreamDNA

Exiting RNA

BacktrackedRNA

OpenDNAbubble

(a) Core RNA polymerase

(d) The first two ribonucleotide substrates are positioned opposite Inr, and the first phosphodiester bond is formed.

(g) Reinitiation after cleavage of the RNA 39 end.

(e) Elongation complex with a cleft through which the RNA 59 end exits.

(f) Polymerase backtracking

(b) TBP binds to DNA and bends it by 90 degrees. (c) B-linker opens DNA before the transcription start site, leading to an open-promoter complex.

undergo phosphorylation in order for RNA polymerase to escape the promoter. The phosphorylation is catalyzed by a subunit of TFIIH. Figure 24.16(e) shows an elongation complex with a cleft through which the RNA 5′ end exits. Figures 24.16(f) and 24.16(g) illustrate the polymerase backtracking process shown in Figure 24.10 for bacte-ria and reinitiation after cleavage of the RNA 3′ end.

Figure 24.16An outline of RNA polymerase II transcription initiation as deduced from crystal structures of participants. RNA polymerase is shown in silver-gray, with other participants as shown. Data from A. C. M. Cheung and P. Cramer (2012) Cell 149:1431–1437.

As mentioned earlier, trans-acting factors binding at enhancer sequences far removed from the promoter itself—by as much as sev-eral kilobase pairs—can influence transcription. Their mode of action appears to involve DNA looping, perhaps mediated by nucleosomes, which can bring enhancer-bound proteins into close physical contact with proteins bound to the promoter. Some transcription factors can bind in either promoter or enhancer regions. These can act as inter-mediates between activators or repressors bound to enhancer regions and the core transcription complex as schematized in Figure 24.17. Also involved in communication between upstream control elements and proteins bound at the promoter is a multiprotein complex called mediator (Chapter 26).

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 744 30/10/14 2:02 PM

Page 16: The deadly nightshade (Amanita phalloides), one of the most

24.4 Transcription in Eukaryotic Cells | 745

Figure 24.17A schematic representation of DNA looping as a process to bring enhancer-bound (ENH) activator (Act) proteins into contact with trans-acting factors (TAFs) associated with the core transcription complex.

sites—regions in which the DNA in isolated chromatin is readily cleaved, as if it were not complexed with histones at those sites, granting ready access to RNA polymerase.

How are accessible sites established in previously unrespon-sive genes? In some cases, such as the globin genes, it seems that the chromatin structure is rearranged at the time of replication. In other instances, protein factors seem able to interfere with chromatin structure at specific loci, opening hypersensitive sites. In either case, the clearance of histones from nuclease-sus-ceptible sites involves the action of chromatin remodeling fac-tors. These are proteins that enable promoter regions to be able to accept the complex and bulky machinery depicted in Figure 24.16. These complexes, which we discuss in Chapter 26, require ATP hydrolysis to somehow “open” nucleosomes transiently, to allow transcription complexes to form.

Another and perhaps equally important role is played by histone acetyltransferases and deacetylases. Histones of the nucleo-some core are subject to acetylation at specific lysine residues in the N-terminal tails (see Figure 24.18); chromatin is subject to other modifications as well, as mentioned in Chapter 21. High levels of acetylation are correlated with high transcriptional activity, and low acetylation with low activity. Chemically, this makes sense: neutral-ization of histone basic residues by acetylation would loosen ionic interactions between histones and DNA in chromatin. A number of proteins recruited to the initiation complex by activators and trans-acting factors have histone acetylase activity. The fact that specific transcription factors are involved in this process may provide the long-sought explanation for how the chromatin of specific genes can be targeted for disruption.

Transcriptional elongation

Formation of the open-promoter complex (see Figure 24.16) is fol-lowed, in the presence of rNTPs and ATP, by melting of a short re-gion of DNA and initiation of transcription. As noted previously, the

Chromatin Structure and TranscriptionThe complex interplay of transcription factors and polymerases we have described occurs not on naked DNA but on chromatin. The chromatin structure presents two major problems: First, how can the transcription factors and initiation complex bind to DNA in the

presence of nucleosomes? Second, how can the actively transcribing polymerase pass through arrays of nucle-

osomes? This is an area of intense research interest, which we treat in more detail in Chapter 26. Here we can point out that transcription often initiates in chromatin regions containing nuclease-accessible

TAFs

Mediator

IIE

TBPIIA IIB

Pol II

IIF IIH

Act

Act

ENH

ENH

Nucleosome

5′

3′

ConCept Nuclease-accessible (open) sites disrupt chromatin to allow initiation.

NH2 COOH

Basic domain

Core histone:

S G R G Q G G K+ +

A R+

A K+

A K+

S R+

S S R+

A G L Q [25-129] H2A

S A P A K+

PE PP A K+

G S AK+

V T T Q K24

+K+

G D R+

K+

K+

K+

K+

[32-125] H2B

Histone fold domain

K20

+K15

+K12

+

A P R+

QT GS GR+

ATQK+

TRA+

AL T A A R+

S A TP G G K+

HPK+

V [40-135] H3K+

K23

+

K16

+K12

+K8

+K5

+

K18

+K14

+K9

+

GG LGG RS+

G GG A R+

H R+

V L DR+

N I I K+

TGQ [32-102] H4K+

L

K5

+

Acetylation sites on N-terminal tails.

in the N-terminal domains, at the highly conserved sites indicated in red. Data from J. C. Hansen, C. Tse, and A. P. Wolffe, Biochemistry (1997) 37:17637–17641.

Figure 24.18Acetylation of core histones. The general structure of each of the four core histones involves a helical “histone-fold” domain plus an unstructured, highly basic N-terminal domain. Acetylation in nuclei occurs exclusively

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 745 30/10/14 2:02 PM

Page 17: The deadly nightshade (Amanita phalloides), one of the most

746 | Chapter 24 Transcription and Post-transcriptional Processing

incorporation, catalyzed by a special nontemplate-directed enzyme, poly(A) polymerase. The functions of the poly(A) tails of eukary-otic mRNAs include mRNA stabilization and facilitation of transport from nucleus to cytoplasm. We know that they cannot be essential for all messages, because some mRNAs (e.g., most histone mRNAs in higher eukaryotes) do not have them. However, poly(A) tails relate to message stability, for tail-less messages typically have much shorter lifetimes in the nucleus. Recent evidence indicates that the specific TTATTT signals used for termination vary in different tissues, so that the 3′ end of an mRNA is partly tissue-specific. The functional signifi-cance of this variation is not yet known.

24.5 Post-transcriptional ProcessingBacterial mrNA TurnoverCritical to mRNA metabolism in eukaryotes are events occurring after transcription—events that are necessary for messages to move from the nucleus to their sites of utilization in the cytosol. We discuss these events later in this chapter. In bacteria, by contrast, mRNAs are available for use in protein synthesis immediately. In fact, a nascent mRNA can serve as a template for translation at its 5′ end while still in the process of being synthesized toward the 3′ end. That is, transcrip-tion is coupled directly to translation.

The major post-transcriptional event in metabolism of bacterial mRNA is its own degradation, which in most cases is rapid. A few bac-terial mRNAs, notably those encoding outer membrane proteins, are long-lived; however, many bacterial messages have half-lives of only 2 to 3 minutes. This short life span means that genes being expressed must be transcribed continuously and that many mRNA molecules are translated only a few times. Although this might seem wasteful, it is consistent with bacterial lifestyles, which necessitate rapid adap-tation to environmental changes. Earlier we noted the selective ad-vantage to bacteria of expressing the genes for lactose utilization only when an inducer is present. By the same token, it would be wasteful for the cell to continue producing these proteins after lactose or a re-lated sugar was exhausted from the milieu. Rapid degradation of lac mRNA ensures that the seemingly wasteful synthesis of these proteins will cease soon after the need for these proteins is gone.

Turnover of bacterial RNAs occurs largely in the degradosome, a multiprotein complex containing polynucleotide phosphorylase (see Section 24.2), a nuclease called RNase E, and an RNA helicase, which helps to unwind regions of RNA secondary structure. Degradation starts from the 5′ end, which is important because translation also starts from the 5′ end. If degradation were to start from the 3′ end, a ribosome starting from a 5′ end might never reach an intact 3′ end.

Post-transcriptional Processing in the Synthesis of Bacterial rrNAs and trNAsBoth ribosomal RNAs and transfer RNAs are synthesized as larger transcripts (pre-rRNA and pre-tRNA, respectively), which undergo cleavage at both ends of the transcript, en route to becoming mature RNAs. This process is comparable to the processing of pre-rRNA in eukaryotic cells. As we will see in Chapter 25, however, the rRNA components in bacteria are somewhat smaller than in eukaryotes—23S, 16S, and 5S. The total amount of DNA encoding these rRNAs

C-terminal tail of the Rpb1 subunit of pol II becomes highly phos-phorylated, leading to promoter release, and elongation begins, with a helicase activity clearing the way. A number of the core transcrip-tion factors are released, and pol II, together with TFIIF, moves along the DNA. A residual complex, containing TBP, TFIIA, TAFs, and probably activator proteins, remains at the start site, ready to initiate another round.

At this point the polymerase also acquires several elongation fac-tors. Some of these factors assist the enzyme in traversing pause sites in the DNA. As previously noted for bacterial RNA polymerase, transcrip-

tion is relatively slow and in-terrupted by frequent pauses, especially in T-rich regions. Elongation factors assist the

enzyme in passing such sites. Nucleosomes form even larger obstacles to the progress of an RNA polymerase II along the DNA, as mentioned previously.

Just how pol II transcribes through nucleosomes is still some-thing of a mystery. Do the nucleosomes unfold and re-form as the polymerase passes? Are they temporarily displaced? Current evidence favors temporary displacement, but the issue is far from settled.

Termination of TranscriptionThe termination of mRNA transcription is also different in eukary-otes. Whereas the bacterial RNA polymerase recognizes terminator signals, which sometimes function with the aid of the r protein, the eukaryotic polymerase II usually continues to transcribe well past the end of the gene. In doing so, it passes through one or more TTATTT signals, which lie beyond the 3′ end of the coding region (Figure 24.19). The pre-mRNA, carrying this signal as AAUAAA, is then cleaved by a special endonuclease that recognizes the signal and cuts at a site 11 to 30 residues downstream of it. At this point, a tail of polyriboadenylic acid, poly(A), as many as 300 bases long, is added by sequential ATP

DNA A A T A A A

mRNA

Endonucleaserecognizes AAUAAAand cuts themRNA 11 to 30nucleotides away

Poly (A) tail added bypoly (A) polymerase

Poly (A) tail(up to 300 residues)

Transcription

5' 3'

Cut

A A U A A A

5' 3'ATP

A A U A A A

5' 3'A A U A A A A A A A A A A A A

T T A T T T

Figure 24.19Termination of transcription in eukaryotes: Addition of poly(A) tails. There is a TTATTT sequence near the 3′ end of most eukaryotic genes. When the complementary strand is transcribed to AAUAAA, it provides a signal for endonuclease cleavage and poly(A) tail addition.

ConCept Pol II, with the aid of other proteins, can transcribe through nucleosome arrays.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 746 30/10/14 2:02 PM

Page 18: The deadly nightshade (Amanita phalloides), one of the most

24.5 Post-transcriptional Processing | 747

trNA Processing

Aside from the tRNAs embedded in pre-rRNA transcripts, the other tRNAs are synthesized in transcripts that contain one to seven tRNAs each, all surrounded by lengthy flanking se-quences. The maturation steps are summarized in Figure 24.21, using as an example the well-studied case of the E. coli tyrosine tRNA species (tRNATyr). Maturation (step 1) starts with RNase E, which cleaves next to a stem-loop structure on the 3′ side of the tRNA sequence. This is followed by the action of ribonu-clease D (step 2), which carries out exonucleolytic cleavage to a point two nucleotides removed from the CCA sequence at the 3′ end. Next, the 5′ end is formed by the action of ribonu-

clease P (step 3), which cleaves to leave a phosphate on the 5′ terminal G. This enzyme creates the 5′ terminus of all tRNA molecules. It is unclear what structural features are recognized by RNase P, because different sequences are con-tained in the cleavage sites. As pointed out in Chapter 8, ribo-nuclease P was one of the first identified ribozymes. The en-zyme consists of one RNA molecule of 377 nucleotides and one protein molecule with Mr of about 20,000. Both components are necessary for full catalytic activity, but the RNA molecule alone can catalyze accurate cleavage.

and tRNAs accounts for less than 1% of the E. coli genome, but be-cause of the instability of mRNA, rRNA and tRNA constitute about 98% of the total RNA in a bacterial cell. Transcription of rRNA genes is extremely efficient when cells are growing rapidly. The intracellular concentrations of ribonucleoside triphosphates are important control elements here; ATP, whose level is high in rapidly growing cells, ac-tivates rRNA gene transcription by stabilizing the rel-evant open- promoter complexes.

rrNA Processing

The E. coli genome contains seven different oper-ons for rRNA species. Each one encodes, in a sin-gle 30S transcript, sequences for one copy each of 16S, 23S, and 5S rRNAs (Figure 24.20). Because the three species are used in equal amounts, the logic of this organization is apparent. Less obvious is that each transcript also includes sequences for one to four tRNA molecules, which vary among the seven different operons. This interspersion of rRNA and tRNA sequences may represent a means of coordi-nating the rates of synthesis of these RNAs.

Processing of the 30S pre-rRNA probably be-gins with cleavage within two stem-loop structures by RNase III, a nuclease specific for double-stranded RNA. One double-strand cut in each of two giant stem-loop regions releases precursors to 16S and 23S rRNAs. RNase E is similarly involved in the process-ing of 5S rRNA. Further maturation steps require the presence of particular ribosomal proteins, which be-gin to assemble on the precursor RNAs while tran-scription is still in progress. The embedded tRNA sequences are processed to give mature tRNAs, along the same routes used for other tRNA species, as discussed next.

RIII RIII

RIIIRIII

16SrRNA 23S

rRNA

tRNA tRNA

P1 P2

5SrRNA

1 or 2 0–2

Figure 24.20Structure of E. coli 30S pre-rRNA. Sequences complementary to two promoter sites (P1 and P2), RNase III cleavage sites (RIII) that release 16S and 23S species, and the locations of tRNA sequences embedded within the transcript are shown.

U

G

UU

U

U

A

GGG

ACC

U C

1 Ribonuclease Ecleavage

5 Pseudouridine

5 Pseudouridine

5 O2-Methylguanosine

5 4-Thiouridine

3 RNase P cleavageforms 59 end.

4 RNase Dremoves thesetwo nucleotides

2 RNase D removesthese 7 nucleotidesone by one

Approximately170 bases

—OH 39

5 2-Isopentenyladenosine

59

Figure 24.21Modification steps (1–4) that occur in the maturation of E. coli tRNATyr from its transcript and modified bases (5) seen in the mature tRNA. The tRNA sequence is shown in blue.

ConCept Bacterial transcripts undergo post-transcriptional process-ing, involving both endonucleolytic and exonucleolytic cleavage.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 747 30/10/14 2:02 PM

Page 19: The deadly nightshade (Amanita phalloides), one of the most

748 | Chapter 24 Transcription and Post-transcriptional Processing

Splicing

After being capped, the pre-mRNA becomes complexed with a num-ber of small nuclear ribonucleoprotein particles (snRNPs, often called “snurps”), which are themselves complexes of small nuclear RNAs ( snRNAs) and special splicing proteins. The snRNAs are all less than 300 nucleotides long. The snRNP–pre-mRNA complex is called a spliceosome, and it is here that the most elegant part of the processing takes place—the cutting and splicing that is necessary to excise introns from the pre-mRNA and join the ends of the two exons. In forming a spliceosome, snRNAs recognize and bind intron–exon splice sites by means of complementary sequences (Figure 24.23). Precise recognition

Once the proper 5′ terminus has been formed, ribonuclease D removes the remaining two nucleotides from the 3′ end (step 4). Should excessive “nibbling” occur through faulty control of RNase D activity, there is an enzyme (CCA nucleotidyltransferase) that will restore the CCA end to any tRNA in a nontemplated fashion. This enzyme specifically recognizes the 3′ terminus of tRNAs that lack the CCA end and catalyzes sequential reactions with a CTP, another CTP, and an ATP. Note that the CCA end is encoded by ev-ery tRNA gene, so the nucleotidyltransferase is essentially a repair enzyme.

Creation of the modified bases (see Chapters 4 and 27) occurs at the final stage, including methylations, thiolations, and reduction of uracil to dihydrouracil. In the specific example shown, the modifica-tions include formation of two pseudouridines, one 2-isopentenyl-adenosine, one O2-methylguanosine, and one 4-thiouridine (step 5).

These modifications serve to stabilize the tRNA molecules against intracellular degrada-tion, and in some cases they promote translational fidelity. The modifications are not es-

sential for tRNA function, however, because many tRNAs lacking the modifications are fully active in vitro. Pathways for eukaryotic tRNA synthesis are similar, including the involvement of ribonuclease P. In the yeast Saccharomyces cerevisiae the average tRNA molecule has more than 12 of its 75–80 bases modified.

Processing of eukaryotic mrNABacterial and eukaryotic cells differ significantly in the ways that mRNAs for protein-coding genes are produced and processed. Recall that bacterial mRNAs are synthesized at the nucleoid in direct contact with the cytosol and are immediately available for translation. A specific nucleotide sequence at the 5′ end recognizes a site on the bacterial rRNA, allowing attachment of the ribosome and initiation of translation, often even before transcription of the message is completed. Hence, there is little or no post-transcriptional processing of bacterial mRNAs.

In eukaryotes, mRNA is produced in the nucleus and must be ex-ported to the cytosol for translation. Furthermore, the initial product of transcription (pre-mRNA) includes all of the introns and substan-tial flanking regions; the introns must be removed before correct translation can occur. For these reasons, eukaryotic mRNA requires extensive processing before it can be used as a template. This process-ing, including polyadenylylation at the 3′ end (Section 24.4), takes place while mRNA is still in the nucleus.

Capping

The first modification occurs at the 5′ end of the pre-mRNA. First, one phosphate is removed hydrolytically from the triphosphate moiety at the 5′ terminal nucleotide. Next, the resulting 5′ diphosphate end at-tacks the α (inner) phosphate of GTP; in essence, the guanine nucleo-tide is added in reverse orientation (5′ S 5′). Together with the first two nucleotides of the chain, it forms a cap (Figure 24.22). The cap is further modified by the addition of methyl groups to the N-7 position of the guanine and to one or two sugar hydroxyl groups of the cap nucleotides. This cap structure positions the mRNA on the ribosome for translation, and also it probably contributes to stabilization of the message.

ConCept Post-transcriptional pro-cessing in bacteria involves cleavage of the primary transcript, modification of bases (in tRNA synthesis) and non-transcriptive nucleotide addition.

Splicesite

snRNA

Intron

Pre-mRNA

Exon

Figure 24.23Structure of a small nuclear RNA (snRNA). Human U1 RNA is shown, together with the intron–exon boundary region to which it binds in forming the spliceosome.

O

O

Base

Base

5′ end of mRNA

7-Methylguanosine 5′-to-5′triphosphate

bridge

2′3′

OH OHCH2 CH2

CH2

CH3

O

OTo 5′ end oftranscript

2′3′(CH3 or H)

H2N

CH3

2′ 3′ 5′ 5′

N

N

N

N1

O2

O

P

P P P

Figure 24.22Structure of a processed mRNA 5′ end. Details of the 5′ cap region are shown. Methyl groups that are added are in red.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 748 30/10/14 2:02 PM

Page 20: The deadly nightshade (Amanita phalloides), one of the most

24.5 Post-transcriptional Processing | 749

O

O2

2O

OP

O

O

O2

O2O

PO

P

O

2O OP

O

OH

O

OP

O

OH

Lariat

3'

O O

O2OP

OHO

O

O

AO G

OH

O

GO

GOAO

GO

U1 recognition site

E1 splice site Branch site

Base pairing ofcomplementary sequences

E2 splice site

59 39

39

59

E1 E2GGGUGAGUA

UCCUA

GUG GAC

CAG39

UACU

AUGA

UACUUAUCC CAGC

Loop and spliceformation

Spliced message

1

Looped intron

E2

E1

3959

UACUUAUCC

AUGAGUG

E2E1GGC

G

O O

O2O

OH2O

O

OP

P

G C

O

O2OP

O O

OP

O2

O

OHO

OH

OH29

29

2959

E1 splice sitepairs with asequence at thebranch site.

(a) The overall process. Exons (E1 and E2) are indicated by purple lines, and the intron by a black line or sequence. The E1 splice site, presumably with the aid of the small RNA U1, pairs with a sequence at the branch site to form a loop. The 29 hydroxyl on the branch site AMP carries out a transesterification reaction by attacking the phosphate of a GMP residue (blue) at the 39 end of the exon 1 (the E1 splice site). This frees the adjacent G (red) to attack with its 39 hydroxyl the phosphate 59 to the C at the 59 end of exon 2. The products are a spliced message and a looped intron “lariat” structure, which is then degraded.

(b) The first transesterification reaction. The second reaction (not shown) involves nucleophilic attack of the GMP 39-hydroxyl group (in red) upon the phosphate 59 to a CMP residue, as seen schematically in part a.

Figure 24.24A schematic view of the mechanism of mRNA splicing.

of splice sequences is essential, because even a single-base error would disrupt the sense of the genetic message. A schematic view of the chemistry of splicing is shown in Figure 24.24.

Excision of a single intron involves assembling and disassembling a spliceosome. Figure 24.25 depicts the overall process. The sequence begins with binding of the U1 snRNP to the G site at the 5′ end of the intron. The U2 snRNP then binds at the branch site. With continued assembly of the spliceosome, including the addition of several more snRNPs, the lariat loop in the intron is formed and the two exons are joined. Catalytically essential Mg2+ ions are located in the RNA of U6 snRNP, implying that RNA plays a catalytic role. Splicing has now been accomplished, and the products—a ligated mRNA and a looped intron—are released. As the spliceosome disintegrates, the looped in-tron is degraded, and the mRNA is exported from the nucleus.

Early steps in spliceosome assembly require more proteins and sites than are indicated in Figure 24.25. Kinetic experiments

indicate that all of the early reactions in spliceosome assembly are reversible, with the process being driven by the irreversibility of the late steps.

Because of the importance of accurate splicing for correct expres-sion of genetic information, splicing errors are responsible for many genetic diseases. As many as 15% of all genetic diseases arise from splicing errors. In some forms of thalassemia, a family of diseases arising from defective synthesis of hemoglobin chains (Chapter 7), mutations have been found in both the 5′ and 3′ splice sites of both genes for the b chain of hu-man hemoglobin. Usually, an in-correct mRNA chain is formed, which leads to premature termination of translation of the message. Another example of defective splicing is Hutchinson–Gilford progeria syndrome, a disease of accelerated aging in which afflicted individuals

ConneCtion Defects in pre-mRNA splicing are responsible for about 15% of known genetic diseases.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 749 30/10/14 2:02 PM

Page 21: The deadly nightshade (Amanita phalloides), one of the most

750 | Chapter 24 Transcription and Post-transcriptional Processing

die in their teens from aging-related symp-toms such as cardiovascular disease. The dis-ease affects a nuclear envelope protein called Lamin A. The mutation activates a cryptic do-nor splicing site in the gene for a lamin A pre-cursor, leading to accumulation of a truncated form of the protein.

Alternative SplicingOnce investigators discovered and described mRNA splicing, they were surprised to learn that the same pre-mRNA can undergo splic-ing in several different ways. The existence of alternative splicing means that different c o m b i -nat ions of exons from the same gene can be processed into different mature mRNAs and then undergo transla-tion into quite different proteins in different tissues or at different developmental stages of the same organism. Alternative splicing made it less surprising when the Human Ge-nome Project revealed the existence of far fewer genes than had been expected given the size of the genome and the complexity of Homo sapiens. Alternative splicing greatly enlarges the repertoire of proteins that can be encoded by a genome.

A dramatic example of alternative splicing is shown in Figure 24.26. The protein a-tropo-myosin is used in contractile systems in various cell types. Apparently, the need for functional domains coded for by different exons differs among the various uses of a-tropomyosin. Rather than having different genes expressed in different tissues, a single gene is employed, but the specific splicing patterns in different tissues provide a variety of a-tropomyosins. As the figure shows, there are two positions at which alternative choices can be made for which exon to splice in. The 3′ member of each of these pairs is the default exon; it will be chosen unless a specific cellular signal dictates otherwise.

Many mechanisms have been described for alternative splicing. A well-studied ex-ample involves calcitonin, a peptide hormone that in thyroid gland regulates calcium and phosphorus metabolism. In neuronal tissue, the gene for calcitonin is alternatively spliced to give a protein that acts as a vasodilator—two quite different gene products expressed from the same gene. Analysis of the human ge-nome sequence has revealed that most human genes, 90% or more, are subject to alternative splicing.

5′

Exon 1

G 3′

Exon 2

AGA

5′ G 3′AG

5′

5′

5′

G

3′AG

Intron

Pre-mRNA

3′

3′

1 U1 binds

A

A

U1

U1

U1

U2

2 U2 binding forms loop structure

ATP3 U4/6 and U5

bind

U5

U1

A

G

AG

U2

U4/6

AG

G

U6

U2

U5

U4

A

7 Looped intron removed

8 Intron degrades

ATP

4 Cleavage

U5

U1

AG

U2U4 U6

A

ATP5 Transfer

U1

AG

U2U4 U6

A

U5

5′

3′

5′ 3′Ligated product

mRNA Product released, spliceosome disassembles

6

Figure 24.25The overall process of splicing. The pre-mRNA plus assorted snRNPs assemble and disassemble a spliceosome, which carries out the splicing reaction. The snRNPs are designated U1, U2, and so on. U1 is bound in step 1, which together with U2 binding (step 2) leads to a looped structure. Factors U4/6 and U5 then bind (step 3), and cleavage and transfer then occur (steps 4, 5). The spliceosome disassembles, releasing the ligated product (step 6) and the looped intron (step 7). This is degraded into small oligonucleotides (step 8).

ConCept Alternative splicing allows one gene to specify different proteins.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 750 30/10/14 2:02 PM

Page 22: The deadly nightshade (Amanita phalloides), one of the most

24.5 Post-transcriptional Processing | 751

Nonmuscle/fibroblast

Striatedmuscle

a–TM EXON GENE ORGANIZATION

a–TM mRNA TRANSCRIPTS

59UT1–38 39–80 39–80

81–125

126–164

165–188

189–213

214–234

235–257

258–284

258–28439UT 39UT

Striatedmuscle

Myoblast

Smoothmuscle

Brain

Constitutive

Smooth muscle-specific

Experimentally documented pathway

Inferred pathway

Striated muscle-specific

VariableHepatoma

KEY

Figure 24.26A-Tropomyosin gene organization (rat) and seven alternative splicing pathways. Exons are indicated with their encoded amino acids (numbered). Experimentally documented splicing pathways (solid lines) and others (dotted lines) inferred from nuclease protection mapping are shown. The smooth (SM) and striated (STR) exons encoding amino acid residues 39–80 are mutually exclusive, and there are alternative 3′-terminal exons as well. UT signifies untranslated regions.

SummaryAll RNA is synthesized by the template-dependent copying of one DNA strand within a gene, catalyzed by RNA polymerase. RNA polymerases use 5′-ribonucleoside triphosphates as substrates, and they transcribe in a 5′ S 3′ direction. Bacteria synthesize all RNA classes with one poly-merase, whereas eukaryotic cells have different polymerases—I, II, and III—for synthesis of ribosomal, messenger, and transfer RNA species, respectively.

Strand selection and duplex unwinding and rewinding are carried out by RNA polymerase in conjunction with other proteins. The enzyme binds at a promoter site, by formation of specific DNA–protein contacts, largely involving the enzyme’s s subunit in bacteria and by a host of tran-scription factors and regulatory proteins in eukaryotes. Most initiations are abortive, but after a productive initiation involving different factors in bacteria and eukaryotes, elongation continues. In bacteria this is car-ried out by the core polymerase, a2bb′v. Transcription is highly proces-sive and is terminated by specific DNA sequences, sometimes in bacteria with the participation of r protein. Structural analyses have identified

common features in bacterial and eukaryotic RNA polymerases, which have helped to reveal common mechanistic features.

Post-transcriptional RNA processing includes cutting pre-rRNA transcripts, which encode large and small rRNA components, and trim-ming tRNA precursors in both bacterial and eukaryotic cells with the aid of the ribozyme ribonuclease P, followed by modification of several nucleotides in each tRNA molecule and nontranscriptive addition of the 3′ terminal CCA sequence. Bacterial mRNAs undergo little if any post- transcriptional processing, whereas eukaryotic messages are extensively processed, with polyadenylylation at the 3′ end, capping at the 5′ end with an inverted and modified guanine nucleotide residue, and splicing throughout the gene. Splicing is carried out by small nuclear ribonucleo-protein particles, guided by complementary base sequence interactions between splice sites and base sequences in the small nuclear RNA com-ponents. Alternative splicing is a process that expands the information content of a genome by directing quite different mRNA splicing patterns in different tissues and at different developmental stages.

Nonmuscle/fibroblast

Striatedmuscle

a–TM EXON GENE ORGANIZATION

a–TM mRNA TRANSCRIPTS

59UT1–38 39–80 39–80

81–125

126–164

165–188

189–213

214–234

235–257

258–284

258–28439UT 39UT

Striatedmuscle

Myoblast

Smoothmuscle

Brain

Constitutive

Smooth muscle-specific

Experimentally documented pathway

Inferred pathway

Striated muscle-specific

VariableHepatoma

KEY

ProblemSEnhanced by

for Biochemistry

MasteringChemistry® for Biochemistry provides select end-of-chapter problems and feedback-enriched tutorial problems, animations, and interactive figures to deepen your understanding of complex topics while practicing problem solving.

1. Outline an experimental approach to determining the average chain growth rate for transcription in vivo. Chain growth rate is the number of nucleotides polymerized per minute per RNA chain.

2. Outline an experimental approach to determining the average RNA chain growth rate during transcription of a cloned gene in vitro.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 751 30/10/14 2:02 PM

Page 23: The deadly nightshade (Amanita phalloides), one of the most

752 | Chapter 24 Transcription and Post-transcriptional Processing

(a) How many exons does the gene contain? How many introns? (b) Where in this structure would you expect to find a 5′,5′-internucleotide

bond? Where would you expect to find a polyadenylic acid sequence?

10. Introns in protein-coding genes of some eukaryotes are rarely shorter than 65 nucleotides long. What might be a rationale for this limitation?

11. Heparin is a polyanionic polysaccharide that blocks initiation by RNA polymerase by virtue of its binding to double-stranded DNA. But heparin inhibits only when added before the onset of transcription, and not if added after transcription begins. Explain this difference.

12. Estimate the time needed for E. coli RNA polymerase at 37 °C to transcribe the entire gene for a 50-kilodalton protein. What assumption or assumptions must be made for this estimate to be accurate?

13. Describe how RNA polymerase backtracking could function to increase the fidelity of transcription.

14. Is RNA polymerase saturated with substrates in vivo? Describe experiments that might indicate whether RNA polymerase is operating at Vmax with respect to its nucleotide substrates.

15. As discussed in the text, promoters were originally identified as consensus se-quences upstream from transcriptional start sites. What additional evidence might support the assignment of these sequences as parts of promoters?

16. In this chapter and elsewhere, we have described two types of ultracen-trifugation experiments—sucrose-gradient centrifugation and equilibrium density-gradient centrifugation. Briefly discuss these procedures with respect to the physical bases on which molecular species are separated in each method, the kinds of isotopic compounds used in the analysis, and the biological processes that have been or can be analyzed with each.

17. About 98% of the E. coli genome codes for proteins, yet mRNA, the template for protein synthesis, comprises only about 2% of the total RNA in the cell. Explain this apparent discrepancy.

3. Measurements of RNA chain growth rates are often led astray by the phenomenon of pausing, in which an RNA polymerase molecule stops transcription when it reaches certain sites, for intervals that may be as long as several seconds. How might pausing be detected?

4. Suppose you want to study the transcription in vitro of one particular gene in a DNA molecule that contains several genes and promoters. Without adding specific regulatory proteins, how might you stimulate transcription from the gene of interest relative to the transcription of the other genes on your DNA template? To make all of the complexes identical, you would like to arrest all transcriptional events at the same position on the DNA template before isolating the complex. How might you do this?

5. The tac promoter, an artificial promoter made from portions of the trp and lacUV5 promoters, has been introduced into a plasmid. It is a hybrid of the lac and trp (tryptophan) promoters, containing the -35 region of one and the -10 region of the other. This promoter directs transcription initiation more efficiently than either the trp or lac promoters. Why?

6. Explain the basis for the following statement: Transcription of two genes on a plasmid can occur without the concomitant action of a topoisomerase, but only if those two genes are oriented in opposite directions.

7. Some years ago, it was suggested that the function of the poly(A) tail on a eukaryotic message may be to “ticket” the message. That is, each time the message is used, one or more residues is removed, and the message is degraded after the tail is shortened below a critical length. Suggest an experi-ment to test this hypothesis.

8. For the original detection of DNA–RNA hybrid molecules, as described on page 733, the DNA–RNA hybrid was detected in a CsCl equilibrium gradi-ent. Why are RNA and DNA–RNA hybrids denser than double-stranded DNA?

9. Shown below is an R loop prepared for electron microscopy by annealing a purified eukaryotic messenger RNA with DNA from a genomic clone containing the full-length gene corresponding to the mRNA.

Cheung, A. C. M., and P. Cramer (2012) A movie of RNA polymerase II transcription. Cell 149:1431–1437. A description of the process with reference to a spectacular movie.

Egloff, S., M. Dienstbier, and S. Murphy (2012) Up-dating the RNA polymerase CTD code: Adding gene-specific layers. Trends Genet. 28:333–341. The phosphorylation state of RNA polymer-ase II C-terminal domain controls numerous functions.

Haag, J. R., and C. S. Pikaard (2011) Multisubunit RNA polymerases IV and V: Purveyors of non-coding RNA for plant gene silencing. Nat. Rev. Mol. Cell Biol. 12:483–492. Review of the plant RNA polymerases not described in this chapter.

Hoskins, A. A., and M. J. Moore (2012) The spliceo-some: A flexible, reversible macromolecular machine. Trends Biochem. Sci. 37:179–188. Short review of spliceosome structure and assembly.

Kornblitt, A. R., I. E. Schor, M. Alló, G. Dujardin, E. Petrillo, and M. J. Muñoz (2013) Alterna-tive splicing: A pivotal step between eukaryotic transcription and translation. Nat. Rev. Mol. Cell Biol. 13:153–165. Timely review of alternative splicing.

Ma, J., and M. D. Wang (2014) RNA polymerase is a powerful torsional motor. Cell Cycle 13:337–338. A new experimental approach to understanding mechanical properties of RNA polymerase.

Murakami, K., and 11 coauthors (2013) Architecture of an RNA polymerase II transcription pre-initiation complex. Science 342:709. Summary of a multipronged structural analysis, with refer-ence to the full paper.

Nudler, E. (2012) RNA polymerase backtracking in gene regulation and genome instability. Cell 149:1438–1445. Backtracking serves multiple functions.

Padgett, R. A. (2012) New connections between splicing and human disease. Trends Genet. 28:147–154. Many genetic diseases in humans are caused by defective splicing.

Petesch, S. J., and J. T. Lis (2012) Overcoming the nucleosome barrier during transcription elonga-tion. Trends Genet. 28:285–294. A review of one of the most difficult problems in understanding eukaryotic transcription.

Vannini, A., and P. Cramer (2012) Conservation between the RNA polymerase I, II, and III transcription initiation machines. Mol. Cell 45:439–446. Fundamental similarities in the ac-tions of the three eukaryotic RNA polymerases.

Werner, F., and D. Grohmann (2011) Evolution of multisubunit RNA polymerases in the three domains of life. Nat. Rev. Microbiol. 9:85–98. Comparison of bacterial, archaeal, and eukary-otic RNA polymerases.

referenceS

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 752 30/10/14 2:02 PM

Page 24: The deadly nightshade (Amanita phalloides), one of the most

24a DNA MicroarraysTOOLS OFBIOCHEMISTRY

The finding in the 1960s that single-stranded DNA binds irreversibly to membrane filters, and the development of recombinant DNA technology in the 1970s, led to a number of techniques for analyzing gene expression (i.e., measuring the levels of transcripts of particular genes in living cells). RNA could be radiola-beled in vivo and hybridized to gene-specific DNA—a cloned gene or a restriction frag-ment—and the bound radioactivity analyzed by autoradiography or in a liquid scintillation counter. Several techniques, such as Northern analysis, were based on these developments. Northern analysis is comparable to Southern analysis, except that in Northern analysis RNA is hybridized to DNA restriction fragments separated by gel electrophoresis and then immobilized on a filter. However, such ap-proaches allow the analysis of just one or a few genes in each experiment. With the availability of complete genome sequences, it became desirable to analyze levels of transcripts from many genes in a single experiment, that is, patterns of gene expression, which could be compared under different physiological condi-tions. Microarray technology makes this kind of analysis possible.

In a microarray experiment, minute amounts of gene-specific DNAs—usually several thousand—are immobilized on a sub-strate, such as glass or a membrane filter. The gene-specific DNAs are either cloned cDNAs or synthetic oligonucleotides. Using robotic tech-nology, the investigator “prints” single-stranded DNAs onto the substrate, which may be a microscope slide, suitably coated to bind the applied DNAs. The DNAs are printed as a large array, which allows the investigator to identify each gene from its position on the array. The DNAs are fixed irreversibly on the substrate, so that the “DNA chip” can be used repeatedly by stripping the annealed RNA targets off the chip after each experiment.

Typically, a microarray experiment involves the comparison of gene-expression profiles under different conditions— comparing a tumor with the tissue of origin, for example, or comparing a hormone-stimulated tissue with unstimulated tissue (see Figure 24A.1). The investigator wishes to learn which genes are activated under the conditions being analyzed and which are repressed. Total mRNA is

Imageanalysis

Imageacquisition

Scanning

(Fluorescencedetection)

Links to otherresources:

• cGAP• Locus link• Pathways

• UniGene database

Patternrecognition

Cross platformdata comparison

Oligonucleotidechip

• Oligo deposition• In situ synthesis DNA

microarray

Image Acquisition and Analysis

Robotic printing onglass slides ormembranes Test Reference

DNAmicroarray

Hybridization

Mix

Sequence validated cDNA clone setsavailable commercially

• Plate replication• PCR amplification of inserts

• Purification

Rob

otic

s fo

r hig

h th

roug

hput

par

alle

l pro

cess

ing

Array Fabrication

Organism (mouse, rat,worm, fly, plant, yeast,

bacteria, etc.)

Normalized library construction,EST sequencing, UniGene clustering,

sequence verification, rearrayinginto plate format

Target Preparation

Reverse transcriptionlabeling with fluorescent dyes

RNAextraction

Sample source

(Tissue, cell lines,pathology samples, organisms)

Database

Figure 24a.1Flowchart: Performing a microarray experiment.

753

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 753 30/10/14 2:02 PM

Page 25: The deadly nightshade (Amanita phalloides), one of the most

754

isolated from each tissue or cell culture and converted to a population of cDNAs using reverse transcriptase. During the enzymatic synthesis of the cDNAs, one of the deoxyribonucleoside triphosphates is tagged with a fluorescent dye. Typically, the reference sample is labeled with a red fluorophore, and the test sample is labeled with a green fluorophore. After cDNA synthesis is complete, the two samples are mixed and subjected to annealing conditions in the presence of the microarray. Unhybridized cDNAs are washed off, and the array is then scanned. Scanning at wavelengths corresponding to emission maxima of the fluorophores reveals which transcripts are more abun-dant in the test than in the reference (more green fluorescence) and which are less abundant (more red fluorescence). Analysis of the im-age reveals which genes were stimulated and which repressed under the conditions being tested. An example of a microarray analysis was shown in Figure 1.11.

Microarray technology has numerous applications in addition to measuring patterns of gene expression. For example, by using arrayed

oligonucleotides representing different mutant forms of a gene of interest, we can carry out DNA–DNA hybridization on the gene chip and identify mutations or single-nucleotide polymorphisms in biologi-cal samples.

ReferencesIoannidis, J. P. A., and 15 coauthors (2009) Repeatability of published microarray

gene expression analyses. Nat. Genet. 41:149–155. Analyses of data in 18 articles emphasize the importance of adequate controls, statistical analysis, and experimental details in generating and publishing meaningful microarray data.

Mittal, V. (2001) DNA array technology. In Sambrook, J., and D. W. Russell, eds., Molecular Cloning: A Laboratory Manual, 3rd ed. Vol. 3, pp. A.10.1–A.10.19. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory. Straightforward description of the technology.

24b Chromatin ImmunoprecipitationTOOLS OFBIOCHEMISTRY

So far the techniques we have discussed for identifying and character-izing binding sites for proteins such as RNA polymerase and repressors have involved single binding sites, whereas regulatory proteins such as nuclear hormone receptors act at multiple binding sites. Moreover, techniques such as footprinting are carried out in vitro, although our main interest is characterizing attachment sites for DNA-binding pro-teins in intact cells. Chromatin immunoprecipitation (ChIP) makes it possible to identify in vivo binding sites on a genome-wide basis.

The principle of ChIP is that any DNA-binding protein can be covalently attached to its DNA binding site(s) in vivo by use of a crosslinking reagent that can penetrate cell membranes and react covalently with both protein and DNA in a reversible manner. For-maldehyde is most commonly used, as shown in Figure 24B.1. After whole cells are treated with formaldehyde, chromatin is isolated and subjected to sonic oscillation under conditions that reduce the length of each DNA molecule to fragments several hundred base pairs long. The mixture is treated with antibody to the protein of interest, and the immunoprecipitated DNA–protein complexes are collected. At this point the crosslinks are broken, and the DNA that was precipitated along with the protein is subjected to sequence analysis. Originally this was done most often by PCR amplification of the DNA followed

by conventional sequence analysis. This approach, however, makes it possible only to analyze known and suspected DNA sequences. An alternate approach involves cloning all DNA fragments in the mixture and then using PCR primers corresponding to flanking sequences on the vector, followed by sequence analysis of each clone. With the ad-vent of microarray technology (Tools of Biochemistry 24A) it became possible to screen the DNA fragments against a DNA microarray containing hundreds or thousands of DNA sequences. This technique is called ChIP-chip, because the immunoprecipitated DNA fragments are identified on a gene chip. A still more recent innovation, called ChIP-seq, sequences all of the immunoprecipitated DNA in parallel using a next-generation sequencing technology that permits simulta-neous sequence analysis of hundreds or thousands of DNA molecules.

ReferencesFurey, T. S. (2012) ChIP-seq and beyond: New and improved methodologies to

detect and characterize protein-DNA interactions. Nat. Rev Genet.13:840–847. A recent review with references to the earlier literature.

Rhee, H. S., and B. F. Pugh (2012) Comprehensive genome-wide protein-DNA interactions detected at single-nucleotide resolution. Cell 147:1408–1419. Recent improvements in ChIP techniques.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 754 30/10/14 2:02 PM

Page 26: The deadly nightshade (Amanita phalloides), one of the most

755

Formaldehydecrosslinking

Fragment DNA

TF

TF

Chip

ChIP-seq

TF ChipDNA

Antibody

Immunoprecipitationusing antibody to TF

Hybridization

Microarray probes

intergenicGenic

1st generation

2nd generation

Bound

Unbound

107 parallelsequencing

reactions

Microarray

ReferenceDNA

Mapped positions of sequencing reads

ChIP-chip

111000 111500 112000 112500 113000 113500111186420

Rea

ds

302520151050

YBL059W SHP1

PTH2

11111

Figure 24b.1Chromatin immunoprecipitation. A transcription factor (TF) is crosslinked to the DNA sites to which it binds in chromatin. After fragmentation and immunoprecipitation using antibody to TF, the TF-bound DNA fragments are identified either by microarray analysis, in which the DNA is bound to a red fluorophore and subjected to hybridization analysis with an array of genomic fragments (ChIP-chip), or subjected to massive parallel sequence analysis (ChIP-seq). Data from B. J. Venters and B. F. Pugh (2009) How eukaryotic genes are transcribed, Critical Reviews in Biochemistry and Molecular Biology 44:117–141.

M24_MATH9763_01_SE_C24_730-755v4.0.1.indd 755 30/10/14 2:02 PM