biol 102 chp 17 powerpoint

Chapter 17

From Gene to Protein Rob Swatski

Associate Professor of Biology

HACC – York Campus

Overview: The Flow of Genetic Information

DNA information = specific sequences of nucleotides

DNA protein synthesis

Proteins: link genotype & phenotype

Gene expression: DNA directs protein synthesis

- 2 stages: transcription & translation

How was the fundamental relationship between genes & proteins discovered?

- examine evidence from studies of

metabolic defects

1909: British physician Archibald Garrod 1st suggested that genes dictate phenotypes with enzymes

- symptoms of inherited disease reflects inability to synthesize a certain enzyme

- required understanding that cells synthesize & degrade molecules using metabolic pathways

Neurospora bread mold

Nutritional Mutants in Bread Mold

George Beadle & Edward Tatum exposed Neurospora to x-rays

- created mutants that could not survive on minimal medium

(cannot synthesize certain molecules)

Identified 3 classes of arginine-deficient mutants

- each lacked a different enzyme needed to make arginine

Developed the “one gene – one enzyme hypothesis”

- each gene directs the synthesis of a specific enzyme

Minimal medium

No growth: Mutant cells cannot grow and divide

Growth: Wild-type cells growing and dividing

EXPERIMENT

RESULTS

Classes of Neurospora crassa

Wild type Class I mutants Class II mutants Class III mutants

Minimal medium (MM) (control)

MM ornithine

MM citrulline

Co

nd

itio

n

MM arginine (control)

Summary of results

Can grow with or without any supplements

Can grow on ornithine, citrulline, or arginine

Can grow only on citrulline or arginine

Require arginine to grow

Growth No growth

CONCLUSION

Wild type

Class I mutants (mutation in

gene A)

Class II mutants (mutation in

gene B)

Class III mutants (mutation in

gene C)

Gene (codes for enzyme)

Gene A

Gene B

Gene C

Precursor Precursor Precursor Precursor

Enzyme A Enzyme A Enzyme A Enzyme A

Enzyme B Enzyme B Enzyme B Enzyme B

Ornithine Ornithine Ornithine Ornithine

Enzyme C Enzyme C Enzyme C Enzyme C

Citrulline Citrulline Citrulline Citrulline

Arginine Arginine Arginine Arginine

The Products of Gene Expression: A Developing Story

• Some proteins aren’t enzymes… so researchers later revised the hypothesis to “one gene – one protein”

• But, many proteins consist of several polypeptides, each having its own gene

Beadle & Tatum’s hypothesis is now restated as the:

“one gene–one polypeptide hypothesis”

Basics of Transcription & Translation

RNA: the bridge between genes & the proteins they code

• Transcription: synthesis of RNA under the direction of DNA

- produces messenger RNA (mRNA)

• Translation: synthesis of a polypeptide under the direction of mRNA

- ribosomes: the sites of translation

Eukaryotes:

the nuclear envelope separates transcription from translation

- eukaryotic RNA transcripts are modified via RNA processing to yield finished mRNA

Prokaryotes:

mRNA transcripts are immediately translated without further processing

(a) Prokaryotic Cell (Bacteria)

TRANSCRIPTION DNA

mRNA

TRANSLATION Ribosome

Polypeptide

Primary transcript: initial RNA transcript from a gene before processing

Central dogma:

cells are governed by a cellular chain of command:

DNA RNA Protein

(b) Eukaryotic cell

TRANSCRIPTION

Nuclear envelope

DNA

Pre-mRNA RNA PROCESSING

mRNA

(b) Eukaryotic cell

TRANSCRIPTION

Nuclear envelope

DNA

Pre-mRNA RNA PROCESSING

mRNA

TRANSLATION Ribosome

Polypeptide

The Genetic Code

How are the instructions for assembling amino acids into proteins encoded into DNA?

There are 20 amino acids,

… but there are only 4 nucleotide bases in DNA

How many bases correspond to an amino acid?

Codons: Triplets of Nucleotides

The flow of information from gene protein is based on a triplet code

- series of non-overlapping 3-nucleotide “words”

Triplet: smallest unit that can code for amino acids

- “AGT” = placement of serine at its correct position in the polypeptide

Transcription: one of the two DNA strands (template strand)

provides a pattern for ordering the nucleotide sequence in the mRNA transcript

Translation:

mRNA base triplets (codons) are read in the 5 to 3 direction

- each codon specifies the amino acid (1 of 20) and it’s correct position in a polypeptide

DNA template strand

TRANSCRIPTION

mRNA

TRANSLATION

Protein

Amino acid

Codon

Trp Phe Gly

5

5

Ser

U U U U U 3

3

5 3

G

G

G G C C

T

C

A

A

A A A A A

T T T T

T

G

G G G

C C C G G

DNA molecule

Gene 1

Gene 2

Gene 3

C C

Cracking the Code

• All 64 codons were deciphered by the mid-1960s

• Of the 64 triplets, 61 code for amino acids

- 3 triplets are stop codons that end translation:

UAA, UAG, UGA

• The genetic code is redundant but not ambiguous

• Codons must be read in the correct reading frame (correct groupings) in order to synthesize the specified polypeptide

Second mRNA base

Firs

t m

RN

A b

ase

(5 e

nd

of

cod

on

)

Thir

d m

RN

A b

ase

(3

en

d o

f co

do

n)

UUU

UUC

UUA

CUU

CUC

CUA

CUG

Phe

Leu

Leu

Ile

UCU

UCC

UCA

UCG

Ser

CCU

CCC

CCA

CCG

UAU

UAC Tyr

Pro

Thr

UAA Stop

UAG Stop

UGA Stop

UGU

UGC Cys

UGG Trp

G C

U

U

C

A

U

U

C

C

C A

U

A

A

A

G

G

His

Gln

Asn

Lys

Asp

CAU CGU

CAC

CAA

CAG

CGC

CGA

CGG

G

AUU

AUC

AUA

ACU

ACC

ACA

AAU

AAC

AAA

AGU

AGC

AGA

Arg

Ser

Arg

Gly

ACG AUG AAG AGG

GUU

GUC

GUA

GUG

GCU

GCC

GCA

GCG

GAU

GAC

GAA

GAG

Val Ala

GGU

GGC

GGA

GGG Glu

Gly

G

U

C

A

Met or start

UUG

G

Evolution of the Genetic Code

The genetic code is shared by all living things

- Genes can be transcribed & translated after being transplanted from one species to another

(a) Tobacco plant expressing a firefly gene

(b) Pig expressing a jellyfish gene

Synthesis of an RNA Transcript

The 3 Stages of Transcription:

1. Initiation

2. Elongation

3. Termination

Transcription: DNA RNA

RNA synthesis is catalyzed by RNA polymerase

- pries DNA strands apart

- hooks RNA nucleotides together

The RNA is complementary to the DNA template strand

Follows same base-pairing rules as DNA

- except uracil substitutes for thymine

Promoter: DNA sequence that RNA polymerase attaches to

Transcription unit: section of DNA that is transcribed

Promoter Transcription unit

DNA Start point

RNA polymerase

5 5 3

3

Initiation

3 3

1

RNA transcript

5 5

Unwound DNA

Template strand of DNA

2 Elongation Rewound DNA

5

5 5 3 3 3

RNA transcript

3 Termination 5

5

5 3 3

3

Completed RNA transcript

RNA Polymerase Binding & Initiation of Transcription

Promoters: signal initiation of RNA synthesis

- TATA box promoter is crucial in forming the initiation complex in eukaryotes

Transcription factors:

needed to help bind RNA polymerase & initiate transcription

Transcription initiation complex:

completed assembly of transcription factors & RNA polymerase bound to a promoter

Transcription initiation complex forms 3

DNA Promoter

Nontemplate strand

5 3

5 3

5 3

Transcription factors

RNA polymerase II

Transcription factors

5 3

5 3

5 3

RNA transcript

Transcription initiation complex

5 3

TATA box

T

T T T T T

A A A A A

A A

T

Several transcription factors bind to DNA 2

A eukaryotic promoter 1

Start point Template strand

Elongation of the RNA Strand

As RNA polymerase moves along DNA, it untwists the double helix, 10-20 bases at a time

- transcription rate = 40 nucleotides/sec

A gene can be transcribed simultaneously by several RNA polymerases

Nucleotides are added to the 3’ end of the growing RNA molecule

Elongation

RNA polymerase

Non-template strand of DNA

RNA nucleotides

3 end

Direction of transcription

(“downstream”) Template strand of DNA

Newly made RNA

3

5

5

Termination of Transcription

In bacteria:

RNA polymerase stops transcription at end of the terminator & the mRNA can be translated

without further modification

In eukaryotes:

RNA polymerase continues transcription after the pre-mRNA is cleaved from the growing RNA

chain

- polymerase eventually falls off the DNA

Eukaryotic Cells Modify RNA After Transcription

Enzymes in the eukaryotic nucleus modify pre-mRNA (RNA processing) before mRNA “gene” enters

cytoplasm

Both ends of the primary transcript are usually altered

- and some interior parts of RNA are usually cut-out & other parts spliced together

Alteration of mRNA Ends

Each end of pre-mRNA is modified in a particular way:

- the 5 end gets a modified nucleotide 5 cap

- the 3 end gets a poly-A tail

Why Modify? - Facilitates export of mRNA

- Protects mRNA from hydrolytic enzymes

- Helps ribosomes attach to the 5 end

Protein-coding segment

Polyadenylation signal

5 3

3 5 5 Cap UTR Start codon

G P P P

Stop codon

UTR

AAUAAA

Poly-A tail

AAA AAA …

Split Genes & RNA Splicing

Most genes & their RNA transcripts have long noncoding regions (introns) that lie between

coding regions

- intron = intervening sequences (“in the way”)

Exons: coding regions

- expressed & translated into amino acid sequences

RNA splicing: removes introns & joins exons

- creates mRNA molecule with a continuous coding sequence

5 Exon Intron Exon

5 Cap Pre-mRNA Codon numbers

130 31104

mRNA 5 Cap

5

Intron Exon

3 UTR

Introns cut out and exons spliced together

3

105 146

Poly-A tail

Coding segment

Poly-A tail

UTR 1146

Some RNA splicing is carried out by spliceosomes

- consist of a variety of proteins & small nuclear ribonucleoproteins (snRNPs = “snurps”)

- snRNPs can recognize the splice sites

RNA transcript (pre-mRNA)

Exon 1 Exon 2 Intron

Protein

snRNA

snRNPs

Other proteins

5

5

Spliceosome

Spliceosome components

Cut-out intron mRNA

Exon 1 Exon 2 5

Ribozymes Catalytic RNA molecules that act as enzymes & splice RNA

- not all biological catalysts are proteins!

How can RNA function as an enzyme?

- can form a 3-D structure because it can base-pair with itself

- some RNA bases contain functional groups

- can hydrogen-bond with other nucleic acids

Some genes can encode more than 1 kind of polypeptide, depending on which segments are

treated as exons during RNA splicing

- the actual # of different proteins an organism can produce is much greater than its number of genes

Alternative RNA Splicing

Proteins often have a modular architecture consisting of discrete regions called domains

- different exons can code for different domains in a protein

Exon shuffling can result in the evolution of new proteins

Gene

DNA Exon 1 Exon 2 Exon 3 Intron Intron

Transcription

RNA processing

Translation

Domain 2

Domain 3

Domain 1

Polypeptide

Molecular Components of Translation

A cell translates mRNA message into protein with the help of transfer RNA (tRNA)

tRNA molecules are not identical:

- each carries a specific amino acid on one end

- each has an anticodon on the other end that base-pairs with a complementary codon on mRNA

Polypeptide

Ribosome

Amino acids

tRNA with amino acid attached

tRNA

Anticodon

Codons 3 5

mRNA

The Structure & Function of tRNA

tRNA: one RNA

strand, 80 nucleotides long

- when flattened, it resembles a cloverleaf

C

Amino acid attachment site

Hydrogen bonds

Anticodon

3

5

Can twist & fold into an “L”-shaped 3-D molecule through hydrogen-bonding

Amino acid attachment site

3

3

5

5

Hydrogen bonds

Anticodon Anticodon

(b) 3-D structure (c) Symbol used in this book

Translation requires 2 steps: “The Match Game”

1. tRNA and its amino acid are matched by the enzyme aminoacyl-tRNA synthetase

- forms “charged tRNA”

2. tRNA anticodon and an mRNA codon are matched

Flexible pairing at the 3rd base of a codon is called wobble

- allows some tRNAs to bind to more than 1 codon

Aminoacyl-tRNA synthetase (enzyme)

Amino acid

P P P Adenosine

ATP

P

P

P

P P i

i

i

Adenosine

tRNA

Adenosine P

tRNA

AMP

Computer model

Amino acid

Aminoacyl-tRNA synthetase

Aminoacyl tRNA (“charged tRNA”)

Ribosomes

Ribosomes facilitate specific coupling of tRNA anticodons with mRNA codons in protein synthesis

- 2 ribosomal subunits (large & small) are made of proteins & ribosomal RNA (rRNA)

Growing polypeptide

Exit tunnel tRNA molecules

Large subunit

Small subunit

(a) Computer model of functioning ribosome

mRNA

E P A

5 3

Exit tunnel

A site (Aminoacyl- tRNA binding site)

Small subunit

Large subunit

P A

P site (Peptidyl-tRNA binding site)

mRNA binding site

(b) Schematic model showing binding sites

E site (Exit site)

E

A ribosome has 3 binding sites for tRNA:

- A site: holds the tRNA carrying the next amino acid to be

added to the chain

- P site: holds the tRNA carrying the growing polypeptide

chain

- E site (Exit): where discharged tRNAs leave the ribosome

Amino end

mRNA E

(c) Schematic model with mRNA and tRNA

5 Codons

3

tRNA

Growing polypeptide

Next amino acid to be added to polypeptide chain

Building a Polypeptide

The 3 stages of translation:

1. Initiation

2. Elongation

3. Termination

All 3 stages require protein factors

Initiation of Translation

Initiation stage: brings together mRNA, a tRNA with the 1st amino acid, & the 2 ribosomal subunits

1. First, the small ribosomal subunit binds with mRNA and a special initiator tRNA

2. Then the small subunit moves along mRNA until it reaches the start codon (AUG)

- Initiation factors bring in the large subunit to complete the translation initiation complex

3

3 5 5 U

U A

A C

G

GTP GDP Initiator

tRNA

mRNA

5 3 Start codon

mRNA binding site

Small ribosomal subunit

5

P site

Translation initiation complex

3

E A

Large ribosomal subunit

Elongation of the Polypeptide Chain

Elongation stage: amino acids are added one by one

Each addition involves elongation factors and occurs in 3 steps:

a. Codon recognition

b. Peptide bond formation

c. Translocation

Amino end of polypeptide

mRNA

5

3 E

P site

A site

GTP

GDP

E

P A


mRNA

5

3 E

P site

A site

GTP

GDP

E

P A

E

P A


mRNA

5

3 E

P site

A site

GTP

GDP

E

P A

E

P A

GDP

GTP

Ribosome ready for next aminoacyl tRNA

E

P A

Termination of Translation

Termination:

- occurs when a stop codon in mRNA reaches the A site

The A site accepts a release factor protein

- adds a water molecule instead of an amino acid

- this releases the polypeptide

- the translation complex comes apart

Release factor

3

5

Stop codon (UAG, UAA, or UGA)

5

3

2

Free polypeptide

2 GDP

GTP

5

3

Polyribosomes

Groups of ribosomes that simultaneously translate one mRNA, forming a polyribosome (polysome)

- allows a cell to quickly make many copies of a polypeptide

Growing polypeptides

Completed polypeptide

Incoming ribosomal subunits

Start of mRNA

(5 end) End of mRNA

(3 end) (a)

Ribosomes

mRNA

(b) 0.1 µm

Post-Translation

A protein is usually not functional immediately after translation

- requires further post-translational modification

It spontaneously coils and folds into its correct 3-D shape

- some activated by enzymes that cleave them

- others assemble into protein subunits

Targeting Polypeptides to Specific Locations

Two populations of ribosomes are found in cells:

- Free ribosomes: synthesize proteins that function in the cytosol

- Bound ribosomes: synthesize proteins on ER and those that will be secreted from the cell

Ribosomes are identical and can switch from free to bound

Polypeptide synthesis always begins and ends in the cytosol

- unless the polypeptide signals the ribosome to attach to the ER

Polypeptides destined for the ER or for secretion are marked by a signal peptide

- a signal-recognition particle (SRP) binds to the signal peptide

- the SRP brings the signal peptide & its ribosome to the ER

Ribosome

mRNA

Signal peptide

Signal- recognition particle (SRP)

CYTOSOL Translocation complex

SRP receptor protein

ER LUMEN

Signal peptide removed

ER membrane

Protein

Point Mutations - chemical changes in just 1 base pair of a gene

- a change in one DNA nucleotide can lead to the production of an abnormal protein

Wild-type hemoglobin DNA

mRNA

Mutant hemoglobin DNA

mRNA

3

3

3

3

3

3

5

5

5

5

5

5

C C T T T

T G G A A A

A

A A A G G U

Normal hemoglobin Sickle-cell hemoglobin

Glu Val

Types of Point Mutations

- Base-pair substitutions

- Base-pair insertions or deletions

Wild type

3 DNA template strand

3

3 5

5

5 mRNA

Protein

Amino end

Stop

Carboxyl end

A instead of G

3

3

3

U instead of C

5

5

5

Stop

Silent mutations: have no effect on the amino acid because of redundancy in the genetic code

Wild type

DNA template strand

3

5

mRNA

Protein

5

Amino end

Stop

Carboxyl end

5

3

3

T instead of C

A instead of G

3

3

3

5

5

5

Stop

Missense: still codes for an amino acid, but not necessarily the right amino acid

Wild type

DNA template strand

3

5

mRNA

Protein

5

Amino end

Stop

Carboxyl end

5

3

3

A instead of T

U instead of A

3

3

3

5

5

5

Stop

Nonsense: changes an amino acid codon into a stop codon, nearly always leading to a nonfunctional protein

Insertions and Deletions

- additions or losses of nucleotide pairs in a gene

Can have a disastrous effect on the protein more often than substitutions

- may produce a frameshift mutation, which alters the reading frame

Wild type

DNA template strand

3

5

mRNA

Protein

5

Amino end

Stop

Carboxyl end

5

3

3

Extra A

Extra U

3

3

3

5

5

5

Stop

Frameshift Mutation causing immediate nonsense (1 base-pair insertion)

Wild type

DNA template strand

3

5

mRNA

Protein

5

Amino end

Stop

Carboxyl end

5

3

3

missing

missing

3

3

3

5

5

5

Frameshift Mutation causing extensive missense (1 base-pair deletion)

Wild type

DNA template strand

3

5

mRNA

Protein

5

Amino end

Stop

Carboxyl end

5

3

3

missing

missing

3

3

3

5

5

5

No Frameshift Mutation, but one amino acid missing (3 base-pair deletion)

Stop

What Is a Gene?

We have considered a gene as:

- A discrete unit of inheritance

- A region of specific nucleotide sequence in a chromosome

- A DNA sequence that codes for a specific polypeptide chain

In summary, a gene can be defined as:

- a region of DNA that can be expressed to

produce a final functional product, either a

polypeptide or an RNA molecule

TRANSCRIPTION

RNA PROCESSING

DNA

RNA transcript

3

5 RNA polymerase

RNA transcript (pre-mRNA)

Intron

Exon

NUCLEUS

Aminoacyl-tRNA synthetase

AMINO ACID ACTIVATION

Amino acid

tRNA CYTOPLASM

Growing polypeptide

3

Activated amino acid

mRNA

TRANSLATION

Ribosomal subunits

5

E

P

A

A Anticodon

Ribosome

Codon

E

biol 102 chp 17 powerpoint

Education

transcription gene

gene protein

genetic code

products of gene expression

direction of mrna

messenger rna mrna translation

direction of dna

gene aprecursorgene