Principles of protein structure and stability
![Page 1: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/1.jpg)
Principles of protein structure and stability.
![Page 2: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/2.jpg)
A peptide bond is formed between two amino acids.
![Page 3: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/3.jpg)
Backbone conformation is described by φ and ψ angles.
Picture from T. Przytycka, 2002
![Page 4: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/4.jpg)
Hierarchy of protein structure.
1. Amino acid sequence
2. Secondary structure
3. Tertiary structure
4. Quaternary structure
Picture from Branden & Tooze “Introduction to protein structure”
![Page 5: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/5.jpg)
Right-handed alpha-helix.
The helix is stabilized by hydrogen bonds (HB) between the backbone –NH and the backbone carbonyl oxygen.
Geometrical characteristics:
- 3.6 residues per turn
- translation of 5.4 Å per turn
- translation of 1.5 Å per residue
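The three geometrical numbers above are linked by simple arithmetic: the rise per turn is the number of residues per turn times the rise per residue. A minimal sketch of that check follows; the function names are illustrative and not part of the original slides.

```python
# Helix parameters quoted on the slide.
RESIDUES_PER_TURN = 3.6
RISE_PER_RESIDUE = 1.5  # Å


def helix_pitch():
    """Translation along the helix axis per full turn, in Å."""
    return RESIDUES_PER_TURN * RISE_PER_RESIDUE


def rotation_per_residue():
    """Rotation about the helix axis per residue, in degrees."""
    return 360.0 / RESIDUES_PER_TURN


def helix_length(n_residues):
    """Approximate length in Å of an ideal helix with n_residues residues."""
    return n_residues * RISE_PER_RESIDUE


print(f"pitch: {helix_pitch():.1f} Å")
print(f"rotation per residue: {rotation_per_residue():.0f} degrees")
print(f"10-residue helix: {helix_length(10):.1f} Å")
```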
![Page 6: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/6.jpg)
β-strand and β-sheet.
![Page 7: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/7.jpg)
Loop regions are at the surface of protein molecules.
Adjacent antiparallel β-strands are joined by hairpin loops.
Loops are more flexible than helices and strands.
Loops can carry binding and active sites, functionally important sites.
Branden & Tooze “Introduction to protein structure”
![Page 8: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/8.jpg)
Protein classification based on the secondary structure content.
• Class α - proteins with only α-helices
• Class β – proteins with only β-sheets
• Class α+β - proteins with α-helices and β-sheets
![Page 9: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/9.jpg)
Protein stability.
Anfinsen’s experiments:
![Page 10: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/10.jpg)
Native proteins have low stability…
Scale of interactions in proteins:
- Interactions weaker than kT ≈ 0.6 kcal/mol are neglected.
- Interactions stronger than ΔG = 10 kcal/mol are too large.
Potential energy = Van der Waals + Electrostatic + Hydrophobic
Figure: free energy G along the reaction coordinate, with the unfolded (U) and folded (F) states separated by ΔG.
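The lower bound of the energy scale, kT ≈ 0.6 kcal/mol, can be reproduced from the gas constant. This is a small sketch assuming a temperature of 300 K, which the slide does not state explicitly.

```python
# Gas constant in kcal/(mol*K); kT per mole of particles is R*T.
R_KCAL = 1.987204e-3


def thermal_energy(temperature_k=300.0):
    """Thermal energy kT in kcal/mol at the given temperature (K)."""
    return R_KCAL * temperature_k


kt = thermal_energy()          # 300 K is an assumed room temperature
print(f"kT at 300 K: {kt:.2f} kcal/mol")
print(f"10 kcal/mol corresponds to {10.0 / kt:.1f} kT")
```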
![Page 11: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/11.jpg)
Electrostatic force.
Coulomb’s law for two point charges (ε = 1 in a vacuum):
$$E_{el} = \frac{q_1 q_2}{\varepsilon d}$$
q – point charge,
d – distance between the charges,
ε – dielectric constant
Example: Na+ Cl- in a vacuum,
d = 2.76 Å,
E = 120 kcal/mol
ε = 2-3 inside the protein,
ε = 80 in water
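The Na+ Cl- number above can be checked with a short calculation. The sketch below assumes charges in units of the elementary charge, distances in Å, and the usual conversion factor of about 332 kcal·Å/(mol·e²); the function name is illustrative.

```python
# Conversion factor so that charges in e and distances in Å give kcal/mol.
COULOMB_CONSTANT = 332.06


def coulomb_energy(q1, q2, distance, dielectric=1.0):
    """Coulomb energy (kcal/mol) of two point charges in a uniform dielectric."""
    return COULOMB_CONSTANT * q1 * q2 / (dielectric * distance)


# Na+ Cl- at 2.76 Å in the three environments mentioned on the slide.
# The energy is negative (attractive); the slide quotes its magnitude.
for label, eps in [("vacuum", 1.0), ("protein interior", 3.0), ("water", 80.0)]:
    energy = coulomb_energy(+1, -1, 2.76, dielectric=eps)
    print(f"{label:17s} eps = {eps:4.0f}  E = {energy:8.2f} kcal/mol")
```

The same ion pair that is worth about 120 kcal/mol in a vacuum drops to roughly 1.5 kcal/mol in water, which is why the dielectric environment matters so much.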
![Page 12: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/12.jpg)
Dipolar interactions.
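Dipole moment:
$$\vec{\mu} = q\vec{d}$$
Interaction energy of two dipoles separated by the vector r:
$$E_{dd}(\vec{r}) = \frac{\vec{\mu}_1 \cdot \vec{\mu}_2}{r^3} - \frac{3(\vec{\mu}_1 \cdot \vec{r})(\vec{\mu}_2 \cdot \vec{r})}{r^5}$$
Partial charges of the peptide group: N -0.20, H +0.20, C +0.42, O -0.42
Peptide bond:
μ = 3.5 D,
Water molecule:
μ = 1.85 D.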
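To get a feeling for the size of dipolar interactions, the standard dipole-dipole energy, (μ1·μ2)/r³ − 3(μ1·r)(μ2·r)/r⁵, can be evaluated for two peptide-like dipoles of 3.5 D. This is a sketch under assumptions: the 5 Å separation and the two orientations are made up for illustration, and the unit conversions (1 D ≈ 0.2082 e·Å, 332.06 kcal·Å/(mol·e²)) are not from the slides.

```python
import math

COULOMB_CONSTANT = 332.06      # kcal*Å/(mol*e^2)
DEBYE_TO_E_ANGSTROM = 0.2082   # 1 Debye expressed in e*Å


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def dipole_dipole_energy(mu1, mu2, r, dielectric=1.0):
    """Energy (kcal/mol) of two point dipoles (Debye) separated by vector r (Å)."""
    m1 = [c * DEBYE_TO_E_ANGSTROM for c in mu1]
    m2 = [c * DEBYE_TO_E_ANGSTROM for c in mu2]
    dist = math.sqrt(dot(r, r))
    energy = dot(m1, m2) / dist**3 - 3.0 * dot(m1, r) * dot(m2, r) / dist**5
    return COULOMB_CONSTANT * energy / dielectric


# Two parallel 3.5 D dipoles, 5 Å apart (hypothetical geometry).
head_to_tail = dipole_dipole_energy((3.5, 0, 0), (3.5, 0, 0), (5.0, 0, 0))
side_by_side = dipole_dipole_energy((3.5, 0, 0), (3.5, 0, 0), (0, 5.0, 0))
print(f"head-to-tail: {head_to_tail:.2f} kcal/mol")
print(f"side-by-side: {side_by_side:.2f} kcal/mol")
```

Head-to-tail parallel dipoles attract and side-by-side parallel dipoles repel with half the magnitude, which is one way to see why the aligned peptide dipoles along a helix axis interact favorably.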
![Page 13: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/13.jpg)
Van der Waals interactions.
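London dispersion energy: the attractive -C_6/d^6 term, arising from the interaction of fluctuating dipoles (δ+ δ-).
Lennard-Jones potential:
$$E(d) = \frac{C_n}{d^n} - \frac{C_6}{d^6}, \quad n > 6$$
Figure: E (kcal/mol) versus the distance between centers of atoms, showing the repulsion and attraction regions.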
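The Lennard-Jones form C_n/d^n − C_6/d^6 can be sketched numerically. The example below takes n = 12 and chooses the coefficients so that the minimum has a depth of 0.2 kcal/mol at 4 Å; both values are illustrative assumptions, not parameters given on the slide.

```python
def lj_coefficients(well_depth, r_min, n=12):
    """Return (C_n, C_6) such that the potential has its minimum -well_depth at r_min."""
    c6 = well_depth * r_min**6 * n / (n - 6)
    cn = well_depth * r_min**n * 6 / (n - 6)
    return cn, c6


def lennard_jones(d, cn, c6, n=12):
    """E(d) = C_n / d^n - C_6 / d^6 (repulsion minus dispersion attraction)."""
    return cn / d**n - c6 / d**6


# Hypothetical atom pair: well depth 0.2 kcal/mol at 4.0 Å.
cn, c6 = lj_coefficients(well_depth=0.2, r_min=4.0)
for d in (3.0, 3.5, 4.0, 5.0, 6.0, 8.0):
    print(f"d = {d:4.1f} Å   E = {lennard_jones(d, cn, c6):7.3f} kcal/mol")
```

The steep d^-n wall dominates at short distances, while the d^-6 attraction decays quickly, so the interaction is only a fraction of kT per atom pair.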
![Page 14: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/14.jpg)
Hydrogen bonds
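Figure: hydrogen bond between a donor (D) and an acceptor (A), for example backbone N-H :::: O=C, with a length of about 3 Å; between two water molecules, HOH :::: OHH, the bond joins a δ+ hydrogen to a δ- oxygen.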
![Page 15: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/15.jpg)
Hydrogen bonding patterns in globular proteins.
1. Most HB are local, close in sequence.
2. Most HB are between backbone atoms.
3. Most HB are within single elements of secondary structure.
4. Proteins are almost equally saturated by HB: 0.75 HB per amino acid.
![Page 16: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/16.jpg)
Disulfide bonds.
PROTEIN(SH, HS) + GS-SG ⇌ PROTEIN(SH, S-SG) + GSH ⇌ PROTEIN(S-S) + 2GSH
- Breakdown and formation of S-S bonds are catalyzed by disulfide isomerase.
- In the cell, S-S bonds are reversible; the free energy of their formation is close to zero.
- Secreted proteins have a lot of S-S bonds since outside the cell the equilibrium is shifted towards their formation.
![Page 17: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/17.jpg)
Hydrophobic effect.
Hydrophobic interaction – tendency of nonpolar compounds to transfer from an aqueous solution to an organic phase.
- The entropy of water molecules decreases when they make contact with a nonpolar surface, so the free energy increases.
- As a result, upon folding, nonpolar AA are buried inside the protein, while polar and charged AA stay outside.
![Page 18: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/18.jpg)
Hydrophobicities of amino acids.
![Page 19: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/19.jpg)
Cooperativity of protein interactions
Protein denaturation is a first-order (“all-or-none”) transition.
As T increases:
1. Globule expansion, loose packing.
2. As expansion crosses the barrier, liberation of side chains and increase in entropy.
Figure: energy distribution W(E) at temperatures T1, T’ and T2.
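The "all-or-none" character of denaturation is often illustrated with a simple two-state equilibrium model, in which only the folded and unfolded states are populated. The sketch below uses that textbook model rather than the W(E) picture on the slide; the enthalpy values and the midpoint temperature are hypothetical, and the heat-capacity change is neglected.

```python
import math

R_KCAL = 1.987204e-3  # gas constant, kcal/(mol*K)


def unfolding_free_energy(t, delta_h=100.0, t_m=330.0):
    """Two-state estimate dG(T) = dH * (1 - T/Tm), in kcal/mol (assumed constant dH)."""
    return delta_h * (1.0 - t / t_m)


def fraction_unfolded(t, delta_h=100.0, t_m=330.0):
    """Equilibrium fraction of unfolded molecules at temperature t (K)."""
    dg = unfolding_free_energy(t, delta_h, t_m)
    return 1.0 / (1.0 + math.exp(dg / (R_KCAL * t)))


# A large dH (many interactions breaking together) gives a sharp transition;
# a small dH gives a gradual one.
for t in (310.0, 320.0, 330.0, 340.0, 350.0):
    sharp = fraction_unfolded(t, delta_h=100.0)
    gradual = fraction_unfolded(t, delta_h=25.0)
    print(f"T = {t:5.1f} K   cooperative: {sharp:5.3f}   weakly cooperative: {gradual:5.3f}")
```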
![Page 20: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/20.jpg)
Summary:
- The hydrophobic effect is mostly responsible for making a compact globule. The final specific tertiary structure is formed by van der Waals interactions, HB and disulfide bonds.
- The secret of the stability of native structures is not in the magnitude of the interactions but in their cooperativity.
![Page 21: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/21.jpg)
Classwork I: CN3D viewer.
• Go to http://ncbi.nlm.nih.gov
• Select alpha-helical protein (hemoglobin)
• Select beta-stranded protein (immunoglobulin)
• Select multidomain protein 1I50, chain “A”
• View them in CN3D
![Page 22: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/22.jpg)
PDB databank.
• Archive of protein crystal structures, established in 1971 with several structures; in 2002 it held 17000 structures, including NMR structures
• Data processing: data deposition, annotation and validation
• PDB code – nXYZ, where n is an integer and X, Y, Z are characters
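The code format can be checked with a regular expression. This sketch assumes the classic four-character identifier, with a leading digit from 1 to 9 followed by three letters or digits; the slide itself only says "integer" and "characters".

```python
import re

# Assumed pattern: one digit 1-9, then three alphanumeric characters.
PDB_CODE = re.compile(r"[1-9][A-Za-z0-9]{3}")


def is_pdb_code(code):
    """Return True if the string looks like a four-character PDB code."""
    return PDB_CODE.fullmatch(code) is not None


for candidate in ("1I50", "4HHB", "I150", "1I5"):
    print(candidate, is_pdb_code(candidate))
```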
![Page 23: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/23.jpg)
Content of Data in the PDB.
• Organism, species name
• Full protein sequence
• Chemical structure of cofactors and prosthetic groups
• Names of all components of the structure
• Qualitative description of the structural characteristics
• Literature citations
• Three-dimensional coordinates
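In the legacy PDB text format, the three-dimensional coordinates are stored in fixed-width ATOM records. The sketch below reads the main fields by column position; the helper that builds a demo record and the coordinate values are hypothetical, and only a few of the record's fields are handled.

```python
def format_atom_line(serial, name, res_name, chain, res_seq, x, y, z):
    """Build a minimal fixed-width ATOM record (demo helper, not a full writer)."""
    return (f"ATOM  {serial:>5} {name:<4} {res_name:>3} {chain}{res_seq:>4}    "
            f"{x:8.3f}{y:8.3f}{z:8.3f}")


def parse_atom_line(line):
    """Extract atom name, residue, chain and x, y, z from an ATOM record."""
    return {
        "name": line[12:16].strip(),      # columns 13-16
        "res_name": line[17:20].strip(),  # columns 18-20
        "chain": line[21],                # column 22
        "res_seq": int(line[22:26]),      # columns 23-26
        "xyz": (float(line[30:38]),       # columns 31-38
                float(line[38:46]),       # columns 39-46
                float(line[46:54])),      # columns 47-54
    }


# Made-up coordinates for a C-alpha atom of the first residue of chain A.
line = format_atom_line(1, " CA ", "MET", "A", 1, 27.340, 24.430, 2.614)
print(line)
print(parse_atom_line(line))
```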
![Page 24: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/24.jpg)
Protein secondary structure prediction.
Assumptions:
• There should be a correlation between amino acid sequence and secondary structure (SS). A short amino acid sequence is more likely to form one type of SS than another.
• Local interactions determine SS. The SS of a residue is determined by its neighbors (usually a sequence window of 13-17 residues is used).
Exceptions: short identical amino acid sequences can sometimes be found in different SS.
Accuracy: 65% - 75%; the highest accuracy is for prediction of an α-helix.
![Page 25: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/25.jpg)
Methods of SS prediction.
• Chou-Fasman method
• GOR (Garnier, Osguthorpe and Robson)
• Neural network method
![Page 26: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/26.jpg)
Chou-Fasman method.
Analysis of the frequencies with which each amino acid occurs in the different types of SS.
Ala, Glu, Leu and Met – strong predictors of alpha-helices,
Pro and Gly – predicted to break the helix.
$$Score(a_i, S) = \log\left(\frac{f(a_i, S)}{f(S)}\right)$$
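A log-ratio score of this kind can be estimated by counting residues in a set of proteins with known SS. The sketch below reads f(a, S) as the fraction of residues of type a found in state S and f(S) as the overall fraction of residues in S, which is one plausible interpretation; the two tiny sequences and their SS strings are invented, and this is not the published Chou-Fasman parameter table.

```python
import math
from collections import Counter


def propensity_scores(sequences, structures, state="H"):
    """Score(a, S) = log(f(a, S) / f(S)) for every residue type seen in state S."""
    total = Counter()
    in_state = Counter()
    for seq, ss in zip(sequences, structures):
        for aa, s in zip(seq, ss):
            total[aa] += 1
            if s == state:
                in_state[aa] += 1
    f_state = sum(in_state.values()) / sum(total.values())
    # Residue types never observed in the state are skipped (log of zero).
    return {aa: math.log(in_state[aa] / total[aa] / f_state)
            for aa in total if in_state[aa] > 0}


# Toy data (hypothetical): H = helix, C = everything else.
sequences = ["AELMAGPG", "GPAELKGP"]
structures = ["HHHHHCCC", "CCHHHHCC"]
scores = propensity_scores(sequences, structures)
for aa, score in sorted(scores.items()):
    print(aa, f"{score:+.2f}")
```

A positive score means the residue is found in the state more often than average; in this toy set Gly and Pro never occur in a helix, so they get no score at all.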
![Page 27: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/27.jpg)
GOR method.
Assumption: the SS of an amino acid is determined by the neighboring residues (usually a window of 17 residues is used).
GOR uses principles of information theory for predictions.
The method maximizes the information difference between two competing hypotheses: that residue “a” is in structure “S”, and that “a” is not in structure “S”.
$$I(S; a) = \log\left(\frac{f_{S,a}}{f_S}\right)$$
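The information difference between the two hypotheses can be written down for a single residue. In this sketch, f_{S,a} is read as the frequency of state S among residues of type a and f_S as the overall frequency of S (an interpretation, not stated on the slide); the frequencies in the example are made up, and the full method additionally sums such terms over the residues of the window.

```python
import math


def information(f_state_given_a, f_state):
    """I(S; a) = log(f_{S,a} / f_S)."""
    return math.log(f_state_given_a / f_state)


def information_difference(f_state_given_a, f_state):
    """I(S; a) - I(not S; a): positive when residue a favors state S."""
    return (information(f_state_given_a, f_state)
            - information(1.0 - f_state_given_a, 1.0 - f_state))


# Hypothetical frequencies: 30% of all residues are helical.
print(f"helix former : {information_difference(0.60, 0.30):+.2f}")
print(f"neutral      : {information_difference(0.30, 0.30):+.2f}")
print(f"helix breaker: {information_difference(0.10, 0.30):+.2f}")
```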
![Page 28: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/28.jpg)
Neural network method.
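Figure: feed-forward neural network for SS prediction.
- Input sequence window: LAWPGEVGASTYP
- Input layer: each residue is encoded as a binary vector of 21 units with a single 1 (for example 0 0 0 0 0 1 0 ... 0), giving the signals Sj
- Hidden layer: units Hj, connected to the input signals by weights Wij
- Output layer: three units Oi (α, β, coil), for example 1 0 0
- Predicted SS: the state of the strongest output unit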
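The data flow of such a network can be sketched in a few lines. Everything below is an illustration under assumptions: the 21st input unit is taken to be a spacer symbol, the hidden layer size of 5 is arbitrary, the units are sigmoid, and the weights are random, so the "prediction" of this untrained network is meaningless. Only the encoding and the forward pass are the point.

```python
import math
import random

ALPHABET = "ACDEFGHIKLMNPQRSTVWY-"   # 20 amino acids + spacer (assumed layout)
STATES = ("alpha", "beta", "coil")


def encode_window(window):
    """One binary vector of 21 units per residue, concatenated over the window."""
    signal = []
    for aa in window:
        units = [0] * len(ALPHABET)
        units[ALPHABET.index(aa)] = 1
        signal.extend(units)
    return signal


def layer(inputs, weights):
    """Each unit outputs sigmoid(sum_j W_ij * S_j)."""
    return [1.0 / (1.0 + math.exp(-sum(w * s for w, s in zip(row, inputs))))
            for row in weights]


def random_weights(n_out, n_in, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]


def predict(window, w_hidden, w_output):
    hidden = layer(encode_window(window), w_hidden)
    output = layer(hidden, w_output)
    return STATES[output.index(max(output))]


rng = random.Random(0)
window = "LAWPGEVGASTYP"             # 13-residue window from the slide
w_hidden = random_weights(5, len(window) * len(ALPHABET), rng)
w_output = random_weights(3, 5, rng)
print(len(encode_window(window)), "input units")
print("untrained prediction for the central residue:", predict(window, w_hidden, w_output))
```

In a real predictor the weights are fitted on proteins of known structure, and the window is slid along the sequence to predict one residue at a time.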
![Page 29: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/29.jpg)
PHD – neural network program with multiple sequence alignments.
• A BLAST search of the input sequence is performed and similar sequences are collected.
• A multiple alignment of the similar sequences is used as input to a neural network.
• The sequence pattern is more pronounced in a multiple alignment than in a single input sequence.
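One common way to feed a multiple alignment to a network is to replace the single 1 per position by the observed residue frequencies in each alignment column. The sketch below shows that idea only; the four aligned fragments are invented, and the exact input encoding used by PHD may differ.

```python
from collections import Counter

ALPHABET = "ACDEFGHIKLMNPQRSTVWY-"


def alignment_profile(aligned_sequences, alphabet=ALPHABET):
    """Per-column residue frequencies of a multiple alignment (rows of equal length)."""
    n_seq = len(aligned_sequences)
    profile = []
    for col in range(len(aligned_sequences[0])):
        counts = Counter(seq[col] for seq in aligned_sequences)
        profile.append([counts[aa] / n_seq for aa in alphabet])
    return profile


# Hypothetical alignment of four similar fragments ('-' marks a gap).
alignment = ["LAWPG", "LAFPG", "IAWPG", "LSW-G"]
profile = alignment_profile(alignment)
for col, freqs in enumerate(profile):
    seen = {aa: f for aa, f in zip(ALPHABET, freqs) if f > 0}
    print(col, seen)
```

Conserved columns give a single frequency close to 1, while variable columns spread the signal over several residues, which is the extra information a single sequence cannot provide.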
![Page 30: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/30.jpg)
Classwork
• Go to http://ncbi.nlm.nih.gov, search for protein “flavodoxin” in Entrez, retrieve its amino acid sequence.
• Go to http://cubic.bioc.columbia.edu/predictprotein and run PHD on the sequence.
![Page 31: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/31.jpg)
Definition of protein domains.
• Geometry: group of residues with a high contact density; the number of contacts within domains is higher than the number of contacts between domains.
- chain-continuous domains
- chain-discontinuous domains
• Kinetics: domain as an independently folding unit.
• Physics: domain as a rigid body linked to other domains by flexible linkers.
• Genetics: minimal fragment of gene that is capable of performing a specific function.
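The geometric definition can be made concrete by counting residue contacts within and between proposed domains. The sketch below assumes one coordinate per residue (for instance the Cα atom) and an 8 Å contact cutoff; both the cutoff and the six toy coordinates are illustrative assumptions.

```python
import math


def count_contacts(coords, domains, cutoff=8.0):
    """Return (contacts within domains, contacts between domains)."""
    within = between = 0
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            if math.dist(coords[i], coords[j]) <= cutoff:
                if domains[i] == domains[j]:
                    within += 1
                else:
                    between += 1
    return within, between


# Toy example: two compact groups of three residues each (coordinates in Å).
coords = [(0, 0, 0), (4, 0, 0), (0, 4, 0),
          (10, 0, 0), (14, 0, 0), (10, 4, 0)]
domains = ["A", "A", "A", "B", "B", "B"]
within, between = count_contacts(coords, domains)
print(f"contacts within domains: {within}, between domains: {between}")
```

A good domain assignment is one for which the first number is clearly larger than the second; domain-parsing programs search for the split that maximizes this difference.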
![Page 32: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/32.jpg)
Domains as recurrent units of proteins.
• The same or similar domains are found in different proteins.
• Each domain performs a specific function.
• Proteins evolve through domain duplication and domain shuffling.
• The total number of different types of domains is small (~1000 – 3000).
![Page 33: Principles of protein structure and stability](https://reader036.vdocuments.mx/reader036/viewer/2022062518/56814411550346895db0af18/html5/thumbnails/33.jpg)
The Conserved Domain Architecture Retrieval Tool (CDART).
• Performs similarity searches of the NCBI Entrez Protein Database based on domain architecture, defined as the sequential order of conserved domains in proteins.
• The algorithm finds protein similarities across significant evolutionary distances using sensitive protein domain profiles. Proteins similar to a query protein are grouped and scored by architecture.