structures2 (1)

Upload: jorge-alencar

Post on 05-Apr-2018

220 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/31/2019 structures2 (1)

    1/22

  • 7/31/2019 structures2 (1)

    2/22

    NMR data do not uniquely define a 3Dprotein structure (single set of

    coordinates)

    Restraints are ranges of allowed distances, angles etc. rather thansingle values, reflecting the fact that the experimental data contain

    uncertainties both in measurement and interpretation.

    Only a limited number of the possible restraints are observableexperimentallydue to peak overlap/chemical shift degeneracy, lack of

    stereospecific assignments, etc.

    View of protein structure as a single set of atomic coordinates mayitself be physically unrealistic!proteins are dynamic molecules

  • 7/31/2019 structures2 (1)

    3/22

    The NMR Ensemble

    NMR methods not calculate a single structure, but rather repeat a structurecalculation many times to generate an ensemble of structures

    The structure calculations are designed to thoroughly explore all regions ofconformational space that satisfy the experimentally derived restraints

    At the same time, they often impose some physical reasonableness on thesystem, such as bond angles, distances and proper stereochemistry.

    The ideal result is an ensemble whichA. satisfies all the experimental restraints (minimizes violations)

    B. at the same time accurately represents the full permissible conformationalspace under the restraints (maximizes RMSD between ensemble members)

    C. looks like a real protein

  • 7/31/2019 structures2 (1)

    4/22

    The NMR Ensemble

    At right, an ensemble of 25

    structures for Syrian hamster prionprotein(only the backbone isshown)

    Liu et al.Biochemistry (1999) 38, 5362.

    The fact that NMRstructures are reportedas ensembles givesthem a fuzzyappearance which isboth informative andsometimes annoying

  • 7/31/2019 structures2 (1)

    5/22

    NMR structures include hydrogen

    coordinates

    X-ray structures do not generally include hydrogen atoms in atomiccoordinate files, because the heavy atoms dominate the diffraction

    pattern and the hydrogen atoms are not explicitly seen.

    By contrast, NMR restraints such as NOE distance restraints andhydrogen bond restraints often explicitly include the positions ofhydrogen atoms. Therefore, these positions are reported in the PDBcoordinate files.

  • 7/31/2019 structures2 (1)

    6/22

    Methods for structure calculation

    distance geometry (DG) restrained molecular dynamics (rMD)

    simulated annealing (SA) hybrid methods

  • 7/31/2019 structures2 (1)

    7/22

    Starting points for calculations

    to get the most unbiased, representative ensemble, it is wise to start

    the calculations from a set of randomly generated starting structures. Alternatively, in some methods the same initial structure is used for

    each trial structure calculation, but the calculation trajectory is pushedin a different initial direction each time using a random-numbergenerator.

  • 7/31/2019 structures2 (1)

    8/22

    DG--Distance geometry

    In distance geometry, one uses the nOe-derived distance restraints togenerate a distance matrix, which one then uses as a guide incalculating a structure

    Structures calculated from distance geometry will produce the correct

    overall fold but usually have poor local geometry (e.g. improper bondangles, distances)

    hence distance geometry must be combined with some extensiveenergy minimization method to generate physically reasonable

    structures

  • 7/31/2019 structures2 (1)

    9/22

    rMD--Restrained molecular dynamics

    Molecular dynamics involves computing the potential energy Vwith respect to the atomic coordinates. Usually this is defined asthe sum of a number of terms:

    Vtotal=Vbond+ Vangle+ Vdihedr+ VvdW+ Vcoulomb+VNMR

    the first five termshere are real energy terms corresponding tosuch forces as van der Waals and electrostatic repulsions andattractions, cost of deforming bond lengths and angles...thesecome from some standard molecular force field like CHARMMor AMBER

    the NMR restraints are incorporated into the VNMRterm, which isa pseudoenergy or pseudopotential term included torepresent the cost of violating the restraints

  • 7/31/2019 structures2 (1)

    10/22

    Pseudo-energy potentials for rMD

    Generate fake energy potentials representing the cost of violating thedistance or angle restraints. Heres an example of a distance restraintpotential

    KNOE(rij-rij1)2 ifrijr

    ij

    u

    0 ifrijl

  • 7/31/2019 structures2 (1)

    11/22

    Example of nOe pseudopotential

    rijl rij

    u

    0

    VNOE potentialrisessteeplywith degree

    of violation

  • 7/31/2019 structures2 (1)

    12/22

    SA-Simulated annealing

    SA is essentially a special implementation of rMD and usessimilar potentials but employs raising the temperature of thesystem and then slow cooling in order not to get trapped in localenergy minima

    SA is very efficient at locating the global minimum of the targetfunction

  • 7/31/2019 structures2 (1)

    13/22

    Dealing with ambiguous restraints

    often not possible to tell which atoms are involved in a NOESYcrosspeak, either because of a lack of stereospecific assignments orbecause multiple protons have the same chemical shift

    sometimes an ambiguous restraint is included but is expressedambiguously in the restraint file, e.g. 3 HA --> 6 HB#, where the #

    wildcard indicates that the beta protons of residue 6 are notstereospecifically assigned. This is quite commonly done forstereochemical ambiguities.

    it is also possible to leave ambiguous restraints out and then try toresolve them iterativelyusing multiple cycles of calculation. This is

    often done for restraints that involve more complicated ambiguities, e.g.3 HA-->10 HN, 43 HN, or 57 HN, where three amides all have the sameshift.

    can also make stereospecific assignments iteratively using what arecalled floating chiralitymethods

  • 7/31/2019 structures2 (1)

    14/22

    A B

    C

    9.52 ppm

    4.34 ppm

    4.34 ppm

    Due to resonance overlap

    between atoms B and C,an NOE crosspeakbetween 9.52 ppmand 4.34 ppm couldbe A to C or A to B--this restraint is ambiguous

    But if an ensemble generated withthis ambiguous restraintleft out shows that A is neverclose to B, then the restraint mustbe A to C.

    Example of resolving an ambiguityduring structure calculation

    9-11

    3-4

    range of interatomicdistances observedin trial ensemble

  • 7/31/2019 structures2 (1)

    15/22

    Iterative structure calculation withassignment of ambiguous restraints

    source:

    http://www.pasteur.fr/recherche/unites/Binfs/aria/

    there are programs suchas ARIA, with automaticroutines for iterativeassignment of ambiguous

    restraints. The key tosuccess is to makeabsolutely sure therestraints you start withare right!

    start with some setof unambiguous NOEsand calculate an ensemble

  • 7/31/2019 structures2 (1)

    16/22

    Acceptance criteria: choosingstructures for an ensemble

    typical to generate 50 or more trial structures, but not all will convergeto a final structure that is physically reasonable or consistent withthe experimentally derived NMR restraints. We want to throw suchstructures away rather than include them in our reported ensemble.

    these are typical acceptance criteriafor including calculated structuresin the ensemble: no more than 1 nOe distance restraint violation greater than 0.4 no dihedral angle restraint violations greater than 5 no gross violations of reasonable molecular geometry

    sometimes structures are rejected on other grounds as well: too many residues with backbone angles in disfavored regions of

    Ramachandran space too high a final potential energy in the rMD calculation

  • 7/31/2019 structures2 (1)

    17/22

    Precision of NMR Structures(Resolution)

    judged by RMSD of superimposed ensemble of accepted structures

    RMSDs for both backbone (Ca, N, CC=O) and all heavy atoms (i.e.everything except hydrogen) are typically reported, e.g.

    bb 0.6 heavy 1.4

    sometimes only the more ordered regions are included in the reportedRMSD, e.g. for a 58 residue protein you will see RMSD (residues 5-58)

    if residues 1-4 are completely disordered.

  • 7/31/2019 structures2 (1)

    18/22

    Reporting ensemble RMSD

    two major ways of calculating RMSD of the ensemble:

    pairwise: compute RMSDs for all possible pairs of structures in theensemble, and calculate the mean of these RMSDs

    from mean: calculate a mean structure from the ensemble and

    measure RMSD of each ensemble structure from it, then calculatethe mean of these RMSDs

    pairwise will generally give a slightly higher number, so be awarethat these two ways of reporting RMSD are not completely equal.Usually the Materials and Methods, or a footnote somewhere in thepaper, will indicate which is being used.

  • 7/31/2019 structures2 (1)

    19/22

    Minimized average structure

    a minimized average is just that: a mean structure is calculated fromthe ensemble and then subjected to energy minimization to restorereasonable geometry, which is often lost in the calculation of a mean

    this is NMRs way ofgenerating a single representative structure fromthe data. It is much easier to visualize structural features from aminimized average than from the ensemble.

    for highly disordered regions a minimized average will not beinformative and may even be misleading--such regions are sometimesleft out of the minimized average

    sometimes when an NMR structure is deposited in the PDB, there willbe separate entries for both the ensemble and the minimized average.

    It is nice when people do this. Alternatively, a member of the ensemblemay be identified which is considered the most representative (oftenthe one closest to the mean).

  • 7/31/2019 structures2 (1)

    20/22

    How many restraints do we need to get

    a high-resolution NMR structure? usually ~15-20 nOe distance restraints per residue, but the total # is not

    as important as how many long-rangerestraints you have, meaninglong-range in the sequence: |i-j|> 5, where iandj are the two residues

    involved good NMR structures usually have ~ 3.5 long-range distance

    restraints per residue in the structured regions to get a very good quality structure, it is usually also necessary to have

    some stereospecificassignments, e.g. b hydrogens; Leu, Val methyls

  • 7/31/2019 structures2 (1)

    21/22

    Assessing Structure Quality

    NMR spectroscopists usually run their ensemble through the programPROCHECK-NMR to assess its quality

    high-resolution structure will have backbone RMSD ~0.8 , heavyatom RMSD ~1.5

    low RMS deviation from restraints (good agreement w/restraints) will have good stereochemical quality:

    ideally >90% of residues in core (most favorable) regions ofRamachandran plot

    very few unusual side chain angles and rotamers (as judged by

    those commonly found in crystal structures) low deviations from idealized covalent geometry

  • 7/31/2019 structures2 (1)

    22/22

    Structural Statistics Tables

    list of restraints,# and type

    precision ofstructure (RMSD)

    agreement ofensemble structureswith restraints (RMS)

    calculated energies

    sometimes also see listings of Ramachandran statistics, deviations

    from ideal covalent geometry, etc.