eccb 2020 gent introduction to protein structure validation (and improvement) gert vriend protein...

36
ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Upload: garey-perry

Post on 13-Dec-2015

218 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

ECCB 2020 Gent

Introduction to protein structure validation(and improvement)

Gert Vriend

Protein structure validation

Page 2: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

The plan for today:Gert Vriend: Introduction to validationRobbie Joosten: X-ray structure validation and improvementJurgen Doreleijers: NMR structure validation (and improvement )Bas Vroling (and you): YASARAAll (and you): General validation practicalsSplit-up in groups: General validation issues

X-ray specific issuesNMR specific issuesContinuation of validation practicals

At the end: Overview of validation and related facilities

And in-between we have coffee, lunch, tea, and whatever else they throw at us at any moment that anybody feels like it.

Protein structure validation

Page 3: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Structure validation

Everything that can go wrong, will go wrong, especially with things as complicated as protein structures.

Page 4: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

What is real?

Page 5: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

What is real?

ATOM 1 N LEU 1 -15.159 11.595 27.068 1.00 18.46ATOM 2 CA LEU 1 -14.294 10.672 26.323 1.00 9.92ATOM 3 C LEU 1 -14.694 9.210 26.499 1.00 12.20ATOM 4 O LEU 1 -14.350 8.577 27.502 1.00 13.43ATOM 5 CB LEU 1 -12.829 10.836 26.772 1.00 13.48ATOM 6 CG LEU 1 -11.745 10.348 25.834 1.00 15.93ATOM 7 CD1 LEU 1 -11.895 11.027 24.495 1.00 13.12ATOM 8 CD2 LEU 1 -10.378 10.636 26.402 1.00 15.12

Page 6: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

X-ray

Page 7: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

X-ray

Page 8: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

X-ray

‘FFT-inv’

FFT-inv

And now move the atoms around till the calculated reflections best match the observed ones.

Page 9: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Multiple minima

X-ray refinement / multiple minima

Page 10: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

X-ray R-factor

Error = Σ w.(obs-calc)2

R-factor = Σ w.|obs-calc|

Page 11: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

X-ray resolution

Page 12: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

NMR data collection

Page 13: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

NMR data

NMR data consists of short inter-atomic distances between atoms. We call these NOEs.Most NOEs are between close neighbours in the sequence. Those hold little information.The ‘good’ NOEs are between atoms far away in the sequence. There are few of those, normally.NOEs are known with low precision. E.g. NOEs are binned 2.5-4.0, 4.0-5.5, and 5.5-7.0.NMR can also measure some angles, and relative orientations. The latter, called RDCs are powerful.

Page 14: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

NMR Q-factor

Error = Σ NOE/RDC-violations + Energy term2

Page 15: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

NMR versus X-ray

With X-ray you measure reflections. Each reflection holds information about each atom. With NMR you measure pair-wise distances, angles, and orientations. These all hold local information.

X-ray requires crystals, and crystals cause/are artefacts.NMR is in solution, but provides much less precision.

Page 16: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

NMR versus X-ray

NMR X-ray‘Error’ 1-2 Å 0.1-0.5 ÅMobility yes not reallyCrystal artefacts no yesMaterial needed 20 mg 1 mgCost of hardware 4 M Euro near infinite (share)Drug design no almost

Better combine and use the best of both worlds.

Page 17: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Why validation ?

Why does a sane (?) human being spend twenty years to search for millions of errors in the PDB?

Page 18: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Validation because:

Everything we know about proteins comes from PDB files.

Errors become less dangerous when you know about them.

And, going back to the red thread through this series, if a template is wrong the model will be wrong.

Page 19: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

What kind of errors can the software find?

Administrative errors.Crystal-specific errors.NMR-specific errors.Really wrong things.Improbable things.Things worth looking at.Ad hoc things.

Page 20: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Smile or cry?

A 5RXN 1.2 B 7GPB 2.9 C 1DLP 3.3 D 1BIW 2.5

Page 21: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Why? Simple, proteins are very complex.

Page 22: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

X-ray specific

Page 23: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Little things hurt big

Page 24: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

How bad is bad?

X-ray

Page 25: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Check with force fields

A force field is a set of parameters together with a set of rules to use those parameters to see how normal something is. Most force fields are designed to score events, or to predict the future.

In structure validation we often look at structures and count ‘things’ For example, we count that the number of buried hydrogen bond donors that do not make a hydrogen bond is 4.6+/-1.2 per 100 amino acids in well-solved proteins. So we call that normal, and now, using ΔG=-RTln(K), we can calculate the energy penalty for proteins with more than 4.6 unsatisfied buried unsatisfied hydrogen bonds.

Page 26: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Contact Probability

Page 27: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Contact Probability

Page 28: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Contact probability box

Page 29: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

One slide about homology modelling

Page 30: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

His, Asn, Gln ‘flips’

Page 31: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Hydrogen bond network

Page 32: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Your best check:

Page 33: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

How difficult can it be?

1CBQ

2.2 A

Page 34: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

How difficult can it be?

Page 35: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Errors or discoveries?

Buried histidine.

Warning for buried histidine triggered biochemical follow -up and new mechanism for KH-module of Vigilin. (A. Pastore, 1VIG).

Page 36: ECCB 2020 Gent Introduction to protein structure validation (and improvement) Gert Vriend Protein structure validation

Acknowledgements:

Elmar Krieger Sander Nabuurs Chris SpronkMaarten Hekkelman

Rob Hooft

Robbie Joosten