advanced bioinformatics methods for proteomics

93
Advanced bioinformatics methods for proteomics Lars Juhl Jensen

Upload: lars-juhl-jensen

Post on 10-May-2015

443 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Advanced bioinformatics methods for proteomics

Advanced bioinformaticsmethods for proteomics

Lars Juhl Jensen

Page 2: Advanced bioinformatics methods for proteomics

three parts

Page 3: Advanced bioinformatics methods for proteomics

signaling networks

Page 4: Advanced bioinformatics methods for proteomics

association networks

Page 5: Advanced bioinformatics methods for proteomics

text mining

Page 6: Advanced bioinformatics methods for proteomics

Part 1signaling networks

Page 7: Advanced bioinformatics methods for proteomics

phosphoproteomics

Page 8: Advanced bioinformatics methods for proteomics

Linding, Jensen, Ostheimer et al., Cell, 2007

Page 9: Advanced bioinformatics methods for proteomics

in vivo phosphosites

Page 10: Advanced bioinformatics methods for proteomics

kinases are unknown

Page 11: Advanced bioinformatics methods for proteomics

sequence specificity

Page 12: Advanced bioinformatics methods for proteomics

Miller, Jensen et al., Science Signaling, 2008

Page 13: Advanced bioinformatics methods for proteomics

NetPhorest

Page 14: Advanced bioinformatics methods for proteomics

Miller, Jensen et al., Science Signaling, 2008

Page 15: Advanced bioinformatics methods for proteomics

motif atlas

Page 16: Advanced bioinformatics methods for proteomics

kinases

Page 17: Advanced bioinformatics methods for proteomics

phospho-binding proteins

Page 18: Advanced bioinformatics methods for proteomics

phosphatases

Page 19: Advanced bioinformatics methods for proteomics

protein-specific

Page 20: Advanced bioinformatics methods for proteomics

no context

Page 21: Advanced bioinformatics methods for proteomics

co-activators

Page 22: Advanced bioinformatics methods for proteomics

protein scaffolds

Page 23: Advanced bioinformatics methods for proteomics

localization

Page 24: Advanced bioinformatics methods for proteomics

expression

Page 25: Advanced bioinformatics methods for proteomics

association network

Page 26: Advanced bioinformatics methods for proteomics

Linding, Jensen, Ostheimer et al., Cell, 2007

Page 27: Advanced bioinformatics methods for proteomics

NetworKIN

Page 28: Advanced bioinformatics methods for proteomics

Linding, Jensen, Ostheimer et al., Cell, 2007

Page 29: Advanced bioinformatics methods for proteomics

web interface

Page 30: Advanced bioinformatics methods for proteomics
Page 31: Advanced bioinformatics methods for proteomics

Part 2association networks

Page 32: Advanced bioinformatics methods for proteomics

guilt by association

Page 33: Advanced bioinformatics methods for proteomics
Page 34: Advanced bioinformatics methods for proteomics

STRING

Page 35: Advanced bioinformatics methods for proteomics

Szklarczyk, Franceschini et al., Nucleic Acids Research, 2011

Page 36: Advanced bioinformatics methods for proteomics

>1100 genomes

Page 37: Advanced bioinformatics methods for proteomics

computational predictions

Page 38: Advanced bioinformatics methods for proteomics

genomic context

Page 39: Advanced bioinformatics methods for proteomics

gene fusion

Page 40: Advanced bioinformatics methods for proteomics

Korbel et al., Nature Biotechnology, 2004

Page 41: Advanced bioinformatics methods for proteomics

phylogenetic profiles

Page 42: Advanced bioinformatics methods for proteomics

Korbel et al., Nature Biotechnology, 2004

Page 43: Advanced bioinformatics methods for proteomics

experimental data

Page 44: Advanced bioinformatics methods for proteomics

physical interactions

Page 45: Advanced bioinformatics methods for proteomics

Jensen & Bork, Science, 2008

Page 46: Advanced bioinformatics methods for proteomics

gene coexpression

Page 47: Advanced bioinformatics methods for proteomics
Page 48: Advanced bioinformatics methods for proteomics

curated knowledge

Page 49: Advanced bioinformatics methods for proteomics

pathways

Page 50: Advanced bioinformatics methods for proteomics

Letunic & Bork, Trends in Biochemical Sciences, 2008

Page 51: Advanced bioinformatics methods for proteomics

many databases

Page 52: Advanced bioinformatics methods for proteomics

different formats

Page 53: Advanced bioinformatics methods for proteomics

different identifiers

Page 54: Advanced bioinformatics methods for proteomics

variable quality

Page 55: Advanced bioinformatics methods for proteomics

not comparable

Page 56: Advanced bioinformatics methods for proteomics

quality scores

Page 57: Advanced bioinformatics methods for proteomics

von Mering et al., Nucleic Acids Research, 2005

Page 58: Advanced bioinformatics methods for proteomics

calibrate vs. gold standard

Page 59: Advanced bioinformatics methods for proteomics

von Mering et al., Nucleic Acids Research, 2005

Page 60: Advanced bioinformatics methods for proteomics

missing most of the data

Page 61: Advanced bioinformatics methods for proteomics

Part 3text mining

Page 62: Advanced bioinformatics methods for proteomics

>10 km

Page 63: Advanced bioinformatics methods for proteomics

too much to read

Page 64: Advanced bioinformatics methods for proteomics

computer

Page 65: Advanced bioinformatics methods for proteomics

as smart as a dog

Page 66: Advanced bioinformatics methods for proteomics

teach it specific tricks

Page 67: Advanced bioinformatics methods for proteomics
Page 68: Advanced bioinformatics methods for proteomics
Page 69: Advanced bioinformatics methods for proteomics

named entity recognition

Page 70: Advanced bioinformatics methods for proteomics

comprehensive lexicon

Page 71: Advanced bioinformatics methods for proteomics

proteins

Page 72: Advanced bioinformatics methods for proteomics

cellular components

Page 73: Advanced bioinformatics methods for proteomics

compartments.jensenlab.org

Page 74: Advanced bioinformatics methods for proteomics

tissues

Page 75: Advanced bioinformatics methods for proteomics

tissues.jensenlab.org

Page 76: Advanced bioinformatics methods for proteomics

diseases

Page 77: Advanced bioinformatics methods for proteomics

orthographic variation

Page 78: Advanced bioinformatics methods for proteomics

singular vs. plural

Page 79: Advanced bioinformatics methods for proteomics

spaces and hyphens

Page 80: Advanced bioinformatics methods for proteomics

“black list”

Page 81: Advanced bioinformatics methods for proteomics

information extraction

Page 82: Advanced bioinformatics methods for proteomics

co-mentioning

Page 83: Advanced bioinformatics methods for proteomics
Page 84: Advanced bioinformatics methods for proteomics

NLPNatural Language Processing

Page 85: Advanced bioinformatics methods for proteomics

Gene and protein names

Cue words for entity recognition

Verbs for relation extraction

[nxexpr The expression of [nxgene the cytochrome genes [nxpg CYC1 and CYC7]]]is controlled by[nxpg HAP1]

Page 86: Advanced bioinformatics methods for proteomics

summary

Page 87: Advanced bioinformatics methods for proteomics

bioinformatics

Page 88: Advanced bioinformatics methods for proteomics

more than BLAST

Page 89: Advanced bioinformatics methods for proteomics

data/text mining

Page 90: Advanced bioinformatics methods for proteomics

save you much time

Page 91: Advanced bioinformatics methods for proteomics

AcknowledgmentsNetPhorestRune Linding

Martin Lee Miller

Erwin Schoof

Francesca Diella

Claus Jørgensen

Michele Tinti

Lei Li

Marilyn Hsiung

Sirlester A. Parker

Jennifer Bordeaux

Thomas Sicheritz-Pontén

Marina Olhovsky

Adrian Pasculescu

Jes Alexander

Stefan Knapp

Nikolaj Blom

Peer Bork

Shawn Li

Gianni Cesareni

Tony Pawson

Benjamin E. Turk

Michael B. Yaffe

Søren Brunak

STRINGChristian von Mering

Damian Szklarczyk

Michael Kuhn

Manuel Stark

Samuel Chaffron

Chris Creevey

Jean Muller

Tobias Doerks

Philippe Julien

Alexander Roth

Milan Simonovic

Jan Korbel

Berend Snel

Martijn Huynen

Peer Bork

NetworKINRune Linding

Heiko Horn

Gerard Ostheimer

Martin Lee Miller

Francesca Diella

Karen Colwill

Jing Jin

Pavel Metalnikov

Vivian Nguyen

Adrian Pasculescu

Jin Gyoon Park

Leona D. Samson

Rob Russell

Peer Bork

Michael Yaffe

Tony Pawson

Text-miningSune Frankild

Evangelos Pafilis

Janos Binder

Heiko Horn

Michael Kuhn

Nigel Brown

Reinhardt Schneider

Sean O’Donoghue

Page 92: Advanced bioinformatics methods for proteomics

larsjuhljensen

Page 93: Advanced bioinformatics methods for proteomics