efficient algorithms for the contextualization of

40
Thomas Sauter System Biology group University of Luxembourg Efficient algorithms for the contextualization of molecular network models 1. Introduction: Network modeling / Systems Biology 2. Specific metabolic network models with fastcore / fastcormics 3. Specific signalling network models with optPBN / Falcon

Upload: others

Post on 18-Dec-2021

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Efficient algorithms for the contextualization of

Thomas SauterSystem Biology group

University of Luxembourg

Efficient algorithmsfor the contextualization

of molecular network models

1. Introduction: Network modeling / Systems Biology

2. Specific metabolic network models with fastcore / fastcormics

3. Specific signalling network models with optPBN / Falcon

Page 2: Efficient algorithms for the contextualization of

[S. Subramaniam, 2004, Jones, Research Trends, 2010Su, sulab.org, 2013;Gilbert et al, Stand. Genomic Sci., 2012]

“We are witnessing the emergence of

the "data rich” era in biology…”

Page 3: Efficient algorithms for the contextualization of

“The bottleneck … has shifted from generating

the data to interpreting results so as

to derive insights into biological mechanisms.”

Page 4: Efficient algorithms for the contextualization of

[O. Wolkenhauer, Front Physiol, 2014, “Why model?”]

S

R

P+

+ +

“We cannot understand complex systems without modeling.”

“The role of the model is to make something complex intelligible or

understandable.”

Page 5: Efficient algorithms for the contextualization of

“Why model?

The role of the model is to make something complex intelligible or

understandable.”

[O. Wolkenhauer, Front Physiol, 2014, “Why model?”]

Page 6: Efficient algorithms for the contextualization of

[O. Wolkenhauer, Front Physiol, 2014, “Why model?”]

“Why model?

The role of the model is to make something complex intelligible or

understandable.”

Page 7: Efficient algorithms for the contextualization of

“Systems biology is the science that studies how biological function

emerges from the interactions between the components of living

systems and how these emergent properties enable and constrain the

behavior of those components.”

[O. Wolkenhauer, Front Physiol, 2014, “Why model?”]

S

R

P+

+ +

Example Network Motif: Function?

Page 8: Efficient algorithms for the contextualization of

Systems approach / Function:

Output behavior as a function of input and time

S

R

P+

+ +

(System) RS

Page 9: Efficient algorithms for the contextualization of

Output as a function of input and time

S

R

0 10 200

0.2

0.4

0.6

0.8

1

time

S

0 10 200

0.2

0.4

0.6

0.8

1

time

R

Page 10: Efficient algorithms for the contextualization of

Output as a function of input and time

S

R+

0 10 200

0.2

0.4

0.6

0.8

1

time

S

0 10 200

5

10

15

time

R

Page 11: Efficient algorithms for the contextualization of

Output as a function of input and time

S

R+

0 10 200

0.2

0.4

0.6

0.8

1

time

S

0 10 200

0.2

0.4

0.6

0.8

1

time

R

Page 12: Efficient algorithms for the contextualization of

Output as a function of input and time

S

R

P

+ +

0 10 200

0.2

0.4

0.6

0.8

1

time

0 10 200

0.2

0.4

0.6

0.8

1

time

S R

Page 13: Efficient algorithms for the contextualization of

Output as a function of input and time

S

R

P+

+

0 10 200

0.2

0.4

0.6

0.8

1

time

S

0 10 200

0.2

0.4

0.6

0.8

1

time

R

0 10 200

0.2

0.4

0.6

0.8

1

time

P

Page 14: Efficient algorithms for the contextualization of

Output as a function of input and time

S

R

P+

+ +

0 10 20 300

0.2

0.4

0.6

0.8

1

time

S

0 10 20 300

0.5

1

time

R

Page 15: Efficient algorithms for the contextualization of

0 50 1000

0.2

0.4

0.6

0.8

1

time

S

0 50 1000

0.5

1

time

R

Function:

Change detection / Adaptation

(System) RS

Page 16: Efficient algorithms for the contextualization of

Systems approach:

Second example: Different wiring

S

R

P+

+ +(AND)

Input S

P

Output R

Function: Filter – removes short inputs & act upon long inputs

Page 17: Efficient algorithms for the contextualization of

Structure: Feedforward loops

incoherent: coherent:

S

R

P+

+

Functions:

Change detection Short input filter

S

R

P+

+ +

Page 18: Efficient algorithms for the contextualization of

Model based Data Integration and

Analysis of Disease specific Networks

Data integration Network analysis

Page 19: Efficient algorithms for the contextualization of

Model based Data Integration and

Analysis of Disease specific Networks

Data integration Network analysis

• Omics data

visualization

• Metabolic network reconstruction

• Signalling network

curation

• Gene regulatory networks from epigenetic data

• Data mining / machine learning

• Disease specificmetabolic and signalling networks

• Drug targetidentification

• Multi-scalemodelling concepts

optPBN3,4

FALCON5,7

1 Nucleic Acids Research 2013, 1;42(3):1474-96

2 PLoS Comput Biol. 2014, Jan;10(1)

3 Cell Commun Signal. 2013, Jul 1;11:46

4 PLoS One. 2014 Jul 1;9(7):e98001

5 Bioinformatics 2017 1;33(21):3431-3436

6 Nature Protocols, accepted

7 Frontiers Physiol 2018, 9:550

8-11 in prep, subm, under rev

EPIC-DREM10

IDARE1,8

Fastcore2,6,9

Fastcomp11

Page 20: Efficient algorithms for the contextualization of

Metabolic models are

scaffolds for the integration of omics data

FASTCORE: Vlassis, Pires Pacheco & Sauter, 2014, Plos Comp Biol

FASTCORMICS: Pires Pacheco et al, 2015, BMC genomics

Benchmarking: Pires Pacheco et al, 2016, Front Physiol

FASTCORMICS RNAseq: Pires Pacheco et al, in preparation

In COBRA toolbox: Nature Protocols, accepted

Page 21: Efficient algorithms for the contextualization of

S2 -2 0 0 0 0 0 1 -1 -1 0 0 0 0 1 0 0 -1 0 0 0 1 -1 0

Mathematical

representation:

S-matrix

A B

Cv1 v2

v3

v4 D

v5

Metabolic network

Constraints

Bounds: vmax, v>0...

Balances : S*v =0

A

B

C

D

v1 v2 v3 v4 v5 v6

2

Fast Reconstruction of Compact Context-

Specific Metabolic Network Models:

Constraint Based Modelling

Page 22: Efficient algorithms for the contextualization of

1) Core reactions are known to be expressed(red) in the context of interest.

2) Core reactions are forced to carry a non-zero flux by the maximization of an approximation of the cardinality function

3) The inclusion of non-core (blue) reactions is minimized by a L1-regularization

The FASTCORE family* principle

FASTCORE: Vlassis, Pires Pacheco & Sauter, 2014, Plos Comp Biol

FASTCORMICS: Pires Pacheco et al, 2015, BMC genomics

Benchmarking: Pires Pacheco et al, 2016, Front Physiol

FASTCORMICS RNAseq: Pires Pacheco et al, in preparation

In COBRA toolbox: Nature Protocols, accepted

Page 23: Efficient algorithms for the contextualization of

Criteria:

• computational demands

• accuracy

• Resolution power

(ability to capture metabolic variations between two context i.e. cell types)

Data used:

• Artificial data

• Real data

Strategies used:

• Comparison to independent references (Human Protein atlas,

HepatoNet, etc.)

• Cross-validations

• Functionalities

Benchmarking of the FASTCORE family

Pires Pacheco, Pfau & Sauter, 2016, Front Physiol.

Page 24: Efficient algorithms for the contextualization of

• Metabolic rewiring affects others hallmarks of cancer

• Specifically targeting cancer metabolism allows to reduce toxicity (on other cell types)

Cancer cells have an altered metabolism

that can be targeted

Page 25: Efficient algorithms for the contextualization of

• TCGA dataset contains 10005 RNA-seq samples across 24 cancer types

• All 10005 models were built in less than 5 days

Breast

Lung

Kidney

Brain

Colorectal

UterusThyroid

Prostate

Head And Neck

Skin

Bladder

Liver

Stomach

Ovary

Cervix

Adrenal Gland

Soft TissuePancreas

Other

10005 models were built using the

FASTCORMICS RNA-seq workflow

FASTCORMICS RNAseq: Pires Pacheco et al, in preparation

Page 26: Efficient algorithms for the contextualization of

Jaccard

Sim

ilari

ty I

nd

ex

KICH=Kidney Chromophobe

KIRC= Kidney renal clear cell carcinoma

KIRP=Kidney renal papillary cell carcinoma

FASTCORMICS RNAseq: Pires Pacheco et al, in preparation

FASTCORMICS RNA-seq captures metabolic

variation between different cancer types

Page 27: Efficient algorithms for the contextualization of

Pe

rce

nta

ge o

f a

cti

ve

re

ac

tio

ns in

a p

ath

wa

y

Cancer

coreHealthy

core107 10118

Cancer-cells have a more compact

metabolism

• Cancer cells contain more essential genes = more potential drug targets

FASTCORMICS RNAseq: Pires Pacheco et al, in preparation

Page 28: Efficient algorithms for the contextualization of

Application 2:

Identification of potential drug targets for drug repositioning

Page 29: Efficient algorithms for the contextualization of

in silico gene essentiality analysis

DRUG Databases

Selection of

potential non-

cancer drugs to

treat colorectal

cancer

• 107 essential genes

• 17 drugs without anticancer reference

Gene essentiality analysis and datamining to

find potential drug targets

FASTCORMICS RNAseq: Pires Pacheco et al, in preparation

• 17 drug targets

• 43 unique drugs for 17 targets

Page 30: Efficient algorithms for the contextualization of

Healthy (CCD18-CO – colonic fibroblast) vs. Cancer (T18 – primary) cells

T18

CCD18-CO

- 24 hour time interval because of missing time points for CCD18-CO cells

- Graphs show representative experiment in techn. triplicates +/- SD, experiments were repeated 3x (CCD18) and 10x (T18)

Page 31: Efficient algorithms for the contextualization of

• Refine the workflow to capture metabolic variation between patients sub-groups and to predict tailored treatments

FASTCORE: Vlassis, Pires Pacheco & Sauter, 2014, Plos Comp Biol

FASTCORMICS: Pires Pacheco et al, 2015, BMC genomics

FASTCORMICS RNAseq: Pires Pacheco et al, in preparation

In COBRA toolbox: Nature Protocols, accepted

Outlook

Page 32: Efficient algorithms for the contextualization of

Computational model based

data integration and

network analysis

Data integration Network analysis

• Omics data integration &

Visualization

• Compact network reconstruction

• Network curationusing (probabilistic) Boolean modelling

• Metabolic networks

• Signaling networks

• SensitivityAnalysis / Earlywarning

IDARE1

Fastcore2,7

optPBN3,5

FALCON6

DyGloSA4

1 Nucleic Acids Research 2013, 1;42(3):1474-96

2 PLoS Comput Biol. 2014, Jan;10(1)

3 Cell Commun Signal. 2013, Jul 1;11:46

4 PLoS One. 2013, Dec 18;8(12):e82973

5 PLoS One. 2014 Jul 1;9(7):e98001

6 Bioinformatics 2017 1;33(21):3431-3436

7 Nature Protoc., accepted

Page 33: Efficient algorithms for the contextualization of

DeLandtsheer et al. (2017) FALCON: A Toolbox for the Fast Contextualisation of Logical Networks.

Bioinformatics. 2017 Jun 29. doi: 10.1093/bioinformatics/btx380.

Page 34: Efficient algorithms for the contextualization of

Pipeline Structure: Optimization

DeLandtsheer et al. (2017) FALCON: A Toolbox for the Fast Contextualisation of Logical Networks.

Bioinformatics. 2017 Jun 29. doi: 10.1093/bioinformatics/btx380.

Page 35: Efficient algorithms for the contextualization of

Sums of activating signals Sums of inhibiting signals

𝑗+=1

𝑚

𝑘𝑗+(𝑖)= 1 0 ≤

𝑗−=1

𝑙

𝑘𝑗−(𝑖)≤ 1 .

𝑿𝒕(𝒊)=

𝒋+=𝟏

𝒎

𝒌𝒋+𝒊𝑷𝒂(𝑿 𝒊 )𝒕−𝟏

(𝒋+) ∗ 𝟏 −

𝒋−=𝟏

𝒍

𝒌𝒋−𝒊𝑷𝒂(𝑿 𝒊 )𝒕−𝟏

𝒋−

Biological equivalent Graphical form Algebraic computation

Activation A → Z (k) Zt+1 = At * k

Inhibition A -| Z (k) Zt+1 = 1 – (At * k)Complex formation A AND B → Z (k) Zt+1 = At * Bt * k

Competitive interaction A OR B → Z (k) Zt+1 = 1 – [ (1-At) * (1-Bt) * k]

Non-competitiveinteraction

A → Z (k1)B → Z (k2)

Zt+1 = At * k1 + Bt * k2(with k1 + k2 = 1)

𝑥

𝑥𝑎𝑛

𝑥𝑎1

𝑥𝑎2

𝑥𝑏𝑚

𝑥𝑏1

𝑥𝑏2

𝑤𝑎1

𝑤𝑏2𝑤𝑎2

𝑤𝑎𝑛

𝑤𝑏1

𝑤𝑏𝑚k

k

k

k

k

k

Concept

DeLandtsheer et al. (2017) FALCON: A Toolbox for the Fast Contextualisation of Logical Networks.

Bioinformatics. 2017 Jun 29. doi: 10.1093/bioinformatics/btx380.

Page 36: Efficient algorithms for the contextualization of

Typical study:

- WB dataset

- Different cell lines

- Different drugs

- Putative cross-talks

- Training on single

perturbations

- Predicting

combined

perturbations

FALCON: cancer specific PDGF signallingWith: P. Trairatphisan, M. Wiesinger, S. Haan, J. Pang, A. Mizera

Trairatphisan et al, PLoS One, 2014

Page 37: Efficient algorithms for the contextualization of

FALCON: cancer specific PDGF signallingWith: P. Trairatphisan, M. Wiesinger, S. Haan, J. Pang, A. Mizera

Page 38: Efficient algorithms for the contextualization of

FALCON: cancer specific PDGF signallingWith: P. Trairatphisan, M. Wiesinger, S. Haan, J. Pang, A. Mizera

Trairatphisan et al, PLoS One, 2014

Page 39: Efficient algorithms for the contextualization of

Our modelling framework is capable of

contextualizing logical networks with

phosphoproteomics data◦ Lower fitting costs

◦ Very fast

The low computational costs allow for:◦ Larger networks and big data

◦ Systems Analysis:

Parameter localization

Identifiability

KO simulations

Cell line comparison

Discussion & Conclusion FALCON

Page 40: Efficient algorithms for the contextualization of

AcknowledgementsL. Sinkkonen, P. Nguyen, T. Pfau, M. Galhardo, Y. Gui, P. Lucarelli,

P. Trairatphisan, M. Pires Pacheco, D. Gerard, S. Nickels, P. Martins Conde,

S. De Landtsheer, M. Albrecht, M. Wiesinger, T. Bintener, J. Becker, M. Gavriil, J. Ohnmacht

U Luxembourg: E. Lettelier, S. Haan, J.-L. Bueb, A. Mizera, J. Pang, E. Reckinger-Schaffner

LIH Luxembourg: P. Nazarov, L. Vallar, F. He, F. Azuaje

LCSB Luxembourg: N. Vlassis, J. Schwamborn, J. Schneider, R. Halder, R. Fleming, A. del Sol

U Dresden: D. Kulms Institute Curie Paris: A. Zinovyev

MPI Saarbrücken: M. Schulz CNRS Paris: L. Pauleve

U Kuopio: M. Heinäniemi, M. Liivrand King’s College London: M. Tavassoli

U Lorraine: T. Bastogne, S. Dobra U Bordeaux: G. Sciume

U Stuttgart: J. Witt, S. Barisic, R. Schlatter, S. Henkel, R. Feuer, M. Ederer, O. Sawodny

U Heidelberg / Synlab Holding GmbH: I. Gergei, W. März

Merrimack Pharmaceuticals: (B. Schöberl), A. Raue

Bayer Technology Services: T. Eissing, L. Küpfer

optimata Ltd: Y. Kogan, Z. Agur

ProtATonce: L. Alexopoulos

contact: [email protected]