nature-inspired smart info systems westra: piecewise linear dynamic modeling and identification of...
TRANSCRIPT
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 1
Nature-inspired Smart Info Systems
Ronald L. Westra, Department of Mathematics
Lars Eijssen, Joyce Corvers, Department of Genetics
Maastricht University
On the identifiability of piecewise linear gene-protein networks
relative to noise and chaos
GG22
GG11
PP22
PP11
PP33
GG33
GG44
GG11
PP55
PP44
PP33
GG33
GG66
Σ1 Σ2
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 2
Nature-inspired Smart Info Systems
1. Background and problem formulation
2. Modeling and identification of gene/proteins interactions
3. The implications of stochastic fluctuations and deterministic chaos
5. Example 1: Application on artificial reaction model
5. Example 2: Application on Tyson-Novak model for fission yeast
5. Example 3: Application on fission yeast expression data
6. Conclusions
Items in this Presentation
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 3
Nature-inspired Smart Info Systems
Question: Can gene regulatory networks be reconstructed from
time series of observations of (partial) genome wide and protein
concentrations?
1. Problem formulation
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 4
Nature-inspired Smart Info Systems
Relation between mathematical model and phys-chem-biol reality
Macroscopic complexity from simple microscopic interactions
Approximate modeling as partitioned in subsystems with local
dynamics
Modeling of subsystems as piecewise linear systems (PWL)
PWL-Identification algorithms: network reconstruction from
(partial) expression and RNA/protein data
Experimental conditions of poor data: lots of gene but little data
The role of stochasticity and chaos on the identifiability
Problems in modeling and identification
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 5
Nature-inspired Smart Info Systems
2. Modeling the Interactions between Genes and Proteins
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 6
Nature-inspired Smart Info Systems
2.1 Modeling the molecular dynamics and reaction kinetics as Stochastic Differential Equations
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 7
Nature-inspired Smart Info Systems
2.2 Gene-Protein Interaction Networks as Piecewise Linear Models
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 8
Nature-inspired Smart Info Systems
2.3 Problems concerning the identifiability of PieceWise Linear models
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 9
Nature-inspired Smart Info Systems
3. The Implications of Stochastic fluctuations and Deterministic Chaos
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 10
Nature-inspired Smart Info Systems
3.1 Stochastic fluctuations
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 11
Nature-inspired Smart Info Systems
3.2 Noise-induced control in single-cell gene expression
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 12
Nature-inspired Smart Info Systems
Influence of stochastic fluctuations on the evolution of the expression of two coupled genes.
.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 13
Nature-inspired Smart Info Systems
3.3 Deterministic Chaos
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 14
Nature-inspired Smart Info Systems
4. Identification of Interactions between Genes and Proteins
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 15
Nature-inspired Smart Info Systems
4.2 The identification of PIECEWISE linear networks by L1-minimization
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 16
Nature-inspired Smart Info Systems
Gene-Protein Interaction Networks asPiecewise Linear Models
The general case is complex and approximate
Strongly dependent on unknown microscopic details
Relevant parameters are unidentified and thus unknown
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 17
Nature-inspired Smart Info Systems
2. Modeling of PWL Systems as subspace models
Global dynamics:
Local attractors (uniform, cycles, strange)
Basins of Attraction
Each BoA is a
subsystem Σi
“checkpoints”State space
Σ1
Σ2Σ3
Σ4
Σ5
Σ6
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 18
Nature-inspired Smart Info Systems
Modeling of PWL Systems as subspace models
State vector moves through state space
driven by local dynamics (attractor, repeller) and inputs
in each subsystem Σ1
the dynamics is governed by the local equilibria.
approximation of subsystem as linear statespace model:
State space
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 19
Nature-inspired Smart Info Systems
Problems concerning the identifiability of Piecewise Linear models
1. Due to the huge costs and efforts involved in the experiments, only a limited number of time points are available in the data. Together with the high dimensionality of the system, this makes the problemseverely under-determined.
2. In the time series many genes exhibit strong correlation in their time-evolution, which is not per se indicative for a strong coupling between these genes but rather induced by the over-all dynamics ofthe ensemble of genes. This can be avoided by persistently exciting inputs.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 20
Nature-inspired Smart Info Systems
Problems concerning the identifiability of Piecewise Linear models
3. Not all genes are observed in the experiment, and certainly most of the RNAs and proteins are not considered. therefore, there are many hidden states.
4. Effects of stochastic fluctuations on genes with low transcription factors are severe and will obscure their true dependencies.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 21
Nature-inspired Smart Info Systems
Such are the problems relating to the identifiability of piecewise linear systems:
Are conditions for modeling rate equations met?
High stochasticity and chaos
Are piecewise linear approximations a valid metaphor?
Problems with stochastic modeling
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 22
Nature-inspired Smart Info Systems
The identification of PIECEWISE linear networks by L1-minimization
K linear time-invariant subsystems {Σ1, Σ2, .., ΣK}Continuous/Discrete time
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 23
Nature-inspired Smart Info Systems
4.2 The identification of PIECEWISE linear networks by L1-minimization
Weights wkj indicate membership of observation #k
to subsystem Σj :
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 24
Nature-inspired Smart Info Systems
Rich and Poor data
poor data: not sufficient empirical data is available to reliably estimate all system parameters, i.e. the resulting identification problem is under-determined.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 25
Nature-inspired Smart Info Systems
(un)known switching times,regular sampling intervals,rich / poor data,
Identification of PWL models with known switching times and regular sampling intervals from rich data
Identification of PWL models with known switching times and regular sampling intervals from poor data
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 26
Nature-inspired Smart Info Systems
1. unknown switching times,regular sampling intervals,poor data, known state derivatives
This is similar to simple linear case
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 27
Nature-inspired Smart Info Systems
This can thus be written as:
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 28
Nature-inspired Smart Info Systems
with:
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 29
Nature-inspired Smart Info Systems
with:
The approach is as follows:
(i) initialize A, B, and W,
(ii) perform the iteration:1. Compute H1 and H2, using the simple linear system approach 2. Using fixed W, compute A and B,3. Using fixed A and B, compute W
until: (iii) criterion E has converged sufficiently – or a maximum number of iterations.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 30
Nature-inspired Smart Info Systems
Linear L1-criterion:
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 31
Nature-inspired Smart Info Systems
With linear L1-criterion E1 the problem can be formulated as LP-problem:
LP1: compute H1,H2 from simple linear case
LP2: A and B, using E1-criterion and extra constraints for W, H1,H2,
LP3: compute optimal weights W, using E1-criterion with constraints for W, H1,H2, A and B
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 32
Nature-inspired Smart Info Systems
2. unknown switching times,regular sampling intervals,poor data, unknown state derivatives
Use same philosophy as mentioned before
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 33
Nature-inspired Smart Info Systems
Subspace dynamics and linear L1-criterion :
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 34
Nature-inspired Smart Info Systems
System parameters and empirical data :
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 35
Nature-inspired Smart Info Systems
Quadratic Programming problem QP :
Problem: not well-posed: i.e.: Jacobian becomes zero and ill-conditioned near optimum
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 36
Nature-inspired Smart Info Systems
Therefore split in TWO Linear Programming problems:
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 37
Nature-inspired Smart Info Systems
In case of sparse interactions replace LP1 with:
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 38
Nature-inspired Smart Info Systems
Performance of robust Identification approach
Artificially produced data reconstructed with this approach
Compare reconstructed and original data
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 39
Nature-inspired Smart Info Systems
The influence of increasing intrinsic noise on the identifiability.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 40
Nature-inspired Smart Info Systems
a: CPU-time Tc as a function of the problem size N, b: Number of errors as a function of the number of nonzero entries k,
M = 150, m = 5, N = 50000.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 41
Nature-inspired Smart Info Systems
a: Number of errors versus M, b: Computation time versus M
N = 50000, k = 10, m = 0.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 42
Nature-inspired Smart Info Systems
a: Minimal number of measurements Mmin required to compute A free of error versus the problem size N,
b: Number of errors as a function of the intrinsic noise level σA
N = 10000, k = 10, m = 5, M = 150, measuring noise B = 0.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 43
Nature-inspired Smart Info Systems
Example 1: how to apply this method on current data sets
Spellman et al. data for cell-cycle of fission yeast :
Components: 6179 genes measured for 18-24 irregular time instants
Processing: fuzzy C-means, gene annotation with Go term finder and Fatigo, net recontruction with identification algorithm
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 44
Nature-inspired Smart Info Systems
Spellman et al. data for cell-cycle of fission yeast :
Processing:
Selection of most up/down-regulated genes: 3107 from 6179
Clustering: fuzzy C-means: best outcome 23 clusters
Gene annotation with Go term finder (4th level) and Fatigo, both for biological process and cellular component
Net recontruction with identification algorithm on 23 clusters
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 45
Nature-inspired Smart Info Systems
Centroids after clustering 23 clusters
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 46
Nature-inspired Smart Info Systems
Gene ontology
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 47
Nature-inspired Smart Info Systems
Gene ontologyCluster 1GO Term Finder: The genes are involved in spindle pole during the cell cycle, with relations to microtubuli and chromosomal structure.FatiGO: The main cellular component is the chromosome.
Cluster 2GO Term Finder: The genes are involved in proliferation and replications, especially bud neck and polarized growth.FatiGO: The results found by the GO Term Finder are confirmed. …………….
Cluster 22GO Term Finder: Only a few annotations are found and there are many unknown genes. The genes are involved in respiration and reproduction. The main cellular components are the actin/cortical skeleton and the mitochondrial inner membrane.FatiGO: No further clear annotations are found.
Cluster 23GO Term Finder: The genes are involved in RNA processing. The main cellular components are the nucleus, the RNA polymerase complex and the ribonucleoprotein complex.FatiGO: The main cellular component is the ribonucleoprotein complex.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 48
Nature-inspired Smart Info Systems
Reonstructed network
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 49
Nature-inspired Smart Info Systems
Example 2: artificial data of hierarchic/sparse network
Artificial reaction network with:
Components: 2 master genes with high transcription rates 3 slave genes with low transcription rates 4 agents (= RNA or proteins).
Processes: stimulation, inhibition, transcription, and reactions between ‘agents’
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 50
Nature-inspired Smart Info Systems
Dynamics:
– large hierarchic and sparse network
– implicit relation between genes with expression x
through agents (= proteins, RNA) with concentration a – system near equilibrium and small perturbations
– inputs: persistent excitation u
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 51
Nature-inspired Smart Info Systems
Dynamics:
– implicit system dynamics:
– linear statespace model makes gene interaction explicit:
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 52
Nature-inspired Smart Info Systems
Dynamics:
– estimate gene-gene interaction matrix A from empirical data:
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 53
Nature-inspired Smart Info Systems
reactions
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 54
Nature-inspired Smart Info Systems
reactions
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 55
Nature-inspired Smart Info Systems
reactions
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 56
Nature-inspired Smart Info Systems
Matlab-simulation
y(1) = - 0.03*x(1) + 0.2*(1-x(1))*a(2)^2 - 0.2*x(1)*a(3) ;y(2) = - 0.05*x(2) + 0.3*(1-x(2))*a(1) - 0.1*x(2)*a(4) ;y(3) = - 0.02*x(3) + 0.1*(1-x(3))*a(2) - 0.1*x(3)*a(1) ;y(4) = - 0.01*x(4) + 0.2*(1-x(4))*a(1)*a(2) - 0.2*x(4)*a(3)^2;y(5) = - 0.02*x(5) + 0.3*(1-x(5))*a(3) - 0.1*x(5)*a(1);y(6) = - 0.02*a(1) + 0.4*x(1) - 0.2*a(1)*a(2) - 0.1*a(1)*a(3)^3;y(7) = - 0.01*a(2) + 0.15*x(2) - 0.2*a(1)*a(2);y(8) = - 0.01*a(3) + 0.2*a(1)*a(2) - 0.1*a(1)*a(3)^3;y(9) = - 0.05*a(4) + 0.9*a(1)*a(3);
rate equations
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 57
Nature-inspired Smart Info Systems
Real network structure: implicit
2211
aa
33
44
55
dd
bb
cc
pp
gg gene
agent
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 58
Nature-inspired Smart Info Systems
Real network structure: explicit
2211
33 44 55
slave slaveslave
mastermaster
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 59
Nature-inspired Smart Info Systems
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 60
Nature-inspired Smart Info Systems
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 61
Nature-inspired Smart Info Systems
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 62
Nature-inspired Smart Info Systems
2211
33 44 55
Reconstructed network structure: low noise
master master
slave slave slave
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 63
Nature-inspired Smart Info Systems
2211
33 44 55
Reconstructed network structure: moderate noise
slave slaveslave
mastermaster
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 64
Nature-inspired Smart Info Systems
Reconstructed network structure: high noise (an example)
2211
33 44 55
slave masterslave
slavemaster
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 65
Nature-inspired Smart Info Systems
Example 3: data of Tyson-Novak math. model for cell cycle
Tyson-Novak model for cell-cycle of fission yeast :
Components: 9 agents (= RNA or proteins).
Processes: stimulation, inhibition, transcription, and reactions between ‘agents’
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 66
Nature-inspired Smart Info Systems
The deterministic Tyson-Novak model.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 67
Nature-inspired Smart Info Systems
The stochastic Tyson-Novak model.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 68
Nature-inspired Smart Info Systems
Example: stochastic Tyson-Novak model
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 69
Nature-inspired Smart Info Systems
Example: stochastic Tyson-Novak model
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 70
Nature-inspired Smart Info Systems
4.2 The identification of PIECEWISE linear networks by L1-minimization
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.
Westra: Piecewise Linear Dynamic Modeling and Identification of Gene-Protein Interaction Networks 71
Nature-inspired Smart Info Systems
5. Epilogue: Lessons from Nature
Prerequisite for the successful reconstruction of gene-protein networks is the way in which the dynamics of their interactions is modeled.