Processes and Constraints in Scientific Model Construction
![Page 1: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/1.jpg)
PROCESSES AND CONSTRAINTS IN
SCIENTIFIC MODEL CONSTRUCTION
Will Bridewell† and Pat Langley†‡
†Cognitive Systems Laboratory, CSLI, Stanford University
‡CIRCAS, Arizona State University
![Page 2: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/2.jpg)
Where Are We Going?
Introduction to inductive process modeling
Constraints in inductive process modeling
Learning constraints
![Page 3: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/3.jpg)
Inductive Process Modeling
[Diagram: Observations, Model, Predictions]
Model Objectives: Explanation and Prediction
Langley et al. 2002, ICML; Bridewell et al. 2008, ML
![Page 4: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/4.jpg)
Quantitative Process Models
• Processes
    process exponential_growth
      equations d[hare.density, t, 1] = 2.5 * hare.density
    process exponential_loss
      equations d[wolf.density, t, 1] = −1.2 * wolf.density
    process predation_holling_type_1
      equations d[hare.density, t, 1] = −0.1 * hare.density * wolf.density
                d[wolf.density, t, 1] = 0.3 * 0.1 * hare.density * wolf.density
• Ordinary Differential Equations
    d hare.density/dt = 2.5 * hare.density + −0.1 * hare.density * wolf.density
    d wolf.density/dt = −1.2 * wolf.density + 0.3 * 0.1 * hare.density * wolf.density
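The way processes add up to a set of differential equations can be sketched in Python. This is only an illustration, not the authors' implementation: each process is assumed to be a function that returns its additive contribution to the derivatives, and the helper names `derivatives` and `euler_step`, the starting densities, and the forward-Euler integrator are all assumptions made for the example.

```python
from collections import defaultdict

# Each process maps a state (variable name -> value) to its additive
# contribution to the time derivatives. Constants are taken from the slide.
def exponential_growth(state):
    return {"hare.density": 2.5 * state["hare.density"]}

def exponential_loss(state):
    return {"wolf.density": -1.2 * state["wolf.density"]}

def predation_holling_type_1(state):
    interaction = 0.1 * state["hare.density"] * state["wolf.density"]
    return {"hare.density": -interaction, "wolf.density": 0.3 * interaction}

def derivatives(processes, state):
    """Sum the contributions of all processes (hypothetical helper)."""
    total = defaultdict(float)
    for process in processes:
        for variable, rate in process(state).items():
            total[variable] += rate
    return dict(total)

def euler_step(processes, state, dt):
    """One forward-Euler step; a stand-in for a proper ODE solver."""
    rates = derivatives(processes, state)
    return {v: x + dt * rates.get(v, 0.0) for v, x in state.items()}

model = [exponential_growth, exponential_loss, predation_holling_type_1]
state = {"hare.density": 10.0, "wolf.density": 2.0}  # assumed initial values
rates = derivatives(model, state)
next_state = euler_step(model, state, dt=0.01)
```

Because the model is just a list of processes, removing or swapping one entry changes the equations without touching the others.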
![Page 5: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/5.jpg)
Advantages of Quantitative Process Models
Process models offer scientists a promising framework because:
· they embed quantitative relations within qualitative structure;
· they refer to notations and mechanisms familiar to experts;
· they provide dynamical predictions of changes over time;
· they offer causal and explanatory accounts of phenomena;
· they retain the modularity needed for induction/abduction.
Quantitative process models provide an important alternative to the formalisms currently used in computational discovery.
![Page 6: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/6.jpg)
Modularity in Quantitative Process Models
• Processes
    process exponential_growth
      equations d[hare.density, t, 1] = 2.5 * hare.density
    process exponential_loss
      equations d[wolf.density, t, 1] = −1.2 * wolf.density
    process predation_holling_type_1
      equations d[hare.density, t, 1] = −0.1 * hare.density * wolf.density
                d[wolf.density, t, 1] = 0.3 * 0.1 * hare.density * wolf.density
• Ordinary Differential Equations
    d hare.density/dt = 2.5 * hare.density + −0.1 * hare.density * wolf.density
    d wolf.density/dt = −1.2 * wolf.density + 0.3 * 0.1 * hare.density * wolf.density
![Page 7: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/7.jpg)
Modularity in Quantitative Process Models
• Processes
    process exponential_growth
      equations d[hare.density, t, 1] = 2.5 * hare.density
    process exponential_loss
      equations d[wolf.density, t, 1] = −1.2 * wolf.density
• Ordinary Differential Equations
    d hare.density/dt = 2.5 * hare.density
    d wolf.density/dt = −1.2 * wolf.density
![Page 8: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/8.jpg)
Modularity in Quantitative Process Models
• Processes
    process exponential_growth
      equations d[hare.density, t, 1] = 2.5 * hare.density
    process exponential_loss
      equations d[wolf.density, t, 1] = −1.2 * wolf.density
    process predation_holling_type_2
      equations d[hare.density, t, 1] = −0.1 * hare.density * wolf.density / (1 + 0.2 * −0.1 * hare.density)
                d[wolf.density, t, 1] = 0.3 * 0.1 * hare.density * wolf.density / (1 + 0.2 * −0.1 * hare.density)
• Ordinary Differential Equations
    d hare.density/dt = 2.5 * hare.density + −0.1 * hare.density * wolf.density / (1 + 0.2 * −0.1 * hare.density)
    d wolf.density/dt = −1.2 * wolf.density + 0.3 * 0.1 * hare.density * wolf.density / (1 + 0.2 * −0.1 * hare.density)
![Page 9: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/9.jpg)
Generic Processes
    generic process predation_Holling_1
      entities P1{prey}, P2{predator}
      parameters r[0, infinity], e[0, infinity]
      equations d[P1.density, t, 1] = −1 * r * P1.density * P2.density
                d[P2.density, t, 1] = e * r * P1.density * P2.density
![Page 10: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/10.jpg)
Generic Processes
    generic process predation_Holling_1
      entities P1{prey}, P2{predator}
      parameters r[0, infinity], e[0, infinity]
      equations d[P1.density, t, 1] = −1 * r * P1.density * P2.density
                d[P2.density, t, 1] = e * r * P1.density * P2.density
Instantiation
    P1: hare    P2: wolf
    r: 0.1      e: 0.3
![Page 11: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/11.jpg)
Generic Processes
    generic process predation_Holling_1
      entities P1{prey}, P2{predator}
      parameters r[0, infinity], e[0, infinity]
      equations d[P1.density, t, 1] = −1 * r * P1.density * P2.density
                d[P2.density, t, 1] = e * r * P1.density * P2.density
Instantiation
    P1: hare    P2: wolf
    r: 0.1      e: 0.3
    process wolves_eat_hares
      equations d[hare.density, t, 1] = −1 * 0.1 * hare.density * wolf.density
                d[wolf.density, t, 1] = 0.3 * 0.1 * hare.density * wolf.density
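Instantiating a generic process amounts to binding its entity roles and fixing its parameters within their bounds. The Python sketch below illustrates that idea under assumptions of its own: the `GenericProcess` class, its `instantiate` method, and the test state are hypothetical, and the check of entity types ({prey}, {predator}) is left out for brevity.

```python
from dataclasses import dataclass
from math import inf
from typing import Callable, Dict, Tuple

@dataclass
class GenericProcess:
    """Hypothetical container for a generic process."""
    name: str
    roles: Tuple[str, ...]                    # entity roles, e.g. ("P1", "P2")
    bounds: Dict[str, Tuple[float, float]]    # parameter name -> (low, high)
    equations: Callable[..., Dict[str, float]]

    def instantiate(self, entities, params):
        """Bind roles to entities and parameters to values."""
        if set(entities) != set(self.roles):
            raise ValueError("every role needs exactly one entity")
        for pname, (low, high) in self.bounds.items():
            if not low <= params[pname] <= high:
                raise ValueError(f"parameter {pname} is out of bounds")
        return lambda state: self.equations(entities, params, state)

def holling_1_equations(entities, params, state):
    prey = entities["P1"] + ".density"
    predator = entities["P2"] + ".density"
    flow = params["r"] * state[prey] * state[predator]
    # d[P1.density] = -1 * r * P1 * P2 ; d[P2.density] = e * r * P1 * P2
    return {prey: -1 * flow, predator: params["e"] * flow}

predation_holling_1 = GenericProcess(
    name="predation_Holling_1",
    roles=("P1", "P2"),
    bounds={"r": (0.0, inf), "e": (0.0, inf)},
    equations=holling_1_equations,
)

wolves_eat_hares = predation_holling_1.instantiate(
    {"P1": "hare", "P2": "wolf"}, {"r": 0.1, "e": 0.3}
)
rates = wolves_eat_hares({"hare.density": 10.0, "wolf.density": 2.0})
```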
![Page 12: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/12.jpg)
The IPM System (a naive approach)
• Given:
  - A library of generic entities and processes
  - Instantiated entities
  - Data
1. Ground the generic processes with instantiated entities
2. Generate all combinations of the ground processes
3. Fit the numeric parameters of each structure
• Output: The best models based on fit to the data
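The grounding and enumeration steps of the naive approach can be sketched as follows. The small library, the entity types, and the function names are hypothetical, and parameter fitting is omitted because the slides do not say how it is done.

```python
from itertools import combinations, permutations

# Hypothetical library: generic process name -> entity types of its roles.
generic_processes = {
    "exponential_growth": ("prey",),
    "exponential_loss": ("predator",),
    "predation_holling_1": ("prey", "predator"),
    "predation_holling_2": ("prey", "predator"),
}
# Instantiated entities and their types.
entities = {"hare": "prey", "wolf": "predator"}

def ground(generic_processes, entities):
    """Bind every generic process to all type-compatible entity tuples."""
    grounded = []
    for name, role_types in generic_processes.items():
        for binding in permutations(entities, len(role_types)):
            if all(entities[e] == t for e, t in zip(binding, role_types)):
                grounded.append((name, binding))
    return grounded

def all_structures(grounded):
    """Yield every subset of the ground processes as a candidate structure."""
    for size in range(len(grounded) + 1):
        for subset in combinations(grounded, size):
            yield subset

grounded = ground(generic_processes, entities)
structures = list(all_structures(grounded))
print(len(grounded), "ground processes,", len(structures), "candidate structures")
```

The number of candidate structures doubles with each additional ground process, so exhaustive enumeration quickly becomes expensive as the library grows.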
![Page 13: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/13.jpg)
Applications
• Aquatic Ecosystems
• Fjord Dynamics
• also biochemical kinetics, protist interactions, photosynthesis
See Bridewell et al. 2008, Machine Learning, 71, 1–32
![Page 14: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/14.jpg)
Life After IPM
Early versions of inductive process modeling systems:
• help scientists formalize their modeling knowledge;
• let scientists consider several alternative models;
• reduce some of the drudgery of model construction;
• speed exploration and evaluation.
However, IPM produces several structurally implausible models, some of which account quite well for the data.
![Page 15: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/15.jpg)
Model Constraints
Constraints on the structure of models:
• eliminate implausible models;
• reduce the size of the search space;
• make complex domains tractable;
• improve model accuracy during incomplete search.
HIPM, Todorovski et al. AAAI-05
Structural constraints differ from constraints on model behavior most importantly because they do not require simulation.
![Page 16: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/16.jpg)
SC-IPM Constraints: Necessary
Specifies Required Processes
    Name: Nutrient-Replenishment
    Type: necessary
    Processes: nutrient_mixing(N), remineralization(N, _)
P = primary producer, G = grazer, N = nutrient
![Page 17: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/17.jpg)
SC-IPM Constraints: Always-Together
All or None
    Name: Growth-Limitation
    Type: always-together
    Processes: limited(P), nutrient_limitation(P, N)
P = primary producer, G = grazer, N = nutrient
![Page 18: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/18.jpg)
SC-IPM Constraints: Exactly-One
Mutual Exclusion
    Name: Growth-Alternatives
    Type: exactly-one
    Processes: exponential(P), logistic(P), limited(P)
P = primary producer, G = grazer, N = nutrient
![Page 19: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/19.jpg)
SC-IPM Constraints: At-Most-One
Enables Optional Processes
    Name: Optional-Grazing
    Type: at-most-one
    Processes: holling_1(P, G), holling_2(P, G), holling_3(P, G)
P = primary producer, G = grazer, N = nutrient
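The four constraint types can be read as simple tests on the set of processes in a candidate model. The sketch below follows the slide captions (required, all or none, mutual exclusion with one required, optional). It assumes that `necessary` means every listed process must be present, and the entity names `phyto`, `zoo`, and `nitrate`, along with the binding chosen for the `_` wildcard, are hypothetical.

```python
def necessary(listed, model):
    """Assumed reading: every listed process must appear in the model."""
    return all(p in model for p in listed)

def always_together(listed, model):
    """All listed processes appear, or none of them do."""
    present = [p in model for p in listed]
    return all(present) or not any(present)

def exactly_one(listed, model):
    """Exactly one of the listed processes appears."""
    return sum(p in model for p in listed) == 1

def at_most_one(listed, model):
    """Zero or one of the listed processes appears."""
    return sum(p in model for p in listed) <= 1

# The four example constraints, grounded with hypothetical entities.
constraints = [
    (necessary, ["nutrient_mixing(nitrate)", "remineralization(nitrate,phyto)"]),
    (always_together, ["limited(phyto)", "nutrient_limitation(phyto,nitrate)"]),
    (exactly_one, ["exponential(phyto)", "logistic(phyto)", "limited(phyto)"]),
    (at_most_one, ["holling_1(phyto,zoo)", "holling_2(phyto,zoo)",
                   "holling_3(phyto,zoo)"]),
]

def satisfies_all(model):
    return all(check(listed, model) for check, listed in constraints)

plausible = {
    "nutrient_mixing(nitrate)", "remineralization(nitrate,phyto)",
    "limited(phyto)", "nutrient_limitation(phyto,nitrate)",
    "holling_2(phyto,zoo)",
}
# Adding a second growth process violates the exactly-one constraint.
implausible = plausible | {"logistic(phyto)"}
print(satisfies_all(plausible), satisfies_all(implausible))
```

None of these checks needs a simulation; they inspect model structure only.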
![Page 20: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/20.jpg)
The SC-IPM System
1. Ground the generic processes with instantiated entities.
2. Treat ground processes as Boolean literals.
3. Conjoin the individual constraints.
4. Rewrite the constraints in conjunctive normal form.
5. Apply a SAT solver (e.g., DPLL, WalkSAT).
6. Instant model structure!
7. Fit parameters, etc.
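Steps 2 through 6 can be illustrated with a small Python sketch. It numbers nine ground processes as Boolean variables, encodes the four constraint types as CNF clauses, and enumerates the satisfying structures with plain backtracking search. The search is a simplified stand-in for a real SAT solver (no unit propagation or other DPLL refinements), and the clause encodings and variable numbering are assumptions made for the example, not the system's actual encoding.

```python
from itertools import combinations

# Step 2: ground processes as Boolean variables (positive integers).
names = {1: "nutrient_mixing", 2: "remineralization", 3: "limited",
         4: "nutrient_limitation", 5: "exponential", 6: "logistic",
         7: "holling_1", 8: "holling_2", 9: "holling_3"}

# Step 4: each constraint type as CNF clauses (lists of signed integers).
def cnf_necessary(lits):
    return [[l] for l in lits]

def cnf_at_most_one(lits):
    return [[-a, -b] for a, b in combinations(lits, 2)]

def cnf_exactly_one(lits):
    return [list(lits)] + cnf_at_most_one(lits)

def cnf_always_together(lits):
    # Cyclic implication chain l1 -> l2 -> ... -> ln -> l1.
    return [[-a, b] for a, b in zip(lits, lits[1:] + lits[:1])]

# Step 3: conjoin the individual constraints.
clauses = (cnf_necessary([1, 2]) + cnf_always_together([3, 4])
           + cnf_exactly_one([5, 6, 3]) + cnf_at_most_one([7, 8, 9]))

def simplify(clauses, literal):
    """Set literal to true: drop satisfied clauses, shrink the others."""
    return [[l for l in c if l != -literal] for c in clauses if literal not in c]

def all_models(clauses, variables, assignment=()):
    """Step 5 stand-in: backtracking search that yields every solution."""
    if any(len(c) == 0 for c in clauses):
        return  # an empty clause means this branch is contradictory
    if not variables:
        yield {names[l] for l in assignment if l > 0}
        return
    v, rest = variables[0], variables[1:]
    for literal in (v, -v):
        yield from all_models(simplify(clauses, literal), rest,
                              assignment + (literal,))

# Step 6: every satisfying assignment is a candidate model structure.
structures = list(all_models(clauses, list(names)))
print(len(structures), "of", 2 ** len(names), "structures satisfy the constraints")
```

Each yielded set of process names would then go on to parameter fitting (step 7).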
![Page 21: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/21.jpg)
Advantages of SC-IPM
SC-IPM adds several powerful features to IPM, such as:
• constraints that limit the consideration of implausible models;
• constraint modularity that eases control of the search space.
The constraints used by SC-IPM typically come from a scientist’s implicit knowledge, and we can both elicit them through examples and learn them computationally.
![Page 22: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/22.jpg)
Learning Constraints
Goal: Identify implicit or unknown constraints to use in future modeling tasks.
Plan: Analyze the space of model structures. Use machine learning techniques to help.
Key Idea: Don’t throw away any models. Even the bad ones contain valuable information.
Bridewell & Todorovski 2007, ILP and KCAP
![Page 23: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/23.jpg)
Learning Constraints
1. Build and parameterize process models
2. Store the models for analysis
3. Formally describe the structure of the models
4. Identify good and bad models
5. Use ILP to generate descriptions of accurate and inaccurate model structures
6. Convert the descriptions into SC-IPM constraints
We chose Aleph by Ashwin Srinivasan due to its ready availability and capabilities.
![Page 24: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/24.jpg)
Good and Bad Models
1996–1997 Ross Sea
[Figure panels: Good, Bad]
![Page 25: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/25.jpg)
Extracted Constraints
• A model that includes a second-order exponential mortality process for phytoplankton will be inaccurate. (positive: 560, negative: 0)
• A model that includes the Lotka–Volterra grazing process will be inaccurate. (positive: 80, negative: 0)
• A model that lacks both the first and second order Monod growth limitation process between iron and phytoplankton will be inaccurate. (positive: 448, negative: 0)
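The positive and negative counts attached to each rule can be illustrated with a toy coverage calculation. This is not Aleph or ILP; it only shows how a candidate rule about model structure would be scored, assuming the counts denote covered inaccurate (positive) and accurate (negative) models. The stored models, error values, and threshold below are invented for the example.

```python
# Hypothetical stored models: (set of process names, error on the data).
models = [
    ({"logistic", "holling_2"}, 0.05),
    ({"logistic", "lotka_volterra"}, 0.40),
    ({"exponential", "lotka_volterra"}, 0.55),
    ({"exponential", "holling_2"}, 0.08),
    ({"logistic"}, 0.30),
]

def label_inaccurate(models, threshold):
    """Mark a model as inaccurate when its error exceeds the threshold."""
    return [(structure, error > threshold) for structure, error in models]

def coverage(rule, labeled):
    """Count inaccurate (positive) and accurate (negative) models the rule covers."""
    positive = sum(1 for s, bad in labeled if bad and rule(s))
    negative = sum(1 for s, bad in labeled if not bad and rule(s))
    return positive, negative

labeled = label_inaccurate(models, threshold=0.2)  # assumed threshold

def includes_lotka_volterra(structure):
    return "lotka_volterra" in structure

def lacks_grazing(structure):
    return not ({"holling_2", "lotka_volterra"} & structure)

print(coverage(includes_lotka_volterra, labeled))
print(coverage(lacks_grazing, labeled))
```

A rule that covers many inaccurate models and no accurate ones is a candidate for conversion into a constraint that excludes the matching structures.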
![Page 26: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/26.jpg)
Apply Constraints to Other Problems
Ross Sea Across Years
• Search Spaces: 9x–16x smaller
• Model Distribution: more accurate
![Page 27: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/27.jpg)
Apply Constraints to Other Domains
Ross Sea to Bled Lake
Bridewell & Todorovski AAAI-08 (Transfer Learning Workshop)
![Page 28: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/28.jpg)
Related Work
Other quantitative modelers
• LAGRAMGE (Todorovski & Dzeroski)
• PRET (Bradley & Stolle)
Metalearning and others
• Learning Constraint Networks via Version Spaces (Bessiere et al.)
• Relational Clichés (Silverstein & Pazzani; Morin & Matwin)
• Mode Declarations in ILP (McCreath & Sharma)
• Rule Reliability from Prior Performance (Mark Reid)
![Page 29: Processes and Constraints in Scientific Model Construction](https://reader030.vdocuments.mx/reader030/viewer/2022032612/56813399550346895d9aa882/html5/thumbnails/29.jpg)
Future Directions
We are currently working in several directions, which include:
• continuing the analysis of constraint transfer;
• closing the automated modeling + constraint learning loop;
• basing new analyses and methodologies on model ensembles;
• adapting the general strategies to other tasks;
• supporting other modeling paradigms.
Inductive process modeling is a fruitful paradigm for exploring knowledge representation, modeling, discovery, and creativity in scientific practice.