multi-scale molecular data visualizationprevious techniques are tailored for single-instance...

53
Multi-Scale Molecular Data Visualization Ivan Viola, TU Wien, Austria

Upload: others

Post on 10-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Multi-Scale Molecular Data VisualizationIvan Viola, TU Wien, Austria

Page 2: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

2

Illustration from The Machinery of Life by David S. Goodsell

2 µm ~ 15 billion atoms HIV

Page 3: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Multi-Scale Structural Visualization

Page 4: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

cellPack: Modeling Mesoscale Structure

4

HIV ~20 M atoms

Ivan Viola

[Johnson et al. 2014]

Page 5: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Scripps’ cellPack team created

large models of HIV virus,

Mycoplasma is being developed

Models represented as meshes

little utilization of semantics

slow rendering performance

For interactive display rates, models

have to be semantically organized

Protein Databank (PDB.org) stores

atomic molecular structures and molecular complexes

Meshes are replaced by instances of PDB entries

GPUs are utilized to generate structure using the

geometry and tessellation shaders5

Atomistic Structural Models

Ivan Viola

[Johnson et al. 2014]

Page 6: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

E.Coli - 15 billion C/N/O atoms

6Ivan Viola

Illustrations from The Machinery of Life by David S. Goodsell

Page 7: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Instancing and Impostor Rendering

7

One molecule is represented by

point position, orientation, type

All the atoms emitted from refe-

rence point for every draw call

Impostors used instead of

triangulated spheres

GPUVertex Shader

Geometry / Tesselation

Shader

Fragment Shader

Display

Ivan Viola

Page 8: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Heuristic Level-of-Detail Representation

LOD heuristics sorts atoms from the center of

gravity of the molecule

With increasing distance from the molecule

Skips every k atoms while preserving shape

Increases atom size with distance

8

3523 1761 1174 880

Level 1 Level 2 Level 3 Level 4

Ivan Viola

Page 9: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Clustering Level of Detail Representation

Perform clustering as preprocessing

K-means, Affinity Propagation, Hierarchical

9Ivan Viola

Page 10: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Genome Structure

10Ivan Viola

Page 11: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Nucleic acid strand is defined through

a cubic spline

Reparametrization of interpolation

points to define one revolution

segment

Computation of normal with smooth

variation

Tessellation shader creates pivots for

each for nucleotide base pair

Each pivot’s normal is twisted in the

tangent plane to form helix

The base pair atoms are emitted

and rendered11

Approach for Nucleic Acids

Ivan Viola

Page 12: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Temporal coherence first renders all visible

molecules from the previous rendering pass

Hierarchical Z-Buffer and molecule BB tests

for visibility of previously invisible molecules

Occlusion Culling

12Ivan Viola

Page 13: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Hierarchical Z-Buffer

Ivan Viola 13

http://rastergrid.com/blog/2010/10/hierarchical-z-map-based-occlusion-culling/

Page 14: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

14

15x109 Model Demo

Ivan Viola

Page 15: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

15

Mycoplasma DNA Model Demo

Ivan Viola

Page 16: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

cellView Demo

16Ivan Viola

Page 17: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Multi-Instance Cutaway Visualization

Densely packed data leads

to a massive occlusion

problem

Previous techniques are

tailored for single-

instance scenarios

Occlusion management

can benefit from multi-

instance nature of the data

Which instance to show /

eliminate from visualization

is an optimization problem

17Ivan Viola

Page 18: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Ivan Viola 18

Page 19: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Visibility Equalizers

19Ivan Viola

Page 20: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Visibility Equalizers

20Ivan Viola

Page 21: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

21

Single-Scale Color Mapping Problem

Ivan Viola

Page 22: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Ivan Viola 22

Multi-Scale Color Mapping Approach

Page 23: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

23

Multi-Scale Color Mapping

Ivan Viola

Page 24: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Multi-Scale Temporal Visualization

Page 25: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Reaction A + B → C

Modeling Physiology

Biological pathway describes

elements of physiological

function

Transcribed into simulation

models in computational biology

Simulation methods

Stochastic → individual-based

modeling (aka agent-based)

Deterministic → kinetic modeling

(aka quantitative), differential

equations system25

Page 26: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Simulating Molecular Interactions

Agent-based modeling of a stochastic process

Modelling diffusion-reaction process

Diffusion is modelled through random walk

When agents collide reaction rules apply

Diffusion event happens every timestep,

reaction every ~1000 timestep

MCell is a popular tool for ABM

CellBlender connects MCell with Blender

26

Reaction A + B → C

Page 27: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Spatially Realistic Molecular Simulations

27

Video created with CellBlender

Page 28: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Spatially Realistic Molecular Simulations

28

Video created with CellBlender

Page 29: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Illustrative Time-lapse: Requirements

R1: Show simulation in reasonable time

R2: Keep track of individual elements

R3: Ensure visibility of events of interest

R4: Ensure high level of realism

29

Page 30: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Scientific Animation

30 Drew Berry, Apoptosis, 2006

Integrating two temporal scales 2-3 orders of

magnitude apart

Page 31: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Illustrative Time-lapse: Approach

R1: Show simulation in reasonable time

Timelapse

R2: Keep track of individual elements

Trajectory Filtering

R3: Ensure visibility of events of interest

Time Warp, Highlighting

R4: Ensure high level of realism

Temporal Lens

31

Page 32: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Timelapse and Trajectory Filtering

32

ρ=0.5 ~ 50%

Page 33: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Illustrative Mitochondria Model

33

VDAC channel crossing

Page 34: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Timelapse and Trajectory Filtering

34

Timelapse only

Timelapse + Trajectory Filtering

Page 35: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Timewarp and Emphasis

35

Reactions, channel passing, … are the most

important physiologically-relevant events

Hard to spot

Lend “time” before reaction, after reaction

return “time” to get back in “presence”

Could cause causal inconsistencies, exploiting

diffusion-reaction scale difference

Page 36: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Without Timewarp and Emphasis

36

Missed event

Page 37: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

With Timewarp and Emphasis

37

Visible event

Page 38: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Lens

Combines filtered and original trajectories

38

ρ=0.5 ρ=1.0

Page 39: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Without Lens

39

Page 40: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

With Lens

40

Page 41: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Screen-Space Lens

41

Define maximum velocity in screen space

Pleasant perspective effect

Easy parameter setup

Page 42: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Speed x100

Original motion Illustrative Time-lapse

Page 43: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

ABM: Limitations for Visual Explanations

Designed for simulation of molecular

interactions, not for visual explanation

Bottom-up process, difficult to enforce the

illustration intent on the system

Visual clutter dominates the scene

Does not guide to follow or understand story

High computational cost of simulation

Large quantities of data to process

No ways to steer the simulation

43

CellBlender

Page 44: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Reaction A + B → C

Modeling Physiology

Biological pathway describes

elements of physiological

function

Transcribed into simulation

models in computational biology

Simulation methods

Stochastic → individual-based

modeling (aka agent-based)

Deterministic → kinetic modeling

(aka quantitative), differential

equations system44Ivan Viola

Page 45: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Quantitative Model of Molecular Interactions

Particle system enabling comprehensive

visual explanations with top-down global

system control

Uses results of quantitative simulation

instead of individual-based simulation

Mimicks the reaction-diffusion process

synchronized with the quantities over time

Molecules are obtained from the structural

biology DB, populated uniformly in space

45Ivan Viola

Page 46: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Visualizing Diffusion-Reaction Process

46

Random Walk

Ivan Viola

Page 47: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Visualizing Diffusion-Reaction Process

47

Query Simulation

Ivan Viola

Page 48: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Reaction Order

Visualizing Diffusion-Reaction Process

48Ivan Viola

Page 49: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Visualizing Diffusion-Reaction Process

49

Partner SearchSteering ForcesReaction

Ivan Viola

Page 50: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

User Control

Visualizing Diffusion-Reaction Process

50Ivan Viola

Page 51: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Simplified NAD Pathway

51Ivan Viola

Page 52: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

Wikipedia + Dynamic Google Earth for E. coliE. coli data are already available in separate scientific databases. One-click export into cellPack is a goal in data generation stage. On the fly packing in real-time.

Building molecular machineriesRepresenting the states of macromolecule in a state machine that acts and reacts to its environment. Level of detail for physiology

Combining the nano with microMicroscopic imaging modalities of cells can be enriched by nanoscopic detail by populating it on demand when zooming according to our present knowledge.

Creating a modeling framework for nanoscaleModeling biology assisted with machine learning. CAD of DNA constructs (DNA origami, vHelix) by designing new visual metaphors

52

Future Work

Ivan Viola

Page 53: Multi-Scale Molecular Data VisualizationPrevious techniques are tailored for single-instance scenarios Occlusion management can benefit from multi-instance nature of the data Which

53

Thanks

Mathieu Le Muzic, Peter Mindek, David Kouril, Johannes Sorger, Nicholas Waldin,

Matthias Reisacher, Julius Parulek, Manuela Waldner, Ludovic Autin, Art Olson,

David Goodsell, Graham Johnson, Drew Berry, Meister Eduard Gröller

www.cg.tuwien.ac.at/research/projects/illvisation/cellview/cellview.php

https://github.com/illvisation/cellVIEW

Ivan Viola