exploiting the unused part of the brain - caltech group at ... · 11/16/16 sc16, brain inspired...

Exploiting the UnusedPart of the Brain

Deep Learning and Emerging TechnologyFor High Energy Physics

Jean-Roch Vlimant

A 10 Megapixel Camera

CMS 100 Megapixel Camera

CMS Detector

CMS Readout

Highly heterogeneous system Raw data is 100M channelssampled every 25 ns : 1Pb/s50EB per day in readout and

online processing.

Event Filtering

From Big Data to Smart Datawith ultra fast decision

1000 Gb/s1000

L1 HLT

11/16/16 SC16, Brain Inspired Technologies, J.-R. Vlimant 7

Why Deep Learning● LHC Data Processing may use deep learning methods in many aspects (attend

other relevant talk at the Caltech booth)● Large volume of data and simulated data to analyze● Several class of LHC Data analysis make use of classifier for signal versus

background discrimination.✔ Use of BDT on high level features✔ Increasing use of MLP-like deep neural net

● Deep learning has delivered super-human performance at certain class of tasks(computer vision, speech recognition, ...)

✔ Use of convolutional neural net, recurrent topologies, long-short-term-memorycells, ...

● Deep learning has advantage at training on “raw” data➢ Several levels of data distillation in LHC data processing

● Neural net computation is highly parallelizable

➔ Going beyond fully connected networks with advanced deep neural net topologies➢ Multi-classification of LHC events from particle-level information➢ Charged particle tracking with recurrent and convolutional topologies➢ Particle identification and energy regression in highly granular future

calorimeter➢ ...

Advanced Machine Learningand Deep Learning

(my selection)

Machine Learning in a Nutshell

Balazs Kegl, CERN 2014

Scene Labeling

Karpathy, Fei-Fei, CVPR 2015

● Create a description of images

➢ Generate a decay process description from collisionrepresentation, with application to triggers

Scenery Interpretation

Farabet et al. ICML 2012, PAMI 2013

● Group and classify what each pixel belongs to● Real-time video processing with deep learning

➢ Multiple applications to pileup mitigation, objectidentification, tracking. All from “raw data”

Attention Learning

● Identify people from faces with multiple attentionfilters

➢ Object identification, noise subtraction, ...

Text Processing

● Question and Answer machine, languagetranslation, semantic arithmetic, ...

➢ Can the raw data of detector be interpreted astexts and translated into physics descriptions ?

Embedded Symmetries

T.S. Cohen, M. Welling ICML2016

p4m group

p4 group

● Introduction of convolutionallayers was a ground-breaking advancement

● Research on embedding morefundamental symmetries intoneural nets

● Symmetries operate on thedata or internalrepresentation of data

● Next is to implementsymmetries of physics to buildphysics-specific NN

Toolkit and Services

● Lots of libraries out there, several key components in each majorlanguages. Lots of big-data analytics services offered

● Common theme of going for spark-hdfs support➢ Question of having in-house software or embracing external

libraries is very much alive

Application to Intensity andEnergy Frontiers

(a selected few)

NOVA Event Classification

Slides on Paolo

Particle Jet Identification

Neural net

W to QCDdiscrimination

W tagger arXiv: 1511.05190, Oliveira, Kagan, Mackey, Nachman, Schwartzman

Top Tagger arXiv: 1501.05968 Almeida, Backovic, Cliche, Lee, Perelstein

W to QCDdiscrimination

3D Calorimetry Imaging

100GeV Photon 100GeV Pi0

≠LCD Calorimeter configurationhttp://lcd.web.cern.ch5x5 mm Pixel calorimeter28 layer deep for Ecal70 layer deep for Hcal

Photon and pion particle gunClassification, regression andcombined models

Irregular Geometry Challenge

Hexagonal cells

Projective Geometry

Variable Depth Segmentation

The images we are dealing with arenot as regular as standard images.Need for specific new treatment andmethods to feed neural nets

Collision Event Classification

● Full event classification using reconstructed particle 4-vectors● Recurrent neural nets, Long short term memory cells● Dedicated layer with Lorentz boosting● Step toward event classification with lower level data : low

level feature as opposed to analysis level variables

Ordering ChallengeText have naturalorder. RNN/LSTMcan correlate theinformation tointernalrepresentation

There isunderlying order incollision events.Smeared throughtiming resolution.No natural order in observable

Charged Particle Tracking

● Perfect example of pattern recognition● Data sparsity is not common in image processing● Several angles to tackle the problem. Deep Kalman filter,

RNN to learn dynamics, sparse image processing, ...● Kaggle challenge in preparation

https://indico.hep.caltech.edu/indico/event/102

Challenge of Tracking

HEP Trk.Xhttps://heptrkx.github.io/

● LBL, FNAL, Caltech consortium sponsored by the DOE● Preparation of simulated data

➢ Accurate, fast and light● Explore new approaches to charged particle tracking using

advanced pattern recognition techniques➢ Recurrent neural nets in learning track kinematics➢ Convolutional neural nets for pattern recognition➢ Hough-like transformation from hit position space to track

parameter space➢ Advanced Kalman filter parallelization➢ ... exploration has started

● Tracking competition (a la kaggle) in preparation

Other Applications

● Outliers selection● Anomaly detection● Data quality automation● Detector control● Experiment control● Data popularity prediction● Computing grid control● Denoising with auto-encoder● Fast simulation ● ...

Accelerating and EmergingTechnologies

Supermicro Server

Caltech Servers● 2 compute nodes :

Intel® Xeon® CPU E5-2697 v3 @ 2.60GHz processors pernode (28 cores per CPU) with 8 NVIDIA® Pascal® GTX 1080

● Theoretical Peak Performance :80 Tflops

➔ Theano, Tensorflow, Keras➔ MPI training➔ Spearmint hyper-optimization

Many thanks to our partners Nvidia and Supermicro

Cooley● 126 compute nodes :

Two 2.4 GHz Intel®Haswell® E5-2620 v3processors per node (6 coresper CPU, 12 cores total) andNVIDIA® Tesla® K80

● Theoretical Peak Performance :293 Tflops

➔ Development Project with 8kcore hours

➔ Theano, Tensorflow, Keras➔ MPI ready➔ Spearmint experimental

Piz Daint● 5272 compute nodes :

Intel® Xeon® E5-2670 @2.60GHz (8 cores, 16 virtualcores with hyperthreadingenabled, 32GB RAM) andNVIDIA® Tesla® K20X

● Theoretical Peak Performance :7.787 Pfops

● Scratch capacity : 2.7 PB

➔ Development Project with 36kcore hours

➔ Theano, Tensorflow, Keras➔ MPI ready➔ Spark experimental stage

Titan● 18688 compute nodes :

2.2GHz AMD® Opteron®6274 processors per node (16cores per CPU) and NVIDIA®Tesla® K20X

● Theoretical Peak Performance :20 Pflops

➔ Allocation in preparation

Distributed Learning

● Deep learning with elastic averaging SGDhttps://arxiv.org/abs/1412.6651

● Revisiting Distributed Synchronous SGDhttps://arxiv.org/abs/1604.00981

● Implementation with Spark and MPI for the Kerasframework https://keras.io/

➔ https://github.com/JoeriHermans/dist-keras ➔ https://github.com/duanders/mpi_learn

Titan X

GTX 1080

Performance

● Ran on the supermicro 8GPU server here at SC16● Normalized to 2 GPU performance point

➔ Please add a factor 2x

Linear speedup withnumber of workers

Saturation in co-scheduling GPU (2 workers on the same GPU)

7x max speedup

Performance

● Ran on ALCF Cooley 126 GPU cluster● Normalized to 2 GPU performance point

➔ Please add a factor 2x

14x max speedup

Non-linear speedup withnumber of GPU

Linear speedup withnumber of GPU

Applicability

Training vs Inference

● GPUs are the workhorse forparallel computing

● Enable training large models, withlarge dataset

● Deep learning facility clusters

● Emergence of smaller GPU● Not dedicated to training● Strike the balance between

Tflops/$ for inference● Deployment on the grid

Neuromorphic Hardware

http://www.nature.com/articles/srep14730

● Implementing plasticity in hardware ● Process signal from detector and adapt tocategories of pattern (unsupervised)

● Post-classified from data analysis or rate throttling➢ NCCR consortium assembling to develop thistechnology further, with our use case in mind

Cognitive Computing

● Spiking neural net as processing units : ➔Cognitive Computing Processing Unit : CCPU

● Adopt a new programming scheme, translateexisting software

● See Rebecca Carney's talk for more details

Summary

Impressive achievement and promise ofmodern machine learning and deep learning

From realistic to speculative applicability tofield of High Energy and Frontier Physics

Emerging tools and technology to embrace

Thanks to our sponsors

exploiting the unused part of the brain - caltech group at ... · 11/16/16 sc16, brain inspired...

Documents

deep brain stimulation in the treatment of...

deep brain stimulation therapy india

deep brain stimulation · 2018. 1. 30. · deep brain...

large scale training and optimization of neural networks ......

deep brain stimulation for alzheimer’s disease

surgery for deep brain stimulation for movement · pdf...

7.01.63 deep brain stimulation - lifewiseor.com

deep brain stimulation - ohsu...deep brain stimulation risks...

dr.amr sobhy anesthesia and deep brain stimulation

deep brain stimulation

deep brain structures

deep brain stimulation surgery: what to expect€¦ · deep...

deep brain stimulation surgery experience

deep brain stimulation procedure

deep brain stimulation for parkinson's disease

brain plasticity and behavior - deep blue

and demystifying deep brain stimulation (dbs) › neurology...

deep brain learning a to z

mechanism of action of deep brain stimulationof deep brain...

habenula deep brain stimulation for intractable