data intensive computing graph algorithms for irregular, unstructured data – john feo, pacific...

5

Data Intensive Computing • Graph algorithms for irregular, unstructured data – John Feo, Pacific Northwest National Laboratory • graph500 and data-intensive computing – Richard Murphy, Sandia National Laboratories • Large-scale knowledge discovery – Steve Reinhardt, Microsoft • Data intensive computing at SNL – Andrew Wilson, Sandia National Laboratories • IBM's InfoSphere Streams – Roger Rea, IBM • Graph500 and Data Intensive HPC – Richard Murphy, Sandia National Laboratory • Data Analytics – Phillip Morris, Platform Computing • Data Intensive Computing – Richard Altmaier, Intel

Upload: edward-murphy

Post on 19-Jan-2016

213 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: Data Intensive Computing Graph algorithms for irregular, unstructured data – John Feo, Pacific Northwest National Laboratory graph500 and data-intensive

Data Intensive Computing• Graph algorithms for irregular, unstructured data

– John Feo, Pacific Northwest National Laboratory• graph500 and data-intensive computing

– Richard Murphy, Sandia National Laboratories• Large-scale knowledge discovery

– Steve Reinhardt, Microsoft• Data intensive computing at SNL

– Andrew Wilson, Sandia National Laboratories• IBM's InfoSphere Streams

– Roger Rea, IBM• Graph500 and Data Intensive HPC

– Richard Murphy, Sandia National Laboratory• Data Analytics

– Phillip Morris, Platform Computing• Data Intensive Computing

– Richard Altmaier, Intel

Page 2: Data Intensive Computing Graph algorithms for irregular, unstructured data – John Feo, Pacific Northwest National Laboratory graph500 and data-intensive

1. Please provide a definition for "Data Intensive Computing". Please explain the difference between "finding a needle in a haystack" and "knowledge discovery".

Page 3: Data Intensive Computing Graph algorithms for irregular, unstructured data – John Feo, Pacific Northwest National Laboratory graph500 and data-intensive

2. "Data Intensive Computing" generally involves analyses of non-numeric data and the number of combinatorial possibilities grows rapidly. The objective of the analysis is to find in the data, meaningful relationships. How do we (a) test for convergence when not evaluating all possible combinations and (b) test for statistical significance--when the data is non-numeric?

Page 4: Data Intensive Computing Graph algorithms for irregular, unstructured data – John Feo, Pacific Northwest National Laboratory graph500 and data-intensive

3. "Data Intensive Computing" often involves the use of incomplete data. How does this affect the analysis process?

Page 5: Data Intensive Computing Graph algorithms for irregular, unstructured data – John Feo, Pacific Northwest National Laboratory graph500 and data-intensive

4. If you could design an ideal computing architecture for Data Intensive Computing, what would it look like?

Large Scale Data Facility for Data Intensive Synchrotron

Data-Intensive Distributed Computing

Data Intensive Computing at Sandia

Prototyping Data Intensive Apps: TrendingTopics.org

Data Management Reading: Chapter 5: Data-Intensive Computing And A Network-Aware Distributed Storage Cache for Data Intensive Environments

Designing Data-Intensive Applications

Data-and-Compute Intensive Processing

Future of Data Intensive Applicaitons

Data Intensive Computing Frameworks

CompSci516 Data Intensive Computing SystemsCompSci516 Data Intensive Computing Systems Lecture 21 Datalog Instructor: SudeepaRoy Duke CS, Fall 2016 1 CompSci 516: Data Intensive Computing

Data Intensive Linguistics

Prototyping Data Intensive Apps: TrendingTopicsdatawrangling.s3.amazonaws.com/trendingtopics_talk.pdf · Prototyping Data Intensive Apps: TrendingTopics.org Pete Skomoroch Research

Highly Scalable Graph Search for the Graph500 … Scalable Graph Search for the Graph500 Benchmark Koji Ueno Tokyo Institute of Technology / JST CREST [email protected] Toyotaro

Some Interesting Applications€¦ · Sept. 2016 13 Graph500 Sept. 2016 14 Graph500: • Several years of reports on performance of BFS implementations on – Different size graphs

Data Intensive Applications on Clouds

Extreme Data-Intensive Scientific Computing

Data-Intensive Text Processingwith MapReduce

Data-Intensive Scientific Discovery

NUMA-aware thread-parallel breadth-first search for Graph500 and Green Graph500 Benchmarks on SGI UV 2000

Announcing the 11 th Graph500 List! Graph500 Co-Founders: David A. Bader, Georgia Tech Andrew Lumsdaine, Indiana University Richard Murphy, Micron Technology,

Data Intensive Research with DISPEL

DATA REPLICATION IN DATA INTENSIVE SCIENTIFIC …

CPS216: Data-Intensive Computing Systems

Abstractions for Data Intensive Computing

Data-Intensive Computing Symposium Data-Intensive Computing Symposium: Report Out Phillip B. Gibbons Intel Research Pittsburgh

Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14

Enabling Data-Intensive Science Through Data Infrastructures

Data Intensive Applications Fitzmaurice - Data Intensive... · Summary • Designing Data Intensive Applications is worth reading for any backend or storage engineer • Generators

Data classification algorithm for data-intensive computing ... · RESEARCH Open Access Data classification algorithm for data-intensive computing environments Tiedong Chen1, Shifeng

Data-intensive Image based Relighting

Scalable Peer-to-Peer Data Mining for Data-Intensive ... Borne.pdf · From Data-Driven to Data-Intensive • Astronomy has always been a data-driven science • It is now a data-intensive

Runtime Data Management for Data-Intensive Scientific Applications

Data -Intensive Computing Systems Data Access from Disks

Data Intensive Engineering and Science