a big data cheat sheet: the big pharma edition · 2015-06-08 · big data is not new. patient...
TRANSCRIPT
![Page 1: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/1.jpg)
Copyright © 2012 , SAS I ns ti tute I nc . A l l r ights reserved .
A BIG DATA CHEAT SHEET:
THE BIG PHARMA EDITION
TAMARA DULL, DIRECTOR OF EMERGING TECHNOLOGIES
![Page 2: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/2.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
Big data
is not
new.
PATIENT
RECORDS
HOSPITAL
ADMISSIONS
SCHEDULING
DATA
FINANCIAL
DATA
INSURANCE
DATA 20%
EMAIL PDF FILES RFID TAGS SPREAD-
SHEETS
WORD
PROCESSING
DOCUMENTS
GPS WEB LOG
DATA
SOCIAL
MEDIA
DATA
PHOTOS SATELLITE
IMAGES
RESEARCH
DATA FORUMS
CLINICAL
TRIALS
LAB
RESULTS VIDEOS
MOBILE
DATA
WEBSITE
CONTENT OPEN DATA
MARKETING
DATA
AUDIO
FILES
80%
![Page 3: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/3.jpg)
Copyright © 2012 , SAS I ns ti tute I nc . A l l r ights reserved .
3 Definitions
4 Trends
5 Questions
HERE’S OUR 3-4-5 PLAN:
![Page 4: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/4.jpg)
Copyright © 2012 , SAS I ns ti tute I nc . A l l r ights reserved .
3 DEFINITIONS
![Page 5: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/5.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
THE DEFINITIONS BIG DATA
SOURCE: Frost & Sullivan: “Drowning in Big Data? Reducing Information Technology
Complexities and Costs for Healthcare Organizations”
―Big Data refers to electronic health data
sets so large and complex that they are
difficult (or impossible) to manage with
traditional software and/or hardware; nor
can they be easily managed with
traditional or common data management
tools and methods…
Volume, Velocity, and Variety—often
referred to as the three V’s of Big Data—
capture the true meaning of Big Data.‖
―That amount of data
or complexity which
puts you out of your
comfort zone.‖
Paul Kent
VP of Big Data
SAS Institute
![Page 6: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/6.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
THE DEFINITIONS HADOOP
…or an ecosystem?
Is it a project…
NOTE:
Hadoop is not
synonymous
with big data
![Page 7: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/7.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
THE DEFINITIONS DATA LAKE
―A data lake is a storage repository that
holds a vast amount of raw data in its
native format, including structured, semi-
structured, and unstructured data. The
data structure and requirements are not
defined until the data is needed.‖
―If you think of a datamart as a store of
bottled water – cleansed and packaged
and structured for easy consumption – the
data lake is a large body of water in a
more natural state. The contents of the
data lake stream in from a source to fill
the lake, and various users of the lake can
come to examine, dive in, or take
samples.‖
James Dixon
CTO, Founder & Chief Geek
Pentaho
![Page 8: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/8.jpg)
Copyright © 2012 , SAS I ns ti tute I nc . A l l r ights reserved .
4 TRENDS
![Page 9: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/9.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
The market is growing.
SOURCE: http://wikibon.org/wiki/v/Big_Data_Vendor_Revenue_and_Market_Forecast_2013-2017
![Page 10: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/10.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
The success rate is meh.
![Page 11: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/11.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
People issues trump technology issues.
![Page 12: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/12.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
Analytics keeps them coming back.
![Page 13: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/13.jpg)
Copyright © 2012 , SAS I ns ti tute I nc . A l l r ights reserved .
5 QUESTIONS
![Page 14: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/14.jpg)
Copyright © 2012 , SAS I ns ti tute I nc . A l l r ights reserved .
HERE’S THE 5 QUESTIONS:
1. What can Hadoop do that my data
warehouse can’t?
2. We’re not doing “big” data, so why do we
need Hadoop?
3. Is Hadoop enterprise-ready?
4. How is big data impacting Big Pharma
today?
5. What are the primary threats to big data
adoption?
![Page 15: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/15.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
QUESTION #1 WHAT CAN HADOOP DO THAT MY DATA WAREHOUSE CAN’T?
2. Process data more quickly
(and cheaply).
1. Store data more cheaply. $
![Page 16: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/16.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
QUESTION #2 WE’RE NOT DOING “BIG” DATA, SO WHY DO WE NEED HADOOP?
Stage structured data. Process structured data. Archive any data.
Process any data. Access any data. (via data warehouse)
Access any data. (via Hadoop)
![Page 17: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/17.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
QUESTION #3 IS HADOOP REALLY ENTERPRISE-READY?
For your organization: Maybe
For all organizations: No
Are we
there
yet?
![Page 18: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/18.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
QUESTION #4 HOW IS BIG DATA IMPACTING BIG PHARMA TODAY?
![Page 19: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/19.jpg)
C o p y r i g h t © 2 0 1 2 , S A S I n s t i t u t e I n c . A l l r i g h t s r e s e r v e d .
QUESTION #5 WHAT ARE THE PRIMARY THREATS TO BIG DATA ADOPTION?
PRIVACY SECURITY
IT
science
analytics
business
SKILLS
![Page 20: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/20.jpg)
Copyright © 2012 , SAS I ns ti tute I nc . A l l r ights reserved .
WRAP-UP
![Page 21: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/21.jpg)
Copyright © 2012 , SAS I ns ti tute I nc . A l l r ights reserved .
HERE ARE YOUR KEY TAKEAWAYS:
It’s the big data technologies – not the
data itself – that’s new
Understand the context when talking
about Hadoop
If you’re doing big data without
analytics, you’re wasting your time
Approach big data smartly and learn
from other…industries, mistakes, etc.
![Page 22: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/22.jpg)
Copyright © 2012 , SAS I ns ti tute I nc . A l l r ights reserved .
![Page 23: A BIG DATA CHEAT SHEET: THE BIG PHARMA EDITION · 2015-06-08 · big data is not new. patient records hospital admissions scheduling data financial data insurance data 20% email s](https://reader034.vdocuments.mx/reader034/viewer/2022042313/5edd1eaaad6a402d666817b3/html5/thumbnails/23.jpg)
Copyright © 2012 , SAS I ns ti tute I nc . A l l r ights reserved . sas.com
IT’S A BIG DATA WORLD OUT THERE.
NOW LET’S BE SAFE.
@tamaradull