the reality of bigdata - #beltech2014
TRANSCRIPT
![Page 1: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/1.jpg)
The Reality of Big Data
![Page 2: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/2.jpg)
#beltech2014
![Page 3: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/3.jpg)
#1 – What problem are you trying to solve?
![Page 4: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/4.jpg)
Most of SME’s problems aren’t Big Data, it’s just data.
![Page 5: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/5.jpg)
Without a question you are wasting your time.
![Page 6: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/6.jpg)
#2 – Data will need cleaning
![Page 7: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/7.jpg)
Roughly 80% of your data project will be getting the data into shape
before processing.
![Page 8: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/8.jpg)
Btiany Spears
![Page 9: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/9.jpg)
#3 – Hadoop, on it’s own, will NOT give you the answers.
![Page 10: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/10.jpg)
#3 – Hadoop, on it’s own, will NOT give you the answers.
(The Big Data version of “putting it in the cloud”)
![Page 11: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/11.jpg)
If anyone says, “will Hadoop just give us the answers” or “put it in the
cloud”, do this….
![Page 12: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/12.jpg)
Spit on one, or both, of their feet and bite your thumb while shouting:
“The fig of Spain!”.
![Page 13: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/13.jpg)
#4 – Do you actually need Hadoop?
![Page 14: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/14.jpg)
A well crafted algorithm may give you more benefit.
![Page 15: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/15.jpg)
It’s about knowing the right questions.
![Page 16: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/16.jpg)
And refining and refining and refining…..
![Page 17: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/17.jpg)
The first run won't work at allThe second only makes you wonder
The third will have you on your knees.....
![Page 18: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/18.jpg)
#5 – Data changes
![Page 19: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/19.jpg)
…especially when you don’t own it.
![Page 20: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/20.jpg)
If you feel your data has value then retain it.
![Page 21: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/21.jpg)
If your data passes over the “creepy line” then definitely retain it.
![Page 22: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/22.jpg)
#6 – Skills are in short supply
![Page 23: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/23.jpg)
Work with what you have.
![Page 24: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/24.jpg)
Play with data, it’s the best way to learn.
![Page 25: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/25.jpg)
Collaborate with others to fill the skills gaps.
![Page 26: The Reality of Bigdata - #Beltech2014](https://reader033.vdocuments.mx/reader033/viewer/2022052912/55a05c041a28abfe678b47ec/html5/thumbnails/26.jpg)
Thank you
http://about.me/jasebell
@hadooping