Download - Big data(Sandeep Chaudhary)
Sandeep ChaudharyB.tech(CSE) – B
102910059
Sandeep Chaudhary1
Outline What is Big Data ?
What makes Data Big Data ?
3 V’s of Big Data
Why do we need Big Data ?
Filtering Big Data Effectively
Risks of Big Data
Statistics about On-Line Usage
Sandeep Chaudhary 2
What is Big Data ? Big Data is about liberating
data that is large in Volume, broad in Variety, and high in Velocity.
Big Data refers to Data Sets where the size is beyond the ability of typical database Software tools to capture, store, manage and analyze.
Sandeep Chaudhary 3
What makes Data “Big Data”Big Data is characterized by the 3 V’s :
Volume : larger than “normal”, a challenge to load and process.
Velocity : Rate of arrival posses real-time constraints on what are typically “batch ETL” operations.
Variety : Mix of Data types and varying degrees of Structure.
Sandeep Chaudhary 4
3 V’s of Big Data
Sandeep Chaudhary 5
Why do we need Big Data ?Big Data : is a mix of Structured, Semi-structured and
unstructured data –
Typically breaks barriers for traditional RDB Storage.
Typically breaks limit of Indexing.
Typically requires intensive pre-processing before each query to extract.
Sandeep Chaudhary 6
Filtering Big Data Effectively The extract, transform and load (ETL) process.
Taking a raw feed of data, reducing it, and producing a uasable set of output.
Sandeep Chaudhary 7
Risks of Big Data Will be so over-whelmed
Need the right people and solve the right problems.
Costs escalate too fast
Isn’t necessay to captue 100%
Many sources of Big Data is private
Self-Regulation
Legal regulation
Sandeep Chaudhary 8
Some facts and figures related to Online Data Usage : How many Data in the world :
800 Terabytes, 2000
160 Exabytes, 2006
500 Exabytes, 2009
2.7 Zettabytes, 2012
35 Zettabytes, 2020
How many Data generated in ONE Day ?
7 Terabytes, Twitter
10 Terabytes, Facebook
Sandeep Chaudhary
9
Sandeep Chaudhary 10
Any Questions
Sandeep Chaudhary 11
Sandeep Chaudhary
12
For Your Patience Listening.