sizing up big data? hitting the "v"s
TRANSCRIPT
-
7/31/2019 Sizing up Big Data? Hitting the "V"s
Clive Longbottom,
Service Director, Quocirca Ltd
-
Quocirca 2012
It's not about databases per se
It is about:
Volume: but not just databases
Velocity: results need to be produced in near real time
Variety: the aspect that is missed by many
Veracity: how good are the inputs?
Value: is the data worth it?
-
20 years ago:
Only 20% of an organisation's information was in electronic form
80% of this was in a formal database
Today:
Well over 80% of an organisation's information is in electronic form
Less than 20% is in a formal database
-
Diagram: CRM, ERP and SCM, each in its own information silo
-
Not just text but images, video media assets, VoIP, videoconferencing
Replicated/archived data a large part of growth
But is it completely unstructured?
Source: Ram Subramanyam Gopalan
-
XML (or quasi-XML)
CSV/tab delimited
Text blocks
Metadata
TCP/IP packet header information
Pattern recognition
Colour, shape, texture (CST)
Inferred data
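As a rough illustration of the point that "unstructured" sources often carry usable structure, the sketch below reads a CSV fragment and an XML fragment into the same list-of-records shape. The sample data, the field names and the helper names (records_from_csv, records_from_xml) are assumptions made for the example, not part of the original slides.

```python
import csv
import io
import xml.etree.ElementTree as ET


def records_from_csv(text):
    """Return one dict per CSV row, keyed by the header line."""
    return [dict(row) for row in csv.DictReader(io.StringIO(text))]


def records_from_xml(text, item_tag):
    """Return one dict per <item_tag> element, keyed by child tag name."""
    root = ET.fromstring(text)
    return [
        {child.tag: (child.text or "").strip() for child in item}
        for item in root.iter(item_tag)
    ]


# Hypothetical sample inputs, invented for illustration only.
csv_text = "id,name\n1,Alice\n2,Bob\n"
xml_text = "<orders><order><id>3</id><name>Carol</name></order></orders>"

# Both sources end up as the same kind of record.
records = records_from_csv(csv_text) + records_from_xml(xml_text, "order")
print(records)
```

Once the two formats share one record shape, the same search and reporting code can run over both, which is the practical benefit of treating such data as semi-structured.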
-
Your organisation
Supplier, supplier's supplier
Customer, customer's customer
Information flows
Open information from e.g. search engines, social networks
-
Organisation data:
Enterprise application data
Office documents
Reports, analytics
GRC information
Information on competitors
Financial performance data
Images, voice, video
-
Supplier data:
Logistics data
Inventory data
Transactional data
Competitive information
Credit and background checks
Invoices, catalogues, contracts, images
Voice, video
-
Customer data:
Orders, payment details, returns information
Past purchases
Credit and background checks
Searches, web analytics
Social media comments
-
You no longer have control
The open value chain removes direct control
Security of information assets is critical
Identifying and aggregating information assets
Capturing information when and where possible and legal
Bringing structured and unstructured together
Sifting through the dross to get to the golden nuggets
-
Information under your control:
Deduplicate
Taxonomise
Index
Tag
Information not under your control:
Filter (intelligently)
Tag and index when it crosses your boundaries
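A minimal sketch of the deduplicate, tag and index steps for information under your control is given below. It assumes documents are plain strings, that duplicates differ only in case and punctuation, and that a small keyword taxonomy is enough for tagging. The TAXONOMY table and all function names are hypothetical, introduced only for this example.

```python
import hashlib
from collections import defaultdict

# Hypothetical taxonomy: tag -> keywords that trigger it.
TAXONOMY = {
    "finance": {"invoice", "payment"},
    "logistics": {"shipment", "delivery"},
}

PUNCTUATION = ".,;:!?"


def tokens(text):
    """Lower-case words with surrounding punctuation stripped."""
    stripped = (word.strip(PUNCTUATION).lower() for word in text.split())
    return [word for word in stripped if word]


def deduplicate(docs):
    """Keep the first copy of each document, comparing by content hash."""
    seen, unique = set(), []
    for doc in docs:
        normalised = " ".join(tokens(doc)).encode("utf-8")
        digest = hashlib.sha256(normalised).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(doc)
    return unique


def tag(doc):
    """Return the taxonomy tags whose keywords appear in the document."""
    words = set(tokens(doc))
    return sorted(name for name, keywords in TAXONOMY.items() if words & keywords)


def build_index(docs):
    """Build an inverted index: word -> set of document positions."""
    index = defaultdict(set)
    for doc_id, doc in enumerate(docs):
        for word in tokens(doc):
            index[word].add(doc_id)
    return index


docs = [
    "Invoice 42: payment due.",
    "invoice 42: payment due.",  # duplicate apart from letter case
    "Shipment delayed; delivery Friday.",
]
unique = deduplicate(docs)
tags = [tag(doc) for doc in unique]
index = build_index(unique)
print(len(unique), tags, sorted(index["payment"]))
```

For information not under your control, the same tag and build_index functions could be applied at the point where the data crosses the organisation's boundary, after an initial filter.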
-
Link databases
Use master data management
Bring in unstructured data
Use Hadoop along with NoSQL datastores (e.g. Cassandra, MongoDB)
Use cross-function search and reporting tools
E.g. HP Autonomy, CommVault Simpana
Use analytics to present results in meaningful ways
-
SQL NoSQL
MapReduce
Filter
Apply metadata
App
Search, analyse and report
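The flow on this slide can be sketched in plain Python. This is not Hadoop code: in-memory lists stand in for the SQL and NoSQL sources, and the map, shuffle and reduce steps are written out by hand to show the idea. The sample records, the MIN_COUNT threshold and the records_scanned metadata field are all assumptions for illustration.

```python
from collections import defaultdict

# Stand-ins for rows from a SQL database and documents from a NoSQL store.
sql_rows = [{"customer": "acme", "text": "late delivery complaint"}]
nosql_docs = [
    {"customer": "acme", "text": "great delivery"},
    {"customer": "zenith", "text": "delivery complaint"},
]


def map_phase(record):
    """Emit a (word, 1) pair for every word in the record's text."""
    for word in record["text"].split():
        yield word, 1


def shuffle(pairs):
    """Group emitted values by key, as a framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Collapse all values for one key into a single total."""
    return key, sum(values)


# MapReduce over both sources together.
records = sql_rows + nosql_docs
pairs = [pair for record in records for pair in map_phase(record)]
counts = dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())

# Filter: keep terms seen at least MIN_COUNT times; then apply simple metadata.
MIN_COUNT = 2
report = sorted(
    (
        {"term": term, "count": n, "records_scanned": len(records)}
        for term, n in counts.items()
        if n >= MIN_COUNT
    ),
    key=lambda row: -row["count"],
)

# The application layer would search, analyse and report on this output.
for row in report:
    print(row)
```

In a real deployment the map and reduce functions would run in parallel across many nodes. The single-process version here only shows how the stages fit together.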
-
It's déjà vu all over again
Remember in-memory databases?
Big data cannot remain as a jigsaw solution
Full-service solutions will come forward
Who will be the winners?
Oracle, IBM, Microsoft?
SAP?
EMC, Symantec?
The Open Source environment (e.g. 10Gen, Apache/Cassandra, CouchDB)?
-
Big Data has many vectors
Volume, velocity, variety and veracity: each is as important as the others - value will accrue through getting them right
More information is outside the realm of your direct control
Capturing what can be captured in a useful manner is key
The evolution of the market is rapid
NoSQL and Hadoop provide the underpinnings for a new, information-centric approach
The formal database is not dead
But it is only one aspect of the problem and the solution
-
Thank you
Contact details:
Further reading:
http://quocirca.com/reports/150
http://quocirca.com/articles/617
http://quocirca.com/articles/637