sizing up big data? hitting the "v"s

Upload: quocirca

Post on 04-Apr-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    1/17

    Clive Longbottom,

    Service Director, Quocirca Ltd

    Clive Longbottom,

    Service Director, Quocirca Ltd

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    2/17

    Quocirca 2012

    Its not about databases per se

    It is about:

    Volume but not just databases

    Velocity results need to beproduced in near real-time

    Variety the aspect that is missedby many

    Veracity how good are the inputs

    Value is the data worth it?

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    3/17

    Quocirca 2012

    20 years ago: Only 20% of an organisationsinformation was in electronic form

    80% of this was in a formal database

    Today:

    Well over 80% of an organisationsinformation is in electronic form

    Less than 20% is in a formal database

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    4/17

    Quocirca 2012

    Inf. Silo

    CRM ERP SCM

    Inf. Silo Inf. Silo

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    5/17

    Quocirca 2012

    Not just text but images, video media assets, VoIP,

    Videoconferencing Replicated/archived data a large part of growth

    But is it completely unstructured?

    Source: Ram Subramanyam Gopalan

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    6/17

    Quocirca 2012

    XML (or quasi-XML)

    CSV/tab delimited

    Text blocks

    Meta data

    TCP/IP packet header information Pattern recognition

    Colour, shape, texture (CST)

    Inferred data

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    7/17

    Quocirca 2012

    YourOrganisation

    Supplier Supplierssupplier

    CustomerCustomerscustomer

    Information flows

    Open information from e.g. search engines, social networks

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    8/17

    Quocirca 2012

    Organisation data:

    Enterprise application data

    Office documents

    Reports, analytics

    GRC information

    Information on competitors

    Financial performance data

    Images, voice, video

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    9/17

    Quocirca 2012

    Supplier data

    Logistics data

    Inventory data Transactional data

    Competitive information

    Credit and background checks

    Invoices, catalogues, contracts, images Voice, video

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    10/17

    Quocirca 2012

    Customer data:

    Orders, payment details, returns information

    Past purchases Credit and background checks

    Searches, web analytics

    Social media comments

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    11/17

    Quocirca 2012

    You no longer have control

    The open value chain removesdirect control

    Security of information assets iscritical

    Identifying and aggregatinginformation assets

    Capturing information when andwhere possible and legal

    Bringing structured andunstructured together

    Sifting through the dross to get tothe golden nuggets

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    12/17

    Quocirca 2012

    Information under your control:

    Deduplicate

    Taxonomise

    Index

    Tag

    Information not under your control:

    Filter (intelligently)

    Tag and index when it crosses yourboundaries

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    13/17

    Quocirca 2012

    Link databases

    Use master data management

    Bring in unstructured data

    Use Hadoop along with NoSQL datastores (e.g.Cassandra, MongoDB)

    Use cross-function search and reporting tools

    E.g. HP Autonomy, CommVault Simpana

    Use analytics to present results in meaningful ways

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    14/17

    Quocirca 2012

    SQL NoSQL

    MapReduce

    Filter

    Apply metadata

    App

    Search, analyse and report

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    15/17

    Quocirca 2012

    Its dj vu all over again

    Remember in-memory databases?

    Big data cannot remain as a jigsaw solution

    Full-service solutions will come forward

    Who will be the winners?

    Oracle, IBM, Microsoft?

    SAP?

    EMC, Symantec?

    The Open Source environment (e.g. 10Gen,Apache/Cassandra, CouchDB)?

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    16/17

    Quocirca 2012

    Big Data has many vectors

    Volume, velocity, variety and veracity: each is asimportant as the others - value will accrue throughgetting them right

    More information is outside the realm of your direct

    control Capturing what can be captured in a useful manner is

    key

    The evolution of the market is rapid

    NoSQL and Hadoop provide the underpinnings for anew, information centric approach

    The formal database is not dead

    But it is only on aspect of the problem and thesolution

  • 7/31/2019 Sizing up Big Data? Hitting the "V"s

    17/17

    Quocirca 2012

    Thank you

    Contact details:

    [email protected]

    Further reading:

    http://quocirca.com/reports/150http://quocirca.com/articles/617

    http://quocirca.com/articles/637