welcome to greenplum day! · community greenplum database – – github code – mailing lists /...

17
1 1 Welcome to Greenplum Day! Roman Shaposhnik @rhatr Director of Open Source Strategy @Pivotal VP of Technology for ODPi @Linux Foundation

Upload: others

Post on 24-Oct-2020

17 views

Category:

Documents


0 download

TRANSCRIPT

  • 11

    Welcome to Greenplum Day!

    Roman Shaposhnik @rhatrDirector of Open Source Strategy @PivotalVP of Technology for ODPi @Linux Foundation

  • 22

    The First Open Source MPP Data Warehouse:

    Greenplum Database

  • 3

    MPP Shared Nothing ArchitectureFlexible framework for processing large datasets

    MasterHost

    SQL➔ Master Host + Standby Master Host

    Master coordinates work with Segment Hosts

    ➔ Segment Host with one or more Segment Instances

    ➔ Segments (Postgres Instances) process queries in parallel

    ➔ Segment Hosts have their own CPU, disk and memory (shared nothing)

    ➔ High speed interconnect for continuous pipelining of data processing

    Segment HostSegment InstanceSegment InstanceSegment InstanceSegment Instance

    node1

    Segment HostSegment InstanceSegment InstanceSegment InstanceSegment Instance

    node2

    Segment HostSegment InstanceSegment InstanceSegment InstanceSegment Instance

    node3

    Segment HostSegment InstanceSegment InstanceSegment InstanceSegment Instance

  • 4

    An ambitious project– 10 years in the making – Investment of hundred of millions of dollars– Potential to define a new market and disrupt traditional EDW vendors

    Community Greenplum Database– https://www.greenplum.org– Github code– mailing lists / community engagement– Global project w/ external contributors

    Pivotal Greenplum– Enterprise software distribution & release management– Pivotal expertise– 24-7 global support– Command Center, Workload Manager, More Add-Ons

    Greenplum Open Source

  • 5

    1986 … 1994 1996 1998 2000 2002 2004 2006 2008 2010 2012 2014

    1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015

    Greenplum Database journey to Open Source

    Michael Stonebraker develops Postgres at UCB

    Postgres adds support for SQL

    Open Source PostgreSQL

    PostgreSQL 7.0 released

    PostgreSQL 8.0 released

    Greenplum forks PostgreSQL

    HAWQ & MADlibgo Apache

    HAWQ launched

    Hadoop 2.0 Released

    MADliblaunched

    Greenplum open sourcedHadoop born at Yahoo!

  • 6

    Pivotal• Founded in April 2013

    • 2000+ employees

    • Funded by Dell/EMC, GE, Ford, Microsoft

    • HQ in Palo Alto & San Francisco

    • Part of Dell EMC Technology

    • Hundreds of millions in revenue (growing 50% YOY)

    • Hundreds of enterprise customers

  • 77

    UK&I France, Spain & Italy

    DACH/NL MEA/Russia

    A quick sample of our customers

  • Pivotal’s mission is to transformhow the world builds software

  • 75%of application development supporting digital business will be built not bought by 2020

    Source: Gartner

  • Drivers of digital transformation are fear and hope

    Fear of disruption

    Hope of transformation

  • Source: JPMorgan Chase Annual Shareholder Letter (2015)

    “Silicon Valley is coming…andthey all want to eat our lunch.”- Jamie Dimon CEO, JPMorgan Chase

  • Source: User Summit (2014)

    “If you went to bed last night as an industrial company, you’re going to wake up in the morning as a software and analytics company.”- Jeff Immelt CEO, General Electric

  • Hundreds of thousands of “trip” events each day

    400+ billion of viewing-related events per day

    Five billion training data

    points for Price Tip feature

    Pivotal helps turn traditional enterprises into...

  • Virtuous cycle: Apps, Data, Analytics.

    Apps power businesses, and those apps generate data

    Analytic insights from that data drive new app functionality, which in-turn drives new data

    The faster you can move around that cycle, the faster you learn, innovate & pull away from the competition

  • c Modern Software MethodologyModern Cloud Native Platform

    Modern Data Platform

    TRANSFORMING INTO A MODERN SOFTWARE COMPANY

    Building high-quality software at start-up speed requires modern software methodologies, cloud platform, and data tools

  • OpenLDW: Functional Architecture

    Inte

    grat

    ed D

    ata

    Inge

    st la

    yer

    (A

    pach

    e K

    afka

    )In

    tegr

    ated

    Dat

    a In

    gest

    laye

    r (

    Apa

    che

    Kaf

    ka)

    Data Sources Glacial & backupGlacial & backup

    Dat

    a-dr

    iven

    AP

    ID

    ata-

    driv

    en A

    PIOperational Analytics (MPP Engine)Operational Analytics (MPP Engine)

    In-memory Serving LayerIn-memory Serving Layer

    Analytics Landing ZonesAnalytics Landing ZonesClo

    ud E

    TL

    2.0

    Clo

    ud E

    TL

    2.0

  • cHelp us build the future of

    BUSINESS

    Slide 1Slide 2Slide 3Greenplum Open SourceGreenplum Database journey to Open SourcePivotalSlide 7Slide 8Slide 9Slide 10Slide 11Slide 12Slide 13Virtuous cycle: Apps, Data, Analytics.Slide 15Slide 16Slide 17