get started with big data discovery

Post on 15-Dec-2015

  • Oracle Big Data Discovery Getting Started Guide

    Version 1.1.0 August 2015

  • Copyright and disclaimer

    Copyright 2003, 2015, Oracle and/or its affiliates. All rights reserved.

    Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. UNIX is a registered trademark of The Open Group.

    This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited.

    The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing.

    If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable:

    U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.

    This software or hardware is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous applications, including applications that may create a risk of personal injury. If you use this software or hardware in dangerous applications, then you shall be responsible to take all appropriate fail-safe, backup, redundancy, and other measures to ensure its safe use. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this software or hardware in dangerous applications.

    This software or hardware and documentation may provide access to or information on content, products and services from third parties. Oracle Corporation and its affiliates are not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content, products, or services.

    Oracle Big Data Discovery: Getting Started Guide Version 1.1.0 August 2015

  • Table of Contents

    Copyright and disclaimer

    Preface
      About this guide
      Audience
      Conventions
      Contacting Oracle Customer Support

    Chapter 1: Welcome to Big Data Discovery
      Value of using Big Data Discovery
      Addressing your goals and needs
      Results of working with Big Data Discovery
      About Studio
      About Data Processing
      About the Dgraph

    Chapter 2: Taking a Tour of Studio
      Workflow in Studio
      Finding data sets
      Profiling and Enriching Data
      Using Explore
      Using Transform
      Using Discover
      Using filtering, Guided Navigation, and Search

    Chapter 3: Data Sets in Big Data Discovery
      About data sets
      Data sets in Catalog vs data sets in projects
      Data Set Manager
      Sampled and full data sets
      Data set lifecycle in Studio
      Projects and applications in Big Data Discovery
      Data set access, access to projects, and user roles

    Chapter 4: Data Loading and Updates
      Data loading options
      Data loading and sample size
      Data update options
      Studio-loaded files: data update diagram
      DP CLI-loaded files: data update diagram

    Chapter 5: Where You Go from Here
      Quick reference to areas of interest

  • Preface

    Oracle Big Data Discovery is a set of end-to-end visual analytic capabilities that leverage the power of Hadoop to transform raw data into business insight in minutes, without the need to learn complex products or rely only on highly skilled resources.

    About this guide
    This guide introduces Oracle Big Data Discovery and shows you how to use the product for your needs, from loading data to exploring, transforming, and updating it.
    Along the way, the guide introduces key terms, components, and user interfaces in Big Data Discovery. This guide is complementary to the Getting Started video series.

    Audience
    This guide is for business analysts, data scientists, and data engineers who work with Hadoop. Also, if you are interested in working with big data, this guide is for you.

    Conventions
    The following conventions are used in this document.

    Typographic conventions
    The following table describes the typographic conventions used in this document.

    Typeface                 Meaning
    User Interface Elements  This formatting is used for graphical user interface elements such as pages, dialog boxes, buttons, and fields.
    Code Sample              This formatting is used for sample code segments within a paragraph.
    Variable                 This formatting is used for variable values. For variables within a code sample, the formatting is Variable.
    File Path                This formatting is used for file names and paths.

    Symbol conventions
    The following table describes symbol conventions used in this document.

    Symbol       >
    Description  The right angle bracket, or greater-than sign, indicates menu item selections in a graphic user interface.
    Example      File > New > Project
    Meaning      From the File menu, choose New, then from the New submenu, choose Project.

    Path variable conventions
    This table describes the path variable conventions used in this document.

    Path variable  Meaning
    $MW_HOME       Indicates the absolute path to your Oracle Middleware home directory, which is the root directory for your WebLogic installation.
    $DOMAIN_HOME   Indicates the absolute path to your WebLogic domain home directory. For example, if bdd_domain is the domain name, then the $DOMAIN_HOME value is the $MW_HOME/user_projects/domains/bdd_domain directory.
    $BDD_HOME      Indicates the absolute path to your Oracle Big Data Discovery home directory. For example, if BDD1.1 is the name you specified for the Oracle Big Data Discovery installation, then the $BDD_HOME value is the $MW_HOME/BDD1.1 directory.
    $DGRAPH_HOME   Indicates the absolute path to your Dgraph home directory. For example, the $DGRAPH_HOME value might be the $BDD_HOME/dgraph directory.
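
    The relationship between these path variables can be sketched in a few lines of Python. This is only an illustration, not part of the product: the helper name bdd_paths and the Middleware home /opt/oracle/middleware are hypothetical, the domain and installation names are the example values from the table, and the Dgraph location is the one the table says the value "might be".

```python
import posixpath


def bdd_paths(mw_home, domain_name="bdd_domain", install_name="BDD1.1"):
    """Derive the guide's example path variables from a Middleware home.

    Hypothetical helper for illustration; the defaults are the example
    names used in the path variable table.
    """
    bdd_home = posixpath.join(mw_home, install_name)
    return {
        "MW_HOME": mw_home,
        # $DOMAIN_HOME = $MW_HOME/user_projects/domains/<domain name>
        "DOMAIN_HOME": posixpath.join(
            mw_home, "user_projects", "domains", domain_name
        ),
        # $BDD_HOME = $MW_HOME/<installation name>
        "BDD_HOME": bdd_home,
        # $DGRAPH_HOME may differ on a real system; this is the table's example
        "DGRAPH_HOME": posixpath.join(bdd_home, "dgraph"),
    }


if __name__ == "__main__":
    # /opt/oracle/middleware is an assumed location, not a documented default.
    for name, value in bdd_paths("/opt/oracle/middleware").items():
        print(f"${name} = {value}")
```

    On a real installation, read the actual values from your environment rather than deriving them, since the domain name, installation name, and Dgraph directory are chosen at install time.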

    Contacting Oracle Customer Support
    Oracle customers who have purchased support have access to electronic support through My Oracle Support. This includes important information regarding Oracle software, implementation questions, product and solution help, as well as overall news and updates from Oracle.
    You can contact Oracle Customer Support through Oracle's Support portal, My Oracle Support, at https://support.oracle.com.

  • Chapter 1: Welcome to Big Data Discovery

    This section introduces Big Data Discovery (BDD), and includes overviews of its components.

    Value of using Big Data Discovery
    Addressing your goal