what is hadoop & its use cases-promtpcloud

22

Upload: promptcloud

Post on 01-Nov-2014

415 views

Category:

Data & Analytics


3 download

DESCRIPTION

Get a bird's eye view of Hadoop-the Big Data analytics software. See how Big Data analytics helps enterprises via Hadoop.

TRANSCRIPT

Page 1: What is Hadoop & its Use cases-PromtpCloud
Page 2: What is Hadoop & its Use cases-PromtpCloud

HADOOP THE SIGNIFICANT BIG DATA SOFTWARE

Page 3: What is Hadoop & its Use cases-PromtpCloud

The Apache™ Hadoop® project

• Open-source software for reliable, scalable, distributed computing

• Allows distributed processing of large data sets across clusters of computers

Designed for:

• Scale up from single servers to thousands of machines, each offering local computation and storage

• To detect and handle failures at the application layer

• Delivering a highly-available service on top of a cluster of computers

Page 4: What is Hadoop & its Use cases-PromtpCloud

• Hadoop Common (utilities) that support Hadoop models

• Hadoop Distributed File System (HDFS) for high-throughput access to application data

• Hadoop YARN for job scheduling and cluster resource management

• Hadoop MapReduce for parallel processing of large data-sets

Modules designed assuming hardware failures

should be handled by framework

The Apache™ Hadoop® project

Page 5: What is Hadoop & its Use cases-PromtpCloud

RELATED PROJECTS

Page 6: What is Hadoop & its Use cases-PromtpCloud

RELATED PROJECTS

• Ambari™

A tool for provisioning, managing, and monitoring Apache Hadoop clusters

• Avro™

A data serialization system

• Cassandra™

A scalable multi-master database with no single points of failure

• Chukwa™

A data collection system for managing large distributed systems

Page 7: What is Hadoop & its Use cases-PromtpCloud

• HBase™A scalable, distributed database supporting structured data storage for large tables

• Hive™A data warehouse infrastructure for data summarization and ad hoc querying

• Mahout™A scalable machine learning and data mining library

• Pig™A high-level data-flow language and execution framework for parallel computation

RELATED PROJECTS

Page 8: What is Hadoop & its Use cases-PromtpCloud

• Spark™A fast and general compute engine for Hadoop data

• Tez™A generalized data-flow programming framework providing a powerful and flexible engine to execute data processing for both batch and interactive use-cases

• ZooKeeper™A high-performance coordination service for distributed applications

RELATED PROJECTS

Page 9: What is Hadoop & its Use cases-PromtpCloud

Hadoop is useful because…

BIG DATA STORAGE

FAST PROCESSING

BETTER RESULTS & INSIGHTS

Page 10: What is Hadoop & its Use cases-PromtpCloud

Hadoop is Big Data software that…

best meets industry needs

allows movement of large volumes of complex and relational data into a single repository

is affordable storage and retrieval for analytic applications

makes raw data always available

simultaneously processes Big Data divided into multiple parts

Page 11: What is Hadoop & its Use cases-PromtpCloud

Hadoop Uses

PUBLIC HEALTH PRODUCT

DEVELOPMENT

R&D STOCK & COMMODITIES TRADING

SALES & MARKETING

Page 12: What is Hadoop & its Use cases-PromtpCloud

The Hadoop Advantage…

Insights from everywhere, any where

• Hadoop can handle all types of data :

structured | unstructured | log files | pictures |audio files |communications records |email

• No prior need for a schema

• Lets you decide query later

• Makes all data useable, not just database

Page 13: What is Hadoop & its Use cases-PromtpCloud

The Hadoop Advantage…

Economics of everything online

• Legacy systems are far too expensive for general use with large data-sets

• Hadoop relies on internally redundant data. Storing data not previously viable is possible

• Keep data for real-time interactive querying, business intelligence, analysis and visualization

Page 14: What is Hadoop & its Use cases-PromtpCloud

The Hadoop Advantage…

Streamline Data Usage

• Unstructured data accounts for 90% of the data

• Data storage, management and analytics must be re-looked at

• Legacy systems will complement Hadoop-optimized data management

• Hadoop is cost-effective, scalable, and provided streamlined architecture

Page 15: What is Hadoop & its Use cases-PromtpCloud

USE CASES

FOR BUSINESS ENTERPRISES

Page 16: What is Hadoop & its Use cases-PromtpCloud

Hadoop helps

DATA PROCESSING

• extract, transform, and load (ETL) data from source systems

• to transfer data stored in Hadoop to and from a database management

• batch process large quantities of unstructured and semi-structured data

NETWORK MANAGEMENT

• capture, analyze, and display data collected from servers, storage devices, and other IT hardware

• monitor network activity and diagnose bottleneck and other issues

Page 17: What is Hadoop & its Use cases-PromtpCloud

RETAIL FRAUD

• monitor, model, and analyze high volumes of data from transactions

• extract features and patterns, retailers can help prevent credit card account fraud

RECOMMENDATION TOOL

• match and recommend users to one another

• compare products and services based on analysis of user profiles and behavioral data

Hadoop helps

Page 18: What is Hadoop & its Use cases-PromtpCloud

SENTIMENT ANALYSIS

• advanced text analytics tools analyze unstructured text of social media

• tweets and Facebook posts determine user sentiment related to particular companies, brands, or products

FINANCIAL RISK MODELING

• analysis of large volumes of transactional data to determine risk and exposure of financial assets,

• prepare for potential "what-if" scenarios based on simulated market behavior

• due diligence tasks

• rate potential clients for risk

Hadoop helps

Page 19: What is Hadoop & its Use cases-PromtpCloud

MARKETING CAMPAIGN ANALYSIS

• monitor and determine the effectiveness of marketing campaigns

• increase the accuracy of analysis by incorporating higher volumes of detailed data

CUSTOMER INFLUENCER ANALYSIS

• Mine social networking data for mapping customer influence over others

• help enterprises determine customers most important and influential for focused marketing

Hadoop helps

Page 20: What is Hadoop & its Use cases-PromtpCloud

CUSTOMER EXPERIENCE ANALYSIS

• integrate data from previously siloed customer interaction channels

• understand impact of customer interaction to optimize customer lifecycle experience

RESEARCH & DEVELOPMENT

• comb through volumes of text-based research and historical data to support development of new products

Hadoop helps

Page 21: What is Hadoop & its Use cases-PromtpCloud

Hadoop provides a solid foundation on which to build critical big data solutions.

As a tool, using it the right way from the very beginning can help ensure success.

Page 22: What is Hadoop & its Use cases-PromtpCloud

Visit our blog for more interesting articles on Big Data, Crawling &

Extraction, and Analytics