intro to hadoop

Post on 05-Dec-2014

346 Views

Category:

Technology

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

This is an offline about Hadoop, organized by Contemi Vietnam. Our presenter are Quang Nguyen (Sebastian) and Hoang Le (Ethan). This event is co-ordinated by Phuong Dung (Keziah).

TRANSCRIPT

INTRODUCTION TO HADOOPPresented by

Quang Nguyen & Hoang Le

CONTENT

• Introduction to Hadoop

• Scalability on AWS / Azure

• Reality

• First 100-hour Award

• Second 100-hour Overview

• Career Path

• Q & A

INTRODUCTION TO HADOOP

SCALABILITY ON AWS / AZURE

DISTRIBUTED SYSTEMS

MPI vs Hadoop

DRIVERIGHT PROJECT

• Target: collect driving experience by mobile application for analyzing driving habits

• Purposes:

• Improve driving ability

• Supply driver’s information to

needed companies

• Market: China

• Scale-out Problem:

• Millions of users with rich data resources (records in milliseconds)

• MySQL database is not efficient for Big Data Analytics

R - Python

Tableau

Mobile Apps

Mobile Platform

DRIVE-RIGHT

ARCHITECTURE

REALITY

FIRST 100-HOUR AWARD

SECOND 100-HOUR OVERVIEW

#101 – Java & IntelliJ setup#102 – Java programming part 1#103 – Java programming part 2#104 – Java programming part 3#105 – Java programming part 4#106 – Single-node Hadoop#107 – Multi-node Hadoop#108 – Map Reduce basis

#109 – Intro to Map Reduce programming

#110 – Map Reduce Design Pattern part 1

#111 – Map Reduce Design Pattern part 2

#112 – Apache Mahout 1 – Setting Up

#113 – Apache Mahout 2 –Building Recommenders

#114 – Apache Mahout 3 –Building Clustering Systems

#115 – Final Project II

PLAN FOR THE YEAR

Internship Program

Club

Sponsorship

Capable & young

data scientists

$$$

Effort

Smart students

Q & A

top related