intro to hadoop
DESCRIPTION
This is an offline about Hadoop, organized by Contemi Vietnam. Our presenter are Quang Nguyen (Sebastian) and Hoang Le (Ethan). This event is co-ordinated by Phuong Dung (Keziah).TRANSCRIPT
INTRODUCTION TO HADOOPPresented by
Quang Nguyen & Hoang Le
CONTENT
• Introduction to Hadoop
• Scalability on AWS / Azure
• Reality
• First 100-hour Award
• Second 100-hour Overview
• Career Path
• Q & A
INTRODUCTION TO HADOOP
SCALABILITY ON AWS / AZURE
DISTRIBUTED SYSTEMS
MPI vs Hadoop
DRIVERIGHT PROJECT
• Target: collect driving experience by mobile application for analyzing driving habits
• Purposes:
• Improve driving ability
• Supply driver’s information to
needed companies
• Market: China
• Scale-out Problem:
• Millions of users with rich data resources (records in milliseconds)
• MySQL database is not efficient for Big Data Analytics
R - Python
Tableau
Mobile Apps
Mobile Platform
DRIVE-RIGHT
ARCHITECTURE
REALITY
FIRST 100-HOUR AWARD
SECOND 100-HOUR OVERVIEW
#101 – Java & IntelliJ setup#102 – Java programming part 1#103 – Java programming part 2#104 – Java programming part 3#105 – Java programming part 4#106 – Single-node Hadoop#107 – Multi-node Hadoop#108 – Map Reduce basis
#109 – Intro to Map Reduce programming
#110 – Map Reduce Design Pattern part 1
#111 – Map Reduce Design Pattern part 2
#112 – Apache Mahout 1 – Setting Up
#113 – Apache Mahout 2 –Building Recommenders
#114 – Apache Mahout 3 –Building Clustering Systems
#115 – Final Project II
PLAN FOR THE YEAR
Internship Program
Club
Sponsorship
Capable & young
data scientists
$$$
Effort
Smart students
Q & A