Hadoop ACM presentation
DESCRIPTION
Microsoft Hadoop presentation for the ACM Data Mining Hackathon competition.
TRANSCRIPT
![Page 1: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/1.jpg)
Hadoop and Microsoft.
Brad Sarsfield | Senior Software Engineer @bradoop
![Page 2: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/2.jpg)
Agenda
• BIG DATA
• HADOOP
• MICROSOFT & HADOOP
![Page 3: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/3.jpg)
How Big is Big Data?
![Page 4: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/4.jpg)
It’s all about your Big Data Problems
Big Problems
![Page 5: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/5.jpg)
Hadoop is for Big Data.
![Page 6: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/6.jpg)
Data is the Platform.
![Page 7: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/7.jpg)
Hadoop Data Science.
![Page 8: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/8.jpg)
Hadoop Capabilities.
Machine Learning
Graph Processing
Distributed Compute
Extract Load Transform
Predictive Analysis
![Page 9: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/9.jpg)
Hadoop architecture.
Distributed Storage (HDFS)
Distributed Processing (MapReduce)
Query (Hive)
Scripting (Pig)
NoSQL Database (HBase)
Metadata (HCatalog)
Data Integration (ODBC / SQOOP / REST)
Business Intelligence (Excel, PowerView …)
Machine Learning (Mahout)
Graph (Pegasus)
Stats processing (RHadoop)
Pipeline / workflow (Oozie)
Log file aggregation (Flume)
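As a rough illustration of the Distributed Processing (MapReduce) layer named on the architecture slide, here is a minimal single-process sketch of the map, shuffle, and reduce phases using word count. It is not from the presentation and does not use any Hadoop API; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical, and on a real cluster the framework would run the mappers and reducers in parallel across nodes over data stored in HDFS.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group emitted values by key (done by the framework in Hadoop)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence, in one process, on a list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Sample input reuses two slide titles from this deck.
    sample = ["Hadoop is for Big Data", "Data is the Platform"]
    print(word_count(sample))
```

The mapper and reducer are kept as pure functions with no shared state, which is what would let a framework such as Hadoop distribute them across machines.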
![Page 10: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/10.jpg)
Hadoop and Microsoft.
We are delivering
• Apache Hadoop on Windows Server
• Apache Hadoop on Windows Azure
Big engineering investment
• Big Data Business Intelligence tooling
• Big Data Apache Hadoop
• Big Data Parallel Data Warehouse
Open source commitment
• Apache Software Foundation
• Hortonworks Partnership
![Page 11: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/11.jpg)
Microsoft Hadoop Vision.
Microsoft Business Intelligence (BI)
• ODBC Connectivity
Better on Windows and Azure
• Active Directory
• System Center
Microsoft Data Connectivity
• SQL Server / SQL Parallel Data Warehouse
• Azure Storage / Azure Data Market
![Page 12: Hadoop acm presentation](https://reader035.vdocuments.mx/reader035/viewer/2022062513/556617cdd8b42a06318b50a6/html5/thumbnails/12.jpg)
ACM Hackathon.
Hadoop on Azure demo
Free Hadoop on Azure
• Code: acmhackathon
Free 30-day Azure account
• No credit card
• 750h small compute / 35 GB storage
• Email [email protected] for code