hadoop vs mongodb
DESCRIPTION
Hadoop vs MongoDB performanceTRANSCRIPT
![Page 1: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/1.jpg)
Performance Evaluation of a MongoDB and Hadoop Platform for Scientific Data Analysis
E. Dede, M.Govindaru (University Binghamton), D.Gunter, R. Canon, L. Ramakrishnan (Lawrence Berekely National Lab)
http://dl.acm.org/citation.cfm?id=2465849
![Page 2: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/2.jpg)
Introduction
MongoDB : NoSQL DBMSHadoop : Framework -> Big Data ProcessingHDFS : Hadoop FileSystemMapReduce : for scalable Parallel Analysis
![Page 3: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/3.jpg)
MotivationAdvanced Light Source (ALS) & Joint Genome Institute have project -> Materials Project.Now ALS & Materials Projects use MongoDB+MapReduceHadoop -> most popular FOSS implementation of MapReduce.How if we make a comparison of both ->
evaluate the performancescalabilityfault tolerance
![Page 4: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/4.jpg)
System OverviewNow the ALS have system with native MogoDB+MapReduce, and they want to know and compare about the Hadoop MongoDB vs (native Hadoop)Hadoop HDFS
![Page 5: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/5.jpg)
Experiment Setup● Hadoop Setup using Standard Setup● Data: US Census data 300 GB● MongoDB : run with 2 mode, single and sharding● Hopper Machine : Cray XE6 with 153,216 compute
cores, 217 TB RAM, 2 PetaBytes Disk, Each Node has 24 cores, 2 twelve-core AMD 2.1 GHz processor and 32 GB RAM.
![Page 6: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/6.jpg)
Result
Evaluate PerformanceScalabilityFault Tolerance
![Page 7: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/7.jpg)
Evaluate Performance
![Page 8: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/8.jpg)
Scalability 37.2 milion input records
![Page 9: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/9.jpg)
Fault Tolerancetotal 32 node processing 37 milion input records.
Hadoop after 8 faulted nodes Hadoop loses too many data nodes and fails to complete the MapReduce Job.
MogoDB-Hadoop can finish job even with half of the cluster lost.
![Page 10: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/10.jpg)
Conclusion & Contribution
Case -> Scientific Big Data Processing● HDFS-Hadoop is better than MongoDB for Read and
Write Performance.● HDFS-Hadoop is better than MongoDB-hadoop for
Scalability case.● MongoDB-hadoop is better than HDFS-Hadoop for
Fault Tolerance.
![Page 11: Hadoop vs MongoDB](https://reader036.vdocuments.mx/reader036/viewer/2022081809/55cf9874550346d03397bb88/html5/thumbnails/11.jpg)
Thank you
Any Questions?