
Page 1:

“Evaluating MapReduce for Multi-core and Multiprocessor Systems”

Colby Ranger, Ramanan Raghuraman, Arun Penmetsa, Gary Bradski, Christos Kozyrakis

Computer Systems Laboratory, Stanford University

Presented by JP Cafaro, ECE 259 / CPS 221

Page 2:

Introduction to MapReduce

• MapReduce is a programming model created by Google to automatically parallelize and distribute computation across thousands of commodity servers.

• It lets the programmer write simple, functional-style code (a map function and a reduce function) without worrying about the low-level parallelization under the hood.

• It works by taking the input data and mapping it to intermediate <key,value> pairs. Disjoint portions of the input can be processed in parallel.

• The intermediate pairs are then reduced to produce the final output, and this step can also run in parallel (see the word-count sketch below).
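
To make the model concrete, here is a minimal word-count sketch in C. The emit_intermediate() and emit() hooks and the exact function signatures are illustrative assumptions about what a MapReduce runtime might provide, not the actual Google or Phoenix API.

    #include <string.h>

    /* Hypothetical runtime hooks (assumed, not a real library API):
     * collect an intermediate pair / a final output pair. */
    void emit_intermediate(const char *key, long value);
    void emit(const char *key, long value);

    /* Map: split one chunk of the input text into words, emitting <word, 1>.
     * Many chunks can be mapped in parallel because they are disjoint. */
    void wc_map(char *chunk) {
        char *save;
        for (char *w = strtok_r(chunk, " \t\n", &save); w != NULL;
             w = strtok_r(NULL, " \t\n", &save))
            emit_intermediate(w, 1);
    }

    /* Reduce: sum every count the runtime grouped under one word.
     * Different words can be reduced in parallel. */
    void wc_reduce(const char *word, const long *counts, int n) {
        long total = 0;
        for (int i = 0; i < n; i++)
            total += counts[i];
        emit(word, total);
    }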

Page 3:

Proposal and Features

• Google's MapReduce targets thousands of distributed machines and relies on remote file accesses. The researchers instead built Phoenix, an implementation of MapReduce for shared-memory systems (multi-core chips and symmetric multiprocessors).

• Phoenix dynamically spawns worker threads, taking into account the number of cores, hardware threads per core, system load, etc.

• Its other features include work stealing/load balancing, prefetching, task-granularity tuning, and fault tolerance.

• It handles most of the low-level details automatically, presenting a simple programming model that greatly improves programmer productivity (a rough sketch of the dynamic scheduling idea follows).
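
The paper describes these runtime mechanisms only at a high level. As a rough illustration of dynamic load balancing, here is a minimal Pthreads sketch in C in which workers pull map tasks from a shared counter, so faster threads naturally end up doing more of the work. This is a simplified assumption about the approach, not the actual Phoenix scheduler, which also sizes its thread pool to the machine and recovers from failed tasks.

    #include <pthread.h>

    #define NUM_TASKS   64   /* number of map task chunks (arbitrary) */
    #define NUM_WORKERS  4   /* in practice, sized to the core count */

    static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;
    static int next_task = 0;

    static void run_map_task(int id) {
        /* placeholder for one map invocation on input chunk `id` */
        (void)id;
    }

    /* Workers grab the next task index under a lock, so the load
     * balances dynamically: an idle thread immediately takes more work. */
    static void *worker(void *arg) {
        (void)arg;
        for (;;) {
            pthread_mutex_lock(&lock);
            int id = (next_task < NUM_TASKS) ? next_task++ : -1;
            pthread_mutex_unlock(&lock);
            if (id < 0)
                return NULL;          /* task queue drained */
            run_map_task(id);
        }
    }

    int main(void) {
        pthread_t threads[NUM_WORKERS];
        for (int i = 0; i < NUM_WORKERS; i++)
            pthread_create(&threads[i], NULL, worker, NULL);
        for (int i = 0; i < NUM_WORKERS; i++)
            pthread_join(threads[i], NULL);
        return 0;
    }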

Page 4:

Benchmark and Results

• The researchers used a number of parallelizable programs as benchmarks, including word count, matrix multiply, and reverse index.

• Speedups were measured against sequential versions of the same code (see the note after this list).

• In all cases, the MapReduce implementation outperformed the sequential version.

• In some cases, however, the overhead introduced by Phoenix made it less efficient than a hand-coded, low-level Pthreads implementation.
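
For reference, speedup here is the usual ratio speedup = T_sequential / T_parallel; for example, a Phoenix run that finishes in 2 seconds against a 16-second sequential baseline is an 8x speedup.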

Page 5:

Questions

• The main question is the tradeoff between programming simplicity and performance.

• The low-level Pthreads implementation did not use dynamic scheduling because of the programming complexity involved, even though adding it would probably have made Phoenix look less attractive from a performance standpoint.

• Are we giving up too much to make programmers’ lives easier?

• How many types of applications can we use this MapReduce implementation on?

• Are there other programming models, similar to MapReduce, that we could fit to other problem types?

Page 6:

Conclusions

• MapReduce/Phoenix can be very useful for algorithms that map naturally onto this programming model, as the results show.

• Programs that the model doesn't naturally suit see smaller speedups; there, the overhead introduced by Phoenix makes alternatives such as a lower-level Pthreads implementation perform better.

• Overall, this model is extremely simple, and techniques like MapReduce that parallelize code automatically are important to think about as we figure out how to write software for machines with ever more cores.