midterm e xam review

8
Midterm Exam Review

Upload: langer

Post on 22-Feb-2016

42 views

Category:

Documents


0 download

DESCRIPTION

Midterm E xam Review. General Information. Date: 3/13/2014 Time: 11-12.20 Location: 101 Davis Closed book, closed notes. Topics. Doing data science text: Ch.2 Statistical inference, exploratory data analysis, and data science process Population and samples, sample sizes Data model - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Midterm  E xam Review

Midterm Exam Review

Page 2: Midterm  E xam Review

General Information

• Date: 3/13/2014• Time: 11-12.20• Location: 101 Davis• Closed book, closed notes

Page 3: Midterm  E xam Review

Topics

• Doing data science text: Ch.2 – Statistical inference, exploratory data analysis, and data

science process– Population and samples, sample sizes– Data model

• Statistical model• Algorithms

– Fitting a model– Probability distributions– EDA: plots, graphs and summaries

• One question

Page 4: Midterm  E xam Review

Topics (contd.)• Doing data science: Ch. 3• Comparison of algorithms and stat models• Three basic algorithms

– Linear regression– K-NN (semi-supervised.. Classification)– K-means (unsupervised clustering)

• Intuitive idea • Algorithmic steps for each of these algorithms• Representative examples• Why and when would you use each of these algorithms?• 2 questions

Page 5: Midterm  E xam Review

Topics: Lin & Dyer’s text

• Hadoop: HDFS as in Chapter 2• MapReduce: MR data-flow including

combiners and partitioners• 2 questions

Page 6: Midterm  E xam Review

Bloomberg Tech Talk on ML

• Building Intelligent solution• See the presentation• Up to slide#16 (No NLP or MT)• 1 question

Page 7: Midterm  E xam Review

Format

• 5 questions not equally weighed• HDFS: direct• Ch.2 dds: direct• MR and K-NN: little tricky• K-means: direct• Questions will test your understanding of the

concepts• Example: what is the effect of large K vs smaller K in

K-NN?

Page 8: Midterm  E xam Review

Seating for the exam

• Question, space for answer format• Designated seating: Will let you know the plan