My Three Ex’s: A Data Science Approach for Applied Machine Learning

Upload: daniel-tunkelang

Post on 20-Aug-2015

Page 1: My Three Ex’s: A Data Science Approach for Applied Machine Learning

Recruiting Solutions

Daniel Tunkelang

My Three Ex’s: A Data Science Approach for Applied Machine Learning

Dedicated to 3 of my favorite ex co-workers.

First, a disclosure.

This isn’t a talk about machine learning.

It’s a talk about applying machine learning.

What’s the difference?

Let’s talk about something else for a moment.

Hash Tables

What you (need to) know about hash tables.

Theory Application

Class HashMap<K,V>

java.lang.Object
  java.util.AbstractMap<K,V>
    java.util.HashMap<K,V>

Type Parameters:
K - the type of keys maintained by this map
V - the type of mapped values

All Implemented Interfaces: Serializable, Cloneable, Map<K,V>
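
To make the "application" side concrete, here is a minimal sketch in Python, whose built-in dict is a hash table. The word-count task and the name count_words are illustrative assumptions, not part of the talk. The point is that using the structure well needs only its interface, not the theory of hash functions or collision resolution.

```python
def count_words(text):
    """Count word occurrences with dict, Python's built-in hash table."""
    counts = {}
    for word in text.lower().split():
        # Average-case constant-time lookup and insert; the caller never
        # has to think about hashing or collisions.
        counts[word] = counts.get(word, 0) + 1
    return counts


counts = count_words("express explain experiment express")
```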

Now let’s get back to machine learning!

Please allow me to introduce my three ex’s.

Express.

Explain.

Experiment.

Embrace the data science mindset.

Express: Understand your utility and inputs.

Explain: Understand your models and metrics.

Experiment: Optimize for the speed of learning.

Express.

How to train your machine learning model.

1. Define your objective function.

2. Collect training data.

3. Build models.

4. Profit!
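
The first three steps can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the objective is log loss, the model is logistic regression fit by gradient descent, and the four-point dataset is made up. None of these choices come from the talk.

```python
import math


def predict(weights, features):
    """Logistic model: probability that the label is 1."""
    z = sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


def log_loss(weights, data):
    """Step 1: the objective function (average negative log-likelihood)."""
    total = 0.0
    for features, label in data:
        p = min(max(predict(weights, features), 1e-12), 1 - 1e-12)
        total -= label * math.log(p) + (1 - label) * math.log(1 - p)
    return total / len(data)


def train(data, steps=500, learning_rate=0.5):
    """Step 3: fit the weights by gradient descent on the log loss."""
    weights = [0.0] * len(data[0][0])
    for _ in range(steps):
        grad = [0.0] * len(weights)
        for features, label in data:
            error = predict(weights, features) - label
            for i, x in enumerate(features):
                grad[i] += error * x / len(data)
        weights = [w - learning_rate * g for w, g in zip(weights, grad)]
    return weights


# Step 2: toy training data, ([bias, feature], label). Entirely made up.
data = [([1.0, 0.0], 0), ([1.0, 1.0], 0), ([1.0, 2.0], 1), ([1.0, 3.0], 1)]

initial_loss = log_loss([0.0, 0.0], data)
weights = train(data)
final_loss = log_loss(weights, data)
```

Every line of the sketch encodes a decision (which objective, which data, which model), and the rest of this section is about making those decisions deliberately.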

You can only improve what you measure.

Clicks?

Actions?

Outcomes?

Be careful how you define precision…
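
A small sketch of why the definition matters: the same result list yields very different precision depending on what counts as a positive. The fields clicked and contacted are hypothetical stand-ins for a click and a downstream outcome, and the data is invented for illustration.

```python
def precision(results, is_positive):
    """Fraction of returned results that count as positive."""
    if not results:
        return 0.0
    return sum(1 for r in results if is_positive(r)) / len(results)


# Hypothetical log of four returned results.
results = [
    {"clicked": True, "contacted": True},
    {"clicked": True, "contacted": False},
    {"clicked": True, "contacted": False},
    {"clicked": False, "contacted": False},
]

click_precision = precision(results, lambda r: r["clicked"])      # 3 of 4
contact_precision = precision(results, lambda r: r["contacted"])  # 1 of 4
```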

Account for non-uniform inputs and costs.
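
One way to account for non-uniform costs, sketched under the assumption that each example carries a known cost of being wrong (the costs and the name weighted_error are illustrative): weight each mistake by its cost instead of counting all mistakes equally.

```python
def error_rate(examples):
    """Unweighted: every mistake counts the same."""
    wrong = sum(1 for predicted, actual, _ in examples if predicted != actual)
    return wrong / len(examples)


def weighted_error(examples):
    """Cost-weighted: each mistake counts in proportion to its cost."""
    total_cost = sum(cost for _, _, cost in examples)
    lost = sum(cost for predicted, actual, cost in examples if predicted != actual)
    return lost / total_cost


# (predicted, actual, cost of an error on this example); made-up values.
examples = [(1, 1, 1.0), (0, 0, 1.0), (1, 0, 1.0), (0, 1, 5.0)]

unweighted = error_rate(examples)    # 2 of 4 examples are wrong
weighted = weighted_error(examples)  # 6.0 of 8.0 total cost is lost
```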

Stratified sampling is your friend.
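
A minimal sketch of stratified sampling using only the standard library. The "head" and "tail" strata and the function name are assumptions for illustration: a uniform sample of 10 from this population would usually contain almost no tail queries, while sampling per stratum guarantees both are represented.

```python
import random
from collections import defaultdict


def stratified_sample(items, key, per_stratum, seed=0):
    """Draw up to per_stratum items from each stratum defined by key(item)."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for item in items:
        strata[key(item)].append(item)
    sample = []
    for name in sorted(strata):
        members = strata[name]
        sample.extend(rng.sample(members, min(per_stratum, len(members))))
    return sample


# Hypothetical population: 95 frequent ("head") and 5 rare ("tail") queries.
queries = [("head", i) for i in range(95)] + [("tail", i) for i in range(5)]
sample = stratified_sample(queries, key=lambda q: q[0], per_stratum=5)
```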

An example of segmenting models.

Searcher: Recruiter, Query: Person Name

Searcher: Job Seeker, Query: Person Name

Searcher: Recruiter, Query: Job Title

Searcher: Job Seeker, Query: Job Title
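
The four segments above could be served by separate models selected by a (searcher, query type) key. The sketch below assumes simple linear scorers; every feature name and weight is hypothetical and chosen only to show the dispatch.

```python
# One weight table per segment; all names and values are hypothetical.
SEGMENT_MODELS = {
    ("recruiter", "person_name"): {"name_match": 1.0, "connections": 0.2},
    ("job_seeker", "person_name"): {"name_match": 1.0, "connections": 0.8},
    ("recruiter", "job_title"): {"title_match": 1.0, "open_to_offers": 0.5},
    ("job_seeker", "job_title"): {"title_match": 1.0, "is_hiring": 0.5},
}


def score(searcher, query_type, features):
    """Score a result with the model for this searcher/query segment."""
    weights = SEGMENT_MODELS[(searcher, query_type)]
    return sum(weights.get(name, 0.0) * value for name, value in features.items())


features = {"name_match": 1.0, "connections": 0.5}
recruiter_score = score("recruiter", "person_name", features)
seeker_score = score("job_seeker", "person_name", features)
```

The same features produce different scores per segment, which is the reason to segment in the first place.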

Express yourself in your feature vectors.

Express: Summary.

Choose an objective function that models utility.

Be careful how you define precision.

Account for non-uniform inputs and costs.

Stratified sampling is your friend.

Express yourself in your feature vectors.

Explain.

With apologies to the little prince.

Everyone is talking about Deep Learning.

But accuracy isn’t everything.

Explainable models, explainable features.

Less is more when it comes to explainability.

Algorithms can protect you from overfitting, but they can’t protect you from the biases you introduce.

Introspection into your models and features makes it easier for you and others to debug them.

Especially if you don’t completely trust your objective function or the representativeness of your training data.

Linear regression? Decision trees?

Linear regression and decision trees favor explainability over accuracy, compared to more sophisticated models.

But size matters. If you have too many features or too deep a decision tree, you lose explainability.

You can always upgrade to a more sophisticated model when you trust your objective function and training data.

Building a machine learning model is an iterative process. Optimize for the speed of your own learning.
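
As a rough illustration of why a small linear model is easy to explain, the sketch below breaks a score into per-feature contributions that can be read and debugged directly. The feature names and weights are hypothetical, and explain is an illustrative helper, not something from the talk.

```python
def explain(weights, features):
    """Return the linear score and each feature's contribution, largest first."""
    contributions = {
        name: weights.get(name, 0.0) * value for name, value in features.items()
    }
    ranked = sorted(contributions.items(), key=lambda kv: abs(kv[1]), reverse=True)
    return sum(contributions.values()), ranked


# Hypothetical three-feature model and one example to score.
weights = {"title_match": 2.0, "years_experience": 0.1, "profile_views": -0.5}
features = {"title_match": 1.0, "years_experience": 10.0, "profile_views": 1.0}

total, ranked = explain(weights, features)
```

With three features the ranked list fits on one line. With thousands it no longer does, which is the "size matters" point above.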

Explain: Summary.

Accuracy isn’t everything.

Less is more when it comes to explainability.

Don’t knock linear models and decision trees!

Start with simple models, then upgrade.

Experiment.

Why experiments matter.

“You have to kiss a lot of frogs to find one prince. So how can you find your prince faster?

By finding more frogs and kissing them faster and faster.”

-- Mike Moran

Life in the age of big data.

Yesterday: Experiments are expensive, choose hypotheses wisely.

Today: Experiments are cheap, do as many as you can!

So should we just test everything?

Optimize for the speed of learning.

Be disciplined: test one variable at a time.

• Autocomplete
• Entity Tagging
• Vertical Intent
• # of Suggestions
• Suggestion Order
• Language
• Query Construction
• Ranking Model
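
One way to enforce this discipline, sketched with hypothetical settings: generate experiment variants from a baseline configuration so that each variant differs from the baseline in exactly one factor. The factor names echo the list above, but the values and the function name are assumptions.

```python
def one_factor_variants(baseline, alternatives):
    """Build variants that each change exactly one factor of the baseline."""
    variants = []
    for factor, values in alternatives.items():
        for value in values:
            if value == baseline[factor]:
                continue  # identical to the baseline, so nothing to test
            variant = dict(baseline)
            variant[factor] = value
            variants.append((factor, variant))
    return variants


# Hypothetical baseline and the alternative values to try per factor.
baseline = {"autocomplete": "on", "num_suggestions": 5, "ranking_model": "v1"}
alternatives = {"num_suggestions": [3, 5, 10], "ranking_model": ["v1", "v2"]}

variants = one_factor_variants(baseline, alternatives)
```

Because each variant changes a single factor, any difference from the baseline can be attributed to that factor.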

Experiment: Summary.

Kiss lots of frogs: experiments are cheap.

But test in good faith – don’t just flip coins.

Optimize for the speed of learning.

Be disciplined: test one variable at a time.

Bringing it all together.

Express: Understand your utility and inputs.

Explain: Understand your models and metrics.

Experiment: Optimize for the speed of learning.

Daniel Tunkelang
[email protected]
linkedin.com/in/dtunkelang