webinar on deep learning's impact on nlp - amazon s3deep+learning+-+dec...why deep learning is...

40
Webinar on Deep Learning's Impact on NLP Kfir Bar Chief Scientist

Upload: others

Post on 22-May-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Webinar on Deep Learning's Impact on NLP

Kfir BarChief Scientist

Page 2: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Who is Kfir Bar and what does Basis Tech do?

AI, Machine Learning, Deep Learning, Text Analytics...

"With industry leading NLP tools we enable Smart Search, Advanced Name Matching and Identity Analytics.”

Page 3: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Visit our website

www.rosette.com

Page 4: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Deep Learning's Impact on NLP: In 30 Minutes

Kfir BarChief Scientist

Page 5: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Why deep learning is so hyped?

“deep learning”

Page 6: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Deep learning papers in top NLP conferences

https://arxiv.org/pdf/1708.02709.pdf

Page 7: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Why deep learning is so hyped?

1. Data

2. Computational power

3. Algorithms

4. Rebranding

Page 8: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Why deep learning is so hyped?

1. Data

2. Computational power

3. Algorithms

4. Rebranding

Deep learning algorithms get better with more data

Amount of data

Traditional Machine Learning

Perf

orm

ance

Deep Learning

Page 9: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms
Page 10: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Why deep learning is so hyped?

1. Data

2. Computational power

3. Algorithms

4. Rebranding

➔ Deep learning algorithms require significantly more computational power

➔ GPUs become available at reasonable prices

Page 11: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Why deep learning is so hyped?

1. Data

2. Computational power

3. Algorithms

4. Rebranding

➔ Recent advances in training procedures make deep learning a feasible device

Page 12: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Why deep learning is so hyped?

1. Data

2. Computational power

3. Algorithms

4. Rebranding

1955

Perceptron1970

Artificial Neural Network

2010

Deep learning

Page 13: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Advantages of using Deep Learning

1. Deep Learning models outperform nearly every other machine learning algorithms

2. They don’t require feature engineering

Page 14: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Deep Learning models outperform nearly every other machine learning algorithms

https://www.dsiac.org/resources/journals/dsiac/winter-2017-volume-4-number-1/real-time-situ-intelligent-video-analytics

Page 15: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Traditional ML vs. Deep LearningCongratulations to @Cristiano for winning the "Best International Soccer Player" award at the 2018 @ESPYS!

words, part of speech tags, lemmas, brown clusters

[00010010110000101001…..001]

SPORTS

Feature extraction

Vectorization

Modeling

Embeddings lookup

[0.323, -0.3434, 0.901, …, -0.267][-0.4923, 0.554, 0.001, …, -0.365]

[1.58845, 0.478, 0.0901, …, -0.171]…

[-0.0592, 0.588, -0.01, …, -0.111]

Modeling

SPORTS

15

Congratulations to @Cristiano for winning the "Best International Soccer Player" award at the 2018 @ESPYS!

Page 16: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Word embeddings

- + BerlinJapan Germany

German

European

Europe

Africa

Tokyo =

Page 17: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Multilingual embeddings

Machine Learning

Eagleלמידה חישוביתPharmaceuticals Inc.

Eagle Drugs, Co.

Tesla

Energy Storage

טסלה

AI

تیسال موتورز

計算学習

אחסון אנרגיה

Page 18: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

1. Explainability

2. Need more data

3. Computationally expensive

Disadvantages of using Deep Learning

Page 19: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Disadvantages of DL

1. Explainability

2. Need more data

3. Computationally expensive

➔ It’s difficult to understand why a DL model decided on something

1

1.2

3.2

-0.3

0.5

2

Page 20: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Google’s AI won the game Go

Page 21: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Google’s AI won the game Go

Page 22: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

22

By Siddhartha Mukherjee

The dying algorithm - predicts death for oncological patients

“Here is the strange rub of such a deep learning system: It learns, but it cannot tell us why it has learned…

...the algorithm looks vacantly at us when we ask, Why? It is, like death, another black box.”

Jan 2018

Page 23: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Traditional algorithm for gender classification

1. Female writers use more pronouns (I, you, she, their, myself)

2. Males prefer words that identify or determine nouns (a, the, that) and words that quantify them (one, two, more)

Koppel et al., 2002, Automatically Categorizing Written Texts by Author Gender

Page 24: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Disadvantages of DL

1. Explainability

2. Need more data

3. Computationally expensive

➔ Neural Networks usually require more data than traditional algorithms

➔ Usually they need at least tens of thousand (if not millions) of labeled samples

Page 25: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Disadvantages of DL

1. Explainability

2. Need more data

3. Computationally expensive

➔ State of the art DL algorithms can take days and sometimes even weeks to train completely from scratch

➔ The complex structure and relatively large number of parameters result in a slower prediction process

➔ Usually DL algorithms require GPU to maintain a reasonable running time

Page 26: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

26

Example: Named Entity Recognition

Page 27: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

27

Automatically find names of people,

organizations, locations, and more in text

across many languages.

Named entity recognition (NER)

Page 28: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

According to Elon Musk, Mars rocket will

fly ‘short flights’ next year.

28

Page 29: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

?

Page 30: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

30

Context is important

Edward AdelsonNeuroscientist, MIT

Checker shadow illusion

The squares represented by A and B are of the same color

Page 31: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

31

Context is important

Edward AdelsonNeuroscientist, MIT

Checker shadow illusion

The squares represented by A and B are of the same color

Page 32: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Can't play Spain? Improve your playing via easy step-by-step video lessons!

32

But sometimes it gets ambiguous...

Page 33: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

33

But sometimes it gets ambiguous...

Can't play Spain? Improve your playing via easy step-by-step video lessons!

Page 34: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

34

Feed forward network for NER

listen

to

while

I

B-PER

B-LOC

...

...

Layer 1 Layer 2 Output

Spain I-PER...

Page 35: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

+

35

Bidirectional LSTM for Sequence Labeling

LSTM

Washington

B-PER

LSTM

+

LSTM

said

OTHER

LSTM

+

LSTM

in

OTHER

LSTM

+

LSTM

Chicago

B-LOC

LSTM

+

LSTM

last

OTHER

LSTM

...

Page 36: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

36

Overall: better accuracy in multiple languages for NER!

English Arabic Korean

Deep learning model 91.3 83.3 86.4

Traditional model 89.3 80.3 80.7

Page 37: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Some takeaways

➔ Deep Learning algorithms perform better for some NLP tasks

➔ They don’t require feature engineering

➔ For us it provides a more generic approach for NLP, so supporting new languages becomes easier

➔ They are slower than the traditional algorithms

Page 38: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Want to learn more?

Page 39: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Want to learn more?

Page 40: Webinar on Deep Learning's Impact on NLP - Amazon S3Deep+Learning+-+Dec...Why deep learning is so hyped? 1. Data 2. Computational power 3. Algorithms 4. Rebranding Deep learning algorithms

Thank you!

40

Questions?Use the chatbox!

[email protected]

@kfirbar

www.rosette.com