analysis of similarity measures between short text...

21
Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Text Conversation Task Kozo Chikai Graduate school of information science and technology, Osaka University 1 Yuki Arase Graduate school of information science and technology, Osaka University

Upload: others

Post on 21-Jun-2020

24 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

Analysis of Similarity Measures between Short Text for the NTCIR-12

Short Text Conversation TaskKozo Chikai

Graduate school of information science and

technology, Osaka University

1

Yuki Arase Graduate school of

information science and technology,

Osaka University

Page 2: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

① IntroductionØOverview of STC task and goal of our study

② Methods & System Design• WTMF model• Word2vec• Random Forest and its learning method

③ Experiment④ Result & Discussion

2

AGENDA

Page 3: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

• Short Text Conversation (STC) TasküRetrieve suitable replies from a pool of tweets

→ Systemʼs output = Ranking of replies

üSystem design : Input → Text (similar to input) → Output (reply)

3

INTRODUCTION

Page 4: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

① IntroductionØOverview of STC task and Goal of our study

② Methods & System Design• WTMF model• Word2vec• Random Forest and its learning method

③ Experiment④ Result & Discussion

4

AGENDA

Page 5: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

METHODS• Weighted Text Matrix Factorization (WTMF)

üSimply models the text-to-word information without leveraging the correlation between short text.

ü𝑋: term-document matrix (row: document vector, cell: TF-IDF value)→ Factorized into two matrix such that 𝑋 ≈ 𝑃$𝑄 (where 𝑃 ∈ ℝ(×*, Q ∈ ℝ(×,)

5

Source:Modeling Sentences in the Latent Space

Page 6: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

METHODS• Word Embedding

üGenerate word vectors using a neural networküLiner operation between words is possible

(ex: ”king”-”man”+”woman” = “queen”)üAdopted the implementation known as Word2vec tool

→ Output: low-dimensional vector representing words

6Source:Efficient estimation of word representations in vector space

Page 7: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

• Learning Post-Comment Pairs using Random Forest

7

tweet tweet

tweet pair set

・・・

Reduce the dimension of tweet vectors to 100

using SVD

Randomly sample 300,000

positive/negative examples

Random Forest

𝑎.,. ⋯ 𝑎.,1..⋮ ⋱ ⋮

𝑎4.....,. ⋯ 𝑎4.....,1..

Training

Decide if an input pair is likely post-comment

METHODS

post comment

Page 8: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

• Combination of Random Forest and TF-IDF

8

tweet

tweet set

・・・

Input post

Probability:0.6Probability:0.2Probability:0.89

Combine probability output by Random Forest and the value of TF-IDF

A) Random Forest → TF-IDFTweets have the probability more than 0.5 → TF-IDF

B) Random Forest + TF-IDFSum the probability and TF-IDF (cos similarity) value

METHODS

Page 9: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

① TF-IDF② WTMF③ Word2vec → TF-IDF④ Random Forest → TF-IDF⑤ Random Forest + TF-IDF

DESIGN OF CONVERSATION SYSTEM

Ranking 9

Page 10: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

① IntroductionØOverview of STC task and Goal of our study

② Methods & System Design• WTMF model• Word2vec• Random Forest and its learning method

③ Experiment④ Result & Discussion

10

AGENDA

Page 11: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

EXPERIMENT• Setting

üGoal: Evaluate a ranked list of potential replies to an input tweet

ü10 annotators assigned a score to each reply (tweet)→ score: +2, +1, and 0→ The larger score the tweet has, the better reply it is!

üEvaluation criteria: nDCG@1, nERR@5, and Accuracy → Each method has a value between 0.0 and 1.0 using these criteria→ The larger score is better

11

Name of method Technique

Oni-J-R1 ④Random Forest → TF-IDF

Oni-J-R2 ⑤Random Forest + TF-IDF

Oni-J-R3 ①TF-IDF

Oni-J-R4 ③Word2Vec → TF-IDF

Oni-J-R5 ②Weighted Text Matrix Factorization

Page 12: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

① IntroductionØOverview of STC task and Goal of our study

② Methods & System Design• WTMF model• Word2vec• Random Forest and its learning method

③ Experiment④ Result & Discussion

12

OVERVIEW OF THIS PRESENTATION

Page 13: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

13

RESULT

Page 14: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

14

DISCUSSION

Page 15: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

DISCUSSION

15

( Oni-J-R1: ʻRandom Forest → TF-IDFʼ, Oni-J-R3ʼs method: ʻTF-IDFʼ )

Page 16: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

DISCUSSION(2)

16( Oni-J-R1ʼs method is ʻRandom Forest → TF-IDFʼ )

Page 17: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

DISCUSSION(3)

17( Oni-J-R3ʼs method: ʻTF-IDFʼ, Oni-J-R4: ʻWord2vec → TF-IDFʼ)

Page 18: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

SUMMARY• In this study, we compare the conventional methods to handle short text.

• We use unsupervised methods to generate vectors

• We also use supervised methods to learn if a pair of tweets can be a post-comment pair.

• As the result of the formal run in STC, ①Random Forest → TF-IDFoutperformed other methods.

• The method using Word2vec shows interesting results in some context, however there is room for improvement in this method.

18

Page 19: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

REFERENCES

19

• Modeling Sentences in the Latent SpaceW. Guo, and M. DiabIn Proceeding of ACL (2012)

• Distributed Representations of Sentences and DocumentsT. Mikolov, Q. V. LeIn Proceedings of ICML(2014)

• Overview of the NTCIR-12 short text conversation taskL. Shang, T. Sakai, Z. Lu, H. Li, R. Higashinaka, and Y. MiyaoIn Proceedings of the NTCIR Workshop Meeting on Evaluation of Information Access Technologies(2016)

Page 20: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

APPENDIX (1)• The way of Word2Vec → TF-IDF

1. Extract nouns and verbs in an input post 2. Extract 20 most similar words for each noun and verb using word2vec3. Search tweets containing these words in the corpus 4. Generate the reply list from these tweets in the same manner with ①TF-IDF

20

Did you finish your homework ?

rank word1 “end”2 “complete”… …20 “close”

rank word1 “assignment”2 “work”… …20 “question”

1.

“finish”

“homework”

2.

Have you got the homework done?

Iʼve finishedtodayʼs work.

・・・3.

candidates

4. Generatethe reply listusing cosine similarity

Page 21: Analysis of Similarity Measures between Short Text …research.nii.ac.jp/ntcir/workshop/OnlineProceedings12/...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short

APPENDIX (2)• Method of Generating the reply list

üWe use the relationship of a pair of post-comment in Twitter.

üWhy is the similar tweet itself included in the reply list?→ Because the similar tweet can be useful as a reply in some context.

üIn this case, the direction of post-comment can be reversed. Thus each tweet can be a reply for an input post.

21

For example…

Post : It is pretty cool in Hokkaido today.Comment : Summer is the best season in Hokkaido.