data sience coffe

Post on 22-Jan-2017

22 Views

Category:

Data & Analytics

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Chat Bot topic summary

TeamFICC Tech macro

Self introduction

• Noboru Kano– 2016 new grad

• Interesting topicsNLP(Natural language processing), Statistics, ML(Machine learning)

• Experience1year part time on a start up company as a ML engineer(Did 3 NLP projects)

• HobbyHandball, Chinese food, drink party(sometimes), 2ch

Agenda

• What is Bot ?• Chat bot history • type of chat bot

algorithm and demo• case study

What is Bot ?

• A computer program that simulates human conversation, or chat, through artificial intelligence.(From wiki)

Why Chat Bot ?

• This year, chat bot has attracted a great deal of public attention.

History of Chat bot

• ELIZA “doctor”(1966)An early example of primitive NLP chat bot

• A simulation of a psychotherapist. On your Emacs

• you can find free source about ELIZA in internet

Type of Chat bot

I bought a computer

I like the computer

Hello

Hello I’m kanono !

1. If-then-eles Type

• If the words in dictionaries, bot can respond to you.• Accuracy depends on the volume of the dictionary

example : ELIZA

Type of Chat bot2. Use dialogue dataStore dialogue data into DB and response a similar message.

• The response would be a human-like message.• Accuracy depends on the volume of the dialogue

Are you free now?

sorry I’m really busy

That too bad...

What happened?

Are:1, you:1, free:1, now:1

dialogue DB

ex : パン太一郎

Type of Chat bot3. Generate model• generate sentence with statistical method.• calculate the next words appearance ratio.

which dessert do you like the best ?

Yogurt is dessert

I really like frozen yogurt

I watched “Frozen” last night

• Can use variety of phrase and expression• Difficult to control context in message

Dialogue corpus

Generate Model

Algorithm for Generate model

1. Markov chainI am John.I am kanono.I do not like English

I

do

am

not

0.66

0.33

John

kanono

0.5

0.5

・・・

Strong in make a short sentence.

not good at generating long story.with large scale corpus data

Demo

What I made?

Input: ImageOut put: generate a sentence related to the image

Hi ! I’m going to take a flight to NY training, see you soon!

技術解説(画像分類部分)

アルゴリズム1 位 aircraft( 飛行機 )2 位 plane ...

上位 10 クラスのスコアを出力→ 日本語に翻訳

ラベル

技術解説(ツイート生成部分)1 位 aircraft (飛行機)

ラベル

ラベルに該当するツイートをたくさん取得♡

・・・

テキストを自動生成(自作)今日は飛行機にのるよー!名古屋みんな待っててね!

ヒミツの自作アルゴリズム♥

Algorithm for Generate model

2. RNN(Reccurent Neural Network) ex.) Allo, りんな

RNN for semantic analysis RNN for generate response

Case study

• Check my Qiita page• http://qiita.com/kanottyan/items/

2783bf91c8ea6a8a4ce8

top related