“i'm sorry, dave, i'm afraid i can't do that”: can computers really

37
“I’m sorry, Dave, I’m afraid I can’t do that”: Can computers really understand what we say? Cornell summer evening lecture , July 2012 Lillian Lee Professor, Computer Science http://www.cs.cornell.edu/home/llee

Upload: hadan

Post on 13-Feb-2017

222 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

“I’m sorry, Dave, ���I’m afraid I can’t do that”:���

Can computers really understand what we say?

Cornell summer evening lecture, July 2012

Lillian Lee Professor, Computer Science

http://www.cs.cornell.edu/home/llee

Page 2: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

the dream

Page 3: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Why is this man smiling?

http

://w

ww

.nat

ure.

com

/nat

ure/

jour

nal/v

482/

n738

6/fu

ll/48

2440

a.ht

ml

Page 4: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

The Turing test:��� Intelligence è human-level language use

http

://bi

tter

swee

tsag

e.bl

ogsp

ot.c

om/2

010/

01/c

omic

-con

vers

e-tu

ring

-tes

t.htm

l

Turing predicted we’d be close in about 50 years.

]http://ghostradio.files.wordpress.com

/2011/03/blade_runner_fondo.jpg

Page 5: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

http

://w

ww

.blo

gcdn

.com

/ww

w.tu

aw.c

om/m

edia

/201

0/04

/jarv

ism

ac.jp

g

http

://w

ww

.nav

tone

s.co

m/m

edia

/imag

e/ca

ched

_kni

ght_

ride

r_ki

tt.jp

g

http

://up

load

.wik

imed

ia.o

rg/w

ikip

edia

/en/

0/09

/Dat

aTN

G.jp

g

Do authors dream of electric speech?

“Jarvis”, the A.I. system in Iron Man

Page 6: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Why is this man not smiling?

http

://w

ww

.net

braw

l.com

/mat

chup

.php

?mid

=11

131&

brac

ketid

=49

7

http

://4.

bp.b

logs

pot.c

om/_

Qm

9Cek

v5Jj4

/S8I

q3em

ehgI

/AA

AA

AA

AA

ARU

/oBZ

6Ih5

J4fI/

s200

/200

1-a-

spac

e-od

ysse

y.jpg

Open the pod bay doors, Hal.

I’m sorry, Dave, I’m afraid I can’t do that.

Page 7: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

from sci-fi to science and engineering

Page 8: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Goal: create systems that use human language as input/output

•  speech-based interfaces

•  information retrieval / question answering

•  automatic summarization of news, emails, postings, etc.

•  automatic translation

… and much more!

Interdisciplinary: computer science; linguistics, psychology, communication; probability & statistics, information theory…

Natural-language processing (NLP)

Page 9: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Recently deployed (in beta): Siri

http://www.apple.com/iphone/features/siri.html

Page 10: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

State of the art: Watson C

redi

t: A

P Ph

oto/

Jeop

ardy

Pro

duct

ions

Inc.

The Watson system beat human Jeopardy! champions (and didn’t have internet access; it learned by “reading” before the match)

Page 11: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Why is this man smiling?

Charles (Bud) Jermy, Jr., host of this lecture series

Page 12: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

But we’re not all the way there yet

Page 13: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Real-life error (1)

Hey bunch of grapes

isto

ck |

blan

kabo

skov

A bunch of grapes. ht

tp://

rand

omha

ndpr

ints

.blo

gspo

t.com

/201

1_01

_01_

arch

ive.

htm

l

Page 14: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Real-life error (2)

We can email you when you're fat.

We can email you when we're back.

http

://ca

tand

girl

.com

/?p=

2678

isto

ck |

blan

kabo

skov

Page 15: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Real-life error (3)

[This U.S. city’s] largest airport …

What is Toronto???

http

://je

opar

dy.e

dogo

.com

/wp-

cont

ent/

uplo

ads/

2009

/01/

prog

ram

-jeop

ardy

1.jp

g

Page 16: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

why is understanding language so hard?

Page 17: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

List all flights on Tuesday

Challenge: ambiguity

List all flights on Tuesday = List all the flights leaving on Tuesday.

List all flights on Tuesday = Wait ‘til Tuesday, then list all flights.

Page 18: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Retrieve all the local patient files

More realistic example

Page 19: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Baroque example

I saw her duck with a telescope.

[http://www.supercoloring.com/pages/duck-outline/] [http://casablancapa.blogspot.com/2010/05/fore.htm]l

Page 20: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Baroque example

I saw her duck with a telescope.

[http://www.supercoloring.com/pages/duck-outline/]

http

://w

ww

.clip

artm

ojo.

com

/plu

gins

/Clip

art/

Clip

artS

tock

1/st

ar%

20ga

zing

.png

http

://w

ww

.geo

citie

s.w

s/lo

oney

ebay

/del

l/bb0

40.jp

g

http://pokerfoldingtable.com/wp-content/uploads/2009/02/three-men-gambling-sitting-at-poker-table-playing-cards-betting-party-pen-ink-drawing-300x234.png

Page 21: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Conversation complications

[Grishman 1986]

Q: Do you know when the train to Boston leaves?

A: Yes.

Q: I want to know when the train to Boston leaves.

A: I understand.

Images: http://3.bp.blogspot.com/_o4kq5TNL0Z4/TUx0j6E5BLI/AAAAAAAAA5k/J7xjhvrcNlU/s1600/Trillian-hitchhikers-guide-to-the-galaxy-the-2005.jpg, http://www.tvacres.com/images/robots_androids_marvin_movie.jpg

Page 22: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

[http

://br

owse

.dev

iant

art.c

om/?

qh=

&se

ctio

n=&

glob

al=

1&q=

mus

cled

uck#

/d14

nst5

] I’m sorry, Dave, I’m afraid I can’t do that.

I’m afraid you might be right.

[htt

p://s

elco

uth.

com

/201

1/03

]

Page 23: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Meeting these challenges: a brief history

Page 24: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

1940s – 50s: ���From language to probability

“The fundamental problem of communication is that of reproducing at one point either exactly or approximately a message selected at another point ...

[The] semantic aspects of communication are irrelevant to the engineering problem.

The significant aspect is that the actual message is one selected from a set of possible messages.”

--C. Shannon, 1948

Page 25: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Language, statistics, cryptography

WWII: Turing helps break the German “Enigma” code

(An original Enigma machine for encrypting messages is on display now in the Kroch Library in Olin.)

Page 26: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Why is this man smiling? ht

tp://

arto

frev

olut

ion.

co.u

k/ne

w/in

dex.

php?

mai

n_pa

ge=

prod

uct_

info

&cP

ath=

1_3&

prod

ucts

_id=

239&

zeni

d=vi

d94s

pfpa

9vtr

18sb

tgug

1h64

I can see Alaska from my house!

Encryption process

[W. Weaver memo on translation, 1949]

Page 27: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Two probabilities to infer

I can see Alaska from my house!

Encryption process

[Russian]

Prob. of generating this original message?

Prob. of doing this encryption of the original?

Page 28: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Another use of message probs: speech recognition

(1) It’s hard to recognize speech

(2) It’s hard to wreck a nice beach

Both messages have almost the same acoustics, but different likelihoods.

Page 29: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

1950s-1980s: Breaking with statistics

(a) Colorless green ideas sleep furiously

(b) Furiously sleep ideas green colorless

N. Chomsky (1957):

The argument: Neither sentence has ever occurred in the history of English. So any statistical model would given them the same probability (zero).

The field moved to sophisticated non-probabilistic models of language.

Page 30: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

1990s: The empiricists strike back

•  Huge amounts of data start coming online

•  Advances in algorithms and computational power

“Every time I fire a linguist, my [system’s] performance goes up” -- F. Jelinek (apocryphal)

Page 31: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

2000s and beyond: ���integrating language insights and

statistical techniques

[All 8 results were from March 2011 or earlier]

Is Snooki on stork watch?

(wondered in March 2012)

Page 32: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Integrating lang and stats (cont)

Snooki and fiancé Jionni LaValle are expecting their first child together

Angie Harmon on Stork Watch By Marcus Errico

Angie Harmon's going from assistant district attorneying to diaper duty. The former Law & Order legal dish is expecting her first child with football stud hubby Jason Sehorn, her publicist confirmed Tuesday.

Bowie & Iman On Stork Watch BY GEORGE RUSH DAILY NEWS COLUMNIST Monday, February 14, 2000 Rock legend David Bowie and supermodel Iman said yesterday they're expecting their first child

Is Snooki on stork watch?

Snooki?!!

Page 33: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

the game-changers:

•  data-driven approaches

• models of language

Page 34: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

computers can also (help us) understand us

Page 35: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Why is this man smiling?

 C. Danescu-Niculescu-Mizil et al. ACL 2012

Beyond situational effects, phrasing also affects memorability:

•  memorable movie quotes (in aggregate) are unusual word choices built on a scaffolding of common part-of-speech patterns ‣ shown via language models

•  carries over to ad slogans

http

://w

ww

.sch

wim

mer

lega

l.com

/200

6/11

/evi

denc

e-of

-sec

onda

ry-m

eani

ng-in

-tv-

catc

hphr

ases

.htm

l

Page 36: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Social interaction: who has the lead? Communicative behaviors are “patterned and coordinated, like a dance” [Niederhoffer and Pennebaker, ‘02]

http

://m

inim

alm

ovie

post

ers.

tum

blr.c

om/p

ost/

1608

2323

317/

pulp

-fict

ion-

by-a

na-b

alde

rram

as

adah ja ad to the adajkj the

adah ja ad at a adajkj the

adah ja ad of adajkj the

adah ja ad of adajkj the

adah to ja ad an adajkj gh

adah ja ad the adajkj forhgh

Those with less power tend to immediately match the function-word choices of those with more power. [C. Danescu-Niculescu-Mizil et al. WWW 2012]

Page 37: “I'm sorry, Dave, I'm afraid I can't do that”: Can computers really

Why is this man smiling?

We  may  hope  that  machines  will  eventually  compete  with  men  in  all  purely  intellectual  fields.  But  which  are  the  best  ones  to  start  with?  Even  this  is  a  difficult  decision....  I  do  not  know  what  the  right  answer  is,  but  I  think  [different]  approaches  should  be  tried.  

We  can  only  see  a  short  distance  ahead,    but  we  can  see  plenty  there  that  needs  to  be  done.