elie bursztein steven bethard stanford university · elie bursztein steven bethard stanford...

48
Stanford Computer Security Lab Breaking E-bay audio captcha Elie Bursztein Steven Bethard Stanford University

Upload: others

Post on 22-Mar-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Stan

ford

Com

pute

r Se

curi

ty L

ab

Breaking E-bay audio captchaElie Bursztein Steven Bethard

Stanford University

Page 2: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Breaking E-bay audio captchaElie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/

Outline

• Breaking an audio captcha

• E-bay audio captcha

• What is next ?

Page 3: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Bad or good user ?

Breaking E-bay audio captcha

Page 4: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Bad or good user ?

Breaking E-bay audio captcha

?

Page 5: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Bad or good user ?

Breaking E-bay audio captcha

?users

Page 6: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Bad or good user ?

Breaking E-bay audio captcha

?usersbots

Page 7: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

CAPTCHA

Completely Automated Public Turing test to tell Computers and Humans Apart

Term coined in 2000 [VBHL]

Page 8: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Breaking the registration process

Scraping

Page 9: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Breaking the registration process

Scraping

Solving

Page 10: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Breaking the registration process

Scraping

Solving

Registering

Page 11: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Breaking captcha

Pre-processing

Page 12: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Breaking captcha

Pre-processing

Segmentation

Page 13: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Breaking captcha

Pre-processing

Segmentation

Classification

Page 14: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

captcha breaker efficiency metric

• Coverage

• Precision

• Accuracy

Accuracy = Coverage x Precision

Catpcha goal = 0.01% accuracy [CLC05]

Page 15: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Stan

ford

Com

pute

r Se

curi

ty L

ab

Breaking E-Bay captcha

Page 16: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

The registration page

Page 17: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Registration Security

• Allow to download/try multiple time the same captcha

• Limit poorly the number of captcha download

Page 18: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Fetching performance

Page 19: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Fetching performance

0

2

4

6

8

10

12

14

16

18

5

35

65

95

12

5

15

5

18

5

21

5

24

5

27

5

30

5

33

5

36

5

39

5

42

5

45

5

48

5

51

5

54

5

57

5

60

5

63

5

Set 4

Set 5

Set 6

Set 7

Set 8

Page 20: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

E-Bay captcha

Page 21: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

E-Bay captcha

3

Page 22: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

E-Bay captcha

3 1

Page 23: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

E-Bay captcha

3 1 1

Page 24: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

E-Bay captcha

3 1 1 7

Page 25: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

E-Bay captcha

3 1 1 7 4

Page 26: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

E-Bay captcha

3 1 1 7 4 2

Page 27: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Captcha implementation

• Fixed number of digit

• Multiple downloads with different voices

Page 28: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Building a corpus

• We downloaded 26 000 captchas overall

• 8x 2000 samples (training set)

• 50 x 200 samples (evaluation set)

Page 29: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Building a breaker

• Two options

• Use state of art speech recognizer

• Build a specialized classifier

Page 30: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Sphinx 4

• State of art speech recognizer

• Available models

• HUB4 : 64 000 words model

• Isolated digits

• Connected digits

Page 31: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Breaking E-bay audio captchaElie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/

Attempt 1 result

Model Acc Prec Cov

TIDGITS 0.3% 9.6% 3.6%

HUB4 1% 28.9% 3.6%

Page 32: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Attempt 2 : build our own classifier

Page 33: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Attempt 2 : build our own classifier

Buggy / U

gly C co

de !

Page 34: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

How Decaptcha works

Page 35: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

How Decaptcha works

Page 36: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

How Decaptcha works

Page 37: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

How Decaptcha works

0:26645 67191 119061 89244 54427 65733 123753 34290 228898 99908 201288 241704 187527 187323 355112 131920 67751 152565 129169 143524 76310 76563 202572 143785 225241 711726 817683 1044637 146120 479933 350394 942188 209828 270356 279296 120812 268639 140728 98185 223023 269431 277125 236198 246403 132344 289054 102437 105509 212886 295669 197393 318710 431137 785228 843192 3190283 24248 399464 830038 880644 111876 760679 469023 78097 297356 146268 298207 257063 556285 606488 352779 159524 229851 164489 102580 201637 81613 112075 249878 226535 477520 866948 902598 4626601 148464 208997 592865 1087574 74635 509526 382653 238383 202327 243430 268997 606916 301822 358115 208761 111799 119561 142713 172740 202756 76463 66374 146260 274962 246128 982006 1654120 5740142 45956 258133 1005985 1098416 57227 828851 1339151 314516 513288 179292 550734 519032 799286 231594 498671 342832 202360 197459 185155 315956 209138 234810 160295 300425 329821 1353146 2571746 4564688 81712 161237 417391 506261 185897 119133 338142 140221 337865 401714 465674 350169 270428 365631 253015 139859 124894 180852 114128 147683 100545 138302 202106 381131 388385 747495 1390303 3786484

Page 38: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Breaking E-bay audio captchaElie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/

Result

Model Acc Prec Cov

3x100 53.4% 65.7% 81.3%

3x500 59.6% 73.3% 81.3%

6x100 65.6% 80.7% 81.3%

8x1000 75.1% 92.3% 81.3%

Page 39: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

More captcha or more samples ?

Accuracy Precision

10x50 44% 55%

25x20 76% 95%

50x10 72% 90%

Page 40: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Stan

ford

Com

pute

r Se

curi

ty L

ab

What’s next

Page 41: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Building a generic attack framework

• What is difficult to computer ?

• What is easy for human ?

• Difference between visual and audio captcha ?

Page 42: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Visual captcha Hardness

• What is hard for a computer ?

• Decomposition is hard

• Recognition is easy

! "

#$%&'!()*+!,-../,,!0#*/,!*+#*!#!&/-0#$!&/*(%01!.#&!#.+)/2/!*%!0/.%'&),/!*+/34!5*!),!/,*#6$),+/7!

)&! 89:! *+#*! );! *+/! <%,)*)%&,! %;! .+#0#.*/0,! #0/! 1&%(&! )&! .+#$$/&'/! )3#'/,! '/&/0#*/7! 6=! #!

>?@A>B?C!*+/&!60/#1)&'!*+),!,.+/3/!),! D-,*!#!<-0/!0/.%'&)*)%&!<0%6$/3C!(+).+!),!#! *0)2)#$!

*#,1!()*+!,*#&7#07!3#.+)&/!$/#0&)&'!*/.+&)E-/,!,-.+!#,!&/-0#$!&/*(%01,!8F":4!!

!

Characters under typical distortions Recognition rate

!GFHHI!

!J9KI!

!FHHI!

!JLI!

!GFHHI!

!JMKI!

!Table 1. Recognition rate for individual characters under different distortions N#$$!7#*#!)&!*+),!

*#6$/!#0/!*#1/&!;0%3!89:O

!

B%(/2/0C!(+/&! *+/! $%.#*)%&! %;! .+#0#.*/0,! )&! #!>?@A>B?!.+#$$/&'/! ),! &%*! 1&%(&! #P<0)%0)!

N/4'4! )&! *+/! ;%$$%()&'! )3#'/,! *#1/&! ;0%3! 8Q:OC! !,*#*/!%;! *+/!#0*! N)&.$-7)&'!3#.+)&/! $/#0&)&'O!

3/*+%7,!7%!&%*!(%01!(/$$!)&!$%.#*)&'!*+/!.+#0#.*/0,C!$/*!#$%&/!0/.%'&),)&'!*+/34!!

!

!!! !!

A+/!<0%6$/3!%;! )7/&*);=)&'!.+#0#.*/0! $%.#*)%&,! )&! *+/! 0)'+*!%07/0C!%0!segmentationC! ),! ,*)$$! #!

.+#$$/&')&'! <0%6$/3! )&! *+/! ;)/$7,! ,-.+! #,! +#&7(0)*)&'! 0/.%'&)*)%&! #&7! .%3<-*/0! 2),)%&4! 5&!

'/&/0#$C! ,/'3/&*#*)%&! ),! .%3<-*#*)%&#$$=! /R</&,)2/C! #&7! %;*/&! #! .%36)&#*%0)#$$=! +#07!

<0%6$/3!8Q:4!!

!

A+/!,*#*/!%;!*+/!#0*!%;!>?@A>B?!7/,)'&!,-''/,*,!*+#*!*+/!0%6-,*&/,,!%;!*/R*P6#,/7!,.+/3/,!

,+%-$7! 0/$=! %&! *+/! 7);;).-$*=! %;! ;)&7)&'! (+/0/! *+/! .+#0#.*/0! ),! N,/'3/&*#*)%&OC! 0#*+/0! *+#&!

(+).+! .+#0#.*/0! )*! ),! N0/.%'&)*)%&O! 8FFC! SC! QC! MC! 9:4! A+#*! ),C! ,-.+! >?@A>B?,! ,+%-$7! 6/!

segmentation-resistant4! 5&! %*+/0! (%07,C! if breaking a (text-based) CAPTCHA can be

successfully reduced to a problem of individual character recognition, then this scheme is

effectively broken.!!

!

5&!*+),!<#</0C!(/!/R#3)&/!*+/!,/.-0)*=!%;!#!T).0%,%;*!>?@A>B?C!#!,.+/3/!*+#*!),!7/,)'&/7!

*%! 6/! ,/'3/&*#*)%&! 0/,),*#&*4! A+),! ,.+/3/! (#,! 7/,)'&/7! 6=! #&! )&*/07),.)<$)&#0=! */#3! %;!

7)2/0,/! /R</0*),/! )&!T).0%,%;*! )&.$-7)&'! 7%.-3/&*! <0%./,,)&'! #&7! -&7/0,*#&7)&'C! 3#.+)&/!

$/#0&)&'C!B>5!#&7!,/.-0)*=4! 5&!;#.*C! *+/!()7/$=!#../<*/7!Usegmentation resistanceV <0)&.)<$/

(#,!/,*#6$),+/7!6=!*+),!*/#34!A+),!T).0%,%;*!>?@A>B?!+#,!6//&!7/<$%=/7!)&!3#&=!%;!*+/)0!

%&$)&/! ,/02)./,! )&.$-7)&'!B%*3#)$C!TWX!#&7!Y)&7%(,!Z)2/! ;%0! =/#0,4! 5*,! ;)0,*! 2/0,)%&!(#,!

7/<$%=/7! )&!B%*3#)$[,!-,/0! 0/'),*0#*)%&!,=,*/3!)&!"HH"!8FF:C!#&7!/2/0!,)&./! *+/!,.+/3/!+#,!

-&7/0'%&/!/R*/&,)2/! )3<0%2/3/&*! )&! */03,!%;!6%*+!0%6-,*&/,,! 8SC!QC!9:!#&7!-,#6)$)*=! 8QC!M:4!

T).0%,%;*!+#,!#$,%!;)$/7!*+0//!\W!<#*/&*!#<<$).#*)%&,!*%!<0%*/.*!*+/!-&7/0$=)&'!*/.+&%$%'=!8L:4!

>$/#0$=C!*+),!,.+/3/!),!.#0/;-$$=!#&7!(/$$!7/,)'&/74!!

!

Page 43: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Decomposition hardness

• Separation should be harder for some type

• Separated digits

• Letters

• Numbers

• Words

• Continuous speech

Page 44: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Recognition hardness

• Introduce noise

• Introduce distortion

• Larger corpus

Page 45: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

What about user ?

Rec

ogni

tion

Difficulty

AI ?

Human

Page 46: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Visual vs Audio

• It is all about data pool :

• Infinite for visual captcha

• finite for audio captcha

Page 47: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Conclusion

• Things we are currently doing

• Building a generic attack framework

• Defining a security metric

• Studying usability

• Breaking more captcha scheme :)

• Things we hope to do

• Find a new way to do audio captcha

• Going after visual captcha

Page 48: Elie Bursztein Steven Bethard Stanford University · Elie Bursztein Steven Bethard Stanford University. ... Building a generic attack framework ... (%07,C! if breaking a (text-based)

Elie Bursztein, Steven Bethard http://dontrythisathome.com/decaptcha/Breaking E-bay audio captcha

Thanks / questions

• Thanks to

• Jerry Louis and the E-bay security team

• Celine Fabry

• Questions ?

If you want to try decaptcha

http://www.dontrythisathome.com/decaptcha/