accumulo summit 2015: fraud analytics using accumulo, julia and fast sql [leveraging accumulo]

Upload: accumulo-summit

Post on 15-Jul-2015

TRANSCRIPT

Page 1: Accumulo Summit 2015: Fraud Analytics using Accumulo, Julia and Fast SQL [Leveraging Accumulo]

Copyright © 2014 by Argyle Data Inc. All Rights Reserved.

2014: the Year the Fraud and Cyber-Security Dam Broke

Rewriting the Book on Fraud Analytics

Page 2:

Stack Shift

Hadoop

Accumulo

Presto

Julia

Packet Ingestion

Page 3:

Do What Couldn’t Be Done

Petabyte Scale

100 Million Inserts/Second

Interactive SQL Queries

Real-Time Machine Learning

See Everything Non-Intrusively

Page 4:

Mobile Fraud $46.3 Billion

Subscription $5.2B

Wangiri $2B

SMS Phishing $1.7B

PBX $4.4B

Roaming $6.1B

Premium Rate $4.7B

IRSF $1.8B

Arbitrage $2.2B

Page 5:

Grab from the Network

Feature/Floop Generation

Combine Network and Business Data

Apply the Right ML Algorithms

Page 6:

Prove Real-Time Fraud Analytics at National Scale

World’s Largest Mobile Carrier

2% of Revenue to Fraud

4 Month Rollout

Numerous Multi-Million Attacks Thwarted

Page 7:

Julia

fresh start for scientific compute

speed of C meets dynamism of Ruby

Page 8:

Non-parametric Methods

No / Low Assumptions

but

Computationally Untenable
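
To make the trade-off on this slide concrete, here is a minimal sketch of one common non-parametric method, a Gaussian kernel density estimate, used to score how unusual a value is. It is written in Python for illustration, not in Julia, and it is not taken from the talk: the function names, the bandwidth, and the sample call durations are all hypothetical. The estimate assumes no particular distribution for the data, but every query has to touch every reference sample, which is where the cost comes from.

```python
import math


def gaussian_kde_density(query, reference, bandwidth):
    """Estimate the density at `query` from 1-D `reference` samples.

    Uses a Gaussian kernel. No distribution is fitted: the estimate is
    an average of kernels centred on the reference samples themselves.
    """
    if bandwidth <= 0:
        raise ValueError("bandwidth must be positive")
    if not reference:
        raise ValueError("reference must not be empty")
    norm = 1.0 / (len(reference) * bandwidth * math.sqrt(2.0 * math.pi))
    total = 0.0
    for x in reference:  # one kernel evaluation per reference sample
        z = (query - x) / bandwidth
        total += math.exp(-0.5 * z * z)
    return norm * total


def kernel_evaluations(num_queries, num_reference):
    """Kernel evaluations needed to score every query against every sample."""
    return num_queries * num_reference


# Hypothetical call durations in seconds (made-up illustration data).
typical_calls = [60.0, 62.0, 58.0, 61.0, 59.0]
usual = gaussian_kde_density(60.0, typical_calls, bandwidth=2.0)
unusual = gaussian_kde_density(600.0, typical_calls, bandwidth=2.0)
print(usual > unusual)  # a low density marks a value as unusual
print(kernel_evaluations(1000, 1000000))
```

Scoring 1,000 new events against 1,000,000 stored samples already takes 1,000,000,000 kernel evaluations in this naive form, and the count grows with the product of the two sizes.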

Page 9:

Non-Parametrics in distributed Julia!

good news

Accumulo doesn’t melt!
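
One reason a kernel-based non-parametric estimate can be distributed at all is that its sum over stored samples splits cleanly across partitions: each worker computes a partial sum over its own chunk, and the partial sums are added and normalized once. The sketch below illustrates that decomposition in Python, with a thread pool standing in for distributed workers. It is an assumption-laden illustration, not the Julia code or the Accumulo integration described in the talk, and all names in it are hypothetical.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def partial_kernel_sum(query, chunk, bandwidth):
    """Unnormalized Gaussian kernel sum over one partition of the samples."""
    return sum(math.exp(-0.5 * ((query - x) / bandwidth) ** 2) for x in chunk)


def partitioned_density(query, chunks, bandwidth, max_workers=4):
    """Kernel density estimate at `query` computed partition by partition.

    `chunks` is a list of lists of 1-D samples. In a real deployment each
    chunk would live on a different worker; here threads stand in for them.
    """
    total_samples = sum(len(chunk) for chunk in chunks)
    if total_samples == 0:
        raise ValueError("chunks must contain at least one sample")
    if bandwidth <= 0:
        raise ValueError("bandwidth must be positive")
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        partials = list(
            pool.map(lambda c: partial_kernel_sum(query, c, bandwidth), chunks)
        )
    # Combine step: add the partial sums, then normalize once.
    return sum(partials) / (total_samples * bandwidth * math.sqrt(2.0 * math.pi))


# Hypothetical samples split across three partitions.
chunks = [[0.0, 1.0], [2.0, 3.0], [4.0]]
flat = [x for chunk in chunks for x in chunk]
split_result = partitioned_density(2.0, chunks, bandwidth=1.0)
single_result = partitioned_density(2.0, [flat], bandwidth=1.0)
print(abs(split_result - single_result) < 1e-12)
```

Only one number per partition crosses the network in the combine step, so the work per query scales with the size of the largest partition rather than with the full data set.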