CSCI 568A Discussion 01: Data Mining

Post on 19-Jul-2020


Page 1: CSCI 568A - mines.humanoriented.commines.humanoriented.com/classes/2011/fall/csci568/presentations/0… · Data Mining is the process of discovering interesting patterns in large

CSCI 568A Discussion 01: Data Mining

DATA WHAT?

What Data Mining Isn’t

• Crawling / harvesting / screen scraping

• Querying (fishing)

• Collecting

• Drinking

Statistics

AI / ML

(big) Databases

data mining sandwich

“The process of discovering [useful] patterns in large amounts of data.”

Fry, B. Visualizing Data. 2008.

milk  cereal  diapers  beer
1     1       0        0
1     1       0        0
1     1       1        1
0     0       1        1

What patterns do we see?
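
One way to start answering that question is to count how often pairs of items land in the same basket. The sketch below is only an illustration, not the method used in the course: it assumes each row of the table is one shopping basket, and the names ITEMS, ROWS, pair_cooccurrence, and confidence are made up for this example.

```python
from itertools import combinations

# Assumption: each row of the slide's table is one basket,
# with 1 meaning the item was bought and 0 meaning it was not.
ITEMS = ["milk", "cereal", "diapers", "beer"]
ROWS = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 1, 1],
]


def pair_cooccurrence(items, rows):
    """Count, for every pair of items, how many baskets contain both."""
    counts = {}
    for a, b in combinations(range(len(items)), 2):
        counts[(items[a], items[b])] = sum(
            1 for row in rows if row[a] and row[b]
        )
    return counts


def confidence(items, rows, antecedent, consequent):
    """Fraction of baskets with `antecedent` that also contain `consequent`."""
    a = items.index(antecedent)
    c = items.index(consequent)
    with_antecedent = [row for row in rows if row[a]]
    if not with_antecedent:
        return 0.0
    return sum(1 for row in with_antecedent if row[c]) / len(with_antecedent)


counts = pair_cooccurrence(ITEMS, ROWS)

# Print pairs from most to least frequent, with their share of all baskets.
for pair, n in sorted(counts.items(), key=lambda kv: -kv[1]):
    print(pair, n, n / len(ROWS))

print("diapers -> beer:", confidence(ITEMS, ROWS, "diapers", "beer"))
print("milk -> beer:", confidence(ITEMS, ROWS, "milk", "beer"))
```

On this table, milk and cereal share three of the four baskets and diapers and beer share two, while every other pair shares only one. Every basket with diapers also contains beer, which is the kind of pattern association analysis looks for.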

6 Core Topics

• Data & Big Data

• Classification

• Association Analysis

• Clustering

• Anomaly Detection

• Data Visualization & Interaction

Homework

• Project 1

• Reading 1