![Page 1: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/1.jpg)
Mining and Querying Multimedia DataFan GuoSep 19, 2011Committee Members:Christos Faloutsos, ChairEric P. XingWilliam W. CohenAmbuj K. Singh, University of California at Santa Babara
![Page 2: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/2.jpg)
What this talk is about?
2
![Page 3: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/3.jpg)
What this talk is about?
3
![Page 4: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/4.jpg)
Going Multimedia
4
![Page 5: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/5.jpg)
Beyond Text and Images
5
![Page 6: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/6.jpg)
Thesis Outline
MiningM1: MultiAspectForensics
M2: QMAS
Querying
Q1: Click Models
Q2: C-DEM
Q3: BEFH
6
![Page 7: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/7.jpg)
Thesis Outline
MiningM1: MultiAspectForensics
M2: QMAS
Querying
Q1: Click Models
Q2: C-DEM
Q3: BEFH
7
![Page 8: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/8.jpg)
Mining Multimedia Data (1)
•Labeling Satellite Imagery
8
Input Output
![Page 9: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/9.jpg)
Mining Multimedia Data (2)
•Network Traffic Log Analysis
9
![Page 10: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/10.jpg)
Mining Multimedia Data (3)
•Web Knowledge Base
10
![Page 11: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/11.jpg)
Mining Multimedia Data
•Data-driven problem solving over multiple modes at a non-trivial scale.
11
![Page 12: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/12.jpg)
Thesis Outline
MiningM1: MultiAspectForensics
M2: QMAS
Querying
Q1: Click Models
Q2: C-DEM
Q3: BEFH
12
![Page 13: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/13.jpg)
Querying Multimedia Data (1)
•A querying system provides an interface to retrieve records that best match users’ information need.
13
![Page 14: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/14.jpg)
Querying Multimedia Data (1)
•Here is another example:
14
https://www.facebook.com/pages/browser.php
![Page 15: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/15.jpg)
Querying Multimedia Data (1)
•May be transformed into a graph search problem
15
![Page 16: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/16.jpg)
Querying Multimedia Data (2)
•Calibrate ranking from user feedback
16
![Page 17: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/17.jpg)
Querying Multimedia Data (2)
•Calibrate ranking from user feedback
17
![Page 18: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/18.jpg)
Thesis Outline
MiningM1: MultiAspectForensics
M2: QMAS
Querying
Q1: Click Models
Q2: C-DEM
Q3: BEFH
18
![Page 19: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/19.jpg)
Data
•Large-Scale Heterogeneous Networks
19
Port
198.129.1.2131.243.2.10
131.243.2.5
128.3.10.40 128.3.1.50
IP-source IP-destination
80 (HTTP)
80 (HTTP)
993 (IMAP)
![Page 20: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/20.jpg)
Goal
•How can we automatically detect and visualize patterns within a local community of nodes?
20
![Page 21: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/21.jpg)
Preliminary
•Tensor for high-order data representation▫3 data modes: source IP, destination IP,
port #
21
![Page 22: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/22.jpg)
Approach
22
![Page 23: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/23.jpg)
Data Decomposition
•The canonical polyadic (CP) decomposition can factor tensor into a sum of rank-1 tensors
23
![Page 24: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/24.jpg)
Data Decomposition
•A special case is Singular Value Decomposition
24
![Page 25: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/25.jpg)
Attribute Plot
25
How to compute?
![Page 26: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/26.jpg)
Spike Detection
•Iteratively search for spikes in the histogram plot along each data mode.
26
“ “” ”
![Page 27: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/27.jpg)
Substructure Discovery
•Focus on part of the data within the spike
•Categorize into a few subgraph patterns
27
![Page 28: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/28.jpg)
Pattern 1: Generalized Star (1)
28
IP-src’s sending
packets to the same IP-
dst & the same portTypical
client/server system
![Page 29: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/29.jpg)
Pattern 1: Generalized Star (1)
29
A ‘bar’ in a carefully
reordered tensor
![Page 30: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/30.jpg)
Pattern 1: Generalized Star (2)
30
Extending along “Port-Number”
![Page 31: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/31.jpg)
Pattern 1: Generalized Star (2)
31
Port scanning or P2P
Port numbers used in
packets from the same IP-
src to the same IP-dst
![Page 32: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/32.jpg)
Pattern 2: Generalized Bipartite-Core (1)
32
A ‘plane’ in a carefully
reordered tensor
![Page 33: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/33.jpg)
Pattern 2: Generalized Bipartite-Core (1)
33
IP-src’s sending
packets to the same IP-dst’s & the same portClients
talking to a shared server
pool
![Page 34: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/34.jpg)
Pattern 2: Generalized Bipartite-Core (2)
34
A ‘plane’ in a carefully
reordered tensor
![Page 35: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/35.jpg)
Pattern 2: Generalized Bipartite-Core (2)
35
IP-src’s sending
packets over multiple
ports to one IP-dstA multi-
purpose windows server
![Page 36: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/36.jpg)
M1: MultiAspectForensics
•Automatically detects novel patterns in heterogenous networks
36
![Page 37: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/37.jpg)
Thesis Outline
MiningM1: MultiAspectForensics
M2: QMAS
Querying
Q1: Click Models
Q2: C-DEM
Q3: BEFH
37
![Page 38: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/38.jpg)
QMAS: Mining Satellite Imagery (1)•Low-labor labeling
38
Input Output
![Page 39: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/39.jpg)
QMAS: Mining Satellite Imagery (2)•Low-labor labeling•Identification of Representatives
39
![Page 40: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/40.jpg)
QMAS: Mining Satellite Imagery (2)•Low-labor labeling•Identification of Representatives and
Outliers
40
![Page 41: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/41.jpg)
QMAS: Mining Satellite Imagery (2)•Low-labor labeling•Identification of Representatives and
Outliers
41
![Page 42: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/42.jpg)
QMAS: Mining Satellite Imagery (3)•Low-labor labeling•Identification of Representatives and
Outliers•Linear in time & space
42
![Page 43: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/43.jpg)
Thesis Outline
MiningM1: MultiAspectForensics
M2: QMAS
Querying
Q1: Click Models
Q2: C-DEM
Q3: BEFH
43
![Page 44: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/44.jpg)
Web Search
44
![Page 45: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/45.jpg)
User Clicks as Quality Feedback
45
# of total clicks
![Page 46: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/46.jpg)
Motivation
•Leverage the signal from click data to improve search ranking.
46
![Page 47: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/47.jpg)
Click Through Rate (CTR)
•CTR = # of Clicks / # of Impressions
47
![Page 48: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/48.jpg)
Position Bias
48
![Page 49: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/49.jpg)
Relevance of Web Document
•Relevance = CTR @ Position 1
49
# Clicks @ Position 1# Impressions @ Position 1
=
![Page 50: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/50.jpg)
Problem Definition
•Estimate the relevance of web documents given clicks and their positions.
50
![Page 51: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/51.jpg)
Design Goals / Constraints
•Scalable: single-pass, easy to parallel.
•Incremental: real-time updates possible.
•Accurate: consistent with past and future observations.
51
![Page 52: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/52.jpg)
Approach
52
![Page 53: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/53.jpg)
User Behavior Model
53
![Page 54: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/54.jpg)
Last Clicked Position
54
![Page 55: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/55.jpg)
Empirical Results
•Click data after pre-processing▫110K distinct queries, 8.8M query sessions.
•Training time: <6 mins
•Online update:▫Bump impression and click counters▫No data retention required
55
![Page 56: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/56.jpg)
Empirical Results
•Higher log-likelihood indicates better quality.
56
27% accuracy in prediction2% improvement over ICM, the baseline model
![Page 57: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/57.jpg)
Empirical Results
•Position-bias visualized
57
Ground Truth
DCM
![Page 58: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/58.jpg)
Scaling to Terabytes
•265TB data, 1.15B document relevance results,running time on wall clock ~ 3 hours
58
![Page 59: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/59.jpg)
Q1: Click Models
•A statistical approach to leveraging click data for better ranking aware of position-bias.
•They are incremental, more accurate than the baseline, scaling to almost petabyte-scale data.
59
![Page 60: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/60.jpg)
Thesis Outline
MiningM1: MultiAspectForensics
M2: QMAS
Querying
Q1: Click Models
Q2: C-DEM
Q3: BEFH
60
![Page 61: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/61.jpg)
Q2: C-DEM
•A flexible query interface for 3-mode data: images, genes, annotation terms.
61
![Page 62: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/62.jpg)
Q2: C-DEM
62
Images
Terms Genes
![Page 63: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/63.jpg)
Q2: C-DEM
•Solution: random walk with restart on graphs.
63
![Page 64: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/64.jpg)
Thesis Outline
MiningM1: MultiAspectForensics
M2: QMAS
Querying
Q1: Click Models
Q2: C-DEM
Q3: BEFH
64
![Page 65: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/65.jpg)
Q3: BEFH (1)• Bayesian exponential family harmonium• Deriving topical representations for
multimedia corpora (e.g., video snapshots and captions)
65
Input Model
![Page 66: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/66.jpg)
Q3: BEFH (2)• Bayesian exponential family harmonium• Deriving topical representations for
multimedia corpora (e.g., video snapshots and captions)
66
Validation – Synthetic Data
Validation – TRECVID Data
Better Quality
Better Quality
![Page 67: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/67.jpg)
Thesis Outline
MiningM1: MultiAspectForensics
M2: QMAS
Querying
Q1: Click Models
Q2: C-DEM
Q3: BEFH
67
![Page 68: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/68.jpg)
Conclusion
•Data-driven research under the theme of pattern mining and similarity querying.
68
![Page 69: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/69.jpg)
Conclusion
•Data-driven research under the theme of pattern mining and similarity querying.
•An array of practical tasks addressed:▫Internet traffic surveillance (M1)
69
![Page 70: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/70.jpg)
Conclusion
•Data-driven research under the theme of pattern mining and similarity querying.
•An array of practical tasks addressed:▫Internet traffic surveillance (M1)▫Satellite image analysis (M2)
70
![Page 71: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/71.jpg)
Conclusion
•Data-driven research under the theme of pattern mining and similarity querying.
•An array of practical tasks addressed:▫Internet traffic surveillance (M1)▫Satellite image analysis (M2)▫Web search (Q1)
71
![Page 72: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/72.jpg)
Conclusion
•Data-driven research under the theme of pattern mining and similarity querying.
•An array of practical tasks addressed:▫Internet traffic surveillance (M1)▫Satellite image analysis (M2)▫Web search (Q1)▫…
72
![Page 73: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/73.jpg)
Thank You!
•http://www.cs.cmu.edu/~fanguo/dissertation/
73
![Page 74: Mining and Querying Multimedia Data Fan Guo Sep 19, 2011 Committee Members: Christos Faloutsos, Chair Eric P. Xing William W. Cohen Ambuj K. Singh, University](https://reader035.vdocuments.mx/reader035/viewer/2022062520/56649ef65503460f94c09b88/html5/thumbnails/74.jpg)
74