Kevin C. Chang
About the collaboration -- Cazoodle
Coming next week: Vacation Rental Search
How do you greet people in your culture?
What have you been searching lately?
What have you been searching lately?
The university and areas of Kevin Chang?
The email of Marc Snir?
Customer service phone number of Amazon?
What profs are doing databases at UIUC?
The papers and presentations of SIGMOD 2007?
Due date of SIGMOD 2008?
Sale price of “Canon PowerShot A400”?
“Hamlet” books available at bookstores?
The Web is a Big Library. Huge Supermarket!
Queries can be anything, too!
Search Engine
Are there certain “regularities” to exploit?
Let’s try out…
Survey 1: How likely does a query follow a pattern?
9 out of 10 samples share a pattern with others!
Survey 2: How likely do queries in a domain follow patterns w.r.t. pre-specified attributes?
Over 28,000 manually labeled queries:
In some domains, over 90% of queries follow a pattern.
Survey 3: How many patterns are there?
Hundreds of patterns are needed to cover 80% of queries.
Simple concept: What is Query Template?
(this paper) Sequence of keywords and attributes:
#celebrity affairs
#category jobs in #location
#movie showtimes in #zipcode
…
(In general) Patterns that can be induced from queries, e.g., regular expressions.
How would such templates be useful?
We advocate Rich Query Interpretation.
t = “#category jobs in #location” for Job
q = “accounting jobs in chicago”
By matching query q to template t:
1) Intent Classifier: recognize intended domain. q → Job
2) Query Parser: recognize associated attributes. #category = “accounting”, #location = “chicago”
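The matching step can be sketched in a few lines. This is a minimal illustration, assuming a tiny hand-written vocabulary (`VOCAB`) and hypothetical helper names (`compile_template`, `interpret`); it is not the system described in the talk, only one way a template could drive both intent classification and attribute parsing.

```python
import re

# Hypothetical mini-vocabulary for the Job domain; a real schema would be far larger.
VOCAB = {
    "#category": {"accounting", "marketing", "nursing"},
    "#location": {"chicago", "boston", "new york"},
}

def compile_template(template, vocab):
    """Turn '#category jobs in #location' into a regex with one named group per attribute."""
    parts = []
    for token in template.split():
        if token.startswith("#"):
            # Longest alternatives first so multi-word values are tried before shorter ones.
            values = sorted(vocab[token], key=len, reverse=True)
            alternatives = "|".join(re.escape(v) for v in values)
            parts.append(f"(?P<{token[1:]}>{alternatives})")
        else:
            parts.append(re.escape(token))
    return re.compile(r"\s+".join(parts))

def interpret(query, templates, vocab):
    """Return (domain, attributes) for the first template matching the whole query, else None."""
    normalized = " ".join(query.lower().split())
    for domain, template in templates:
        match = compile_template(template, vocab).fullmatch(normalized)
        if match:
            attributes = {f"#{name}": value for name, value in match.groupdict().items()}
            return domain, attributes
    return None

TEMPLATES = [("Job", "#category jobs in #location")]
print(interpret("accounting jobs in chicago", TEMPLATES, VOCAB))
```

The domain label comes from the template that matched (intent classification), and the named groups give the attribute bindings (query parsing).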
Rich query interpretation is useful.
Tailored responses by query patterns:
Finding results directly: no longer 10 blue links.
Ranking results: relevant to attributes desired.
Dispatching verticals: bring verticals into search.
Matching ads: more likely to click.
Query: Finding flights
Query: Finding movie showtimes
Query: Finding weather
But many more patterns can be leveraged!
Now, how to systematically discover such templates?
Problem: Query Template Discovery
Given:
Query log L, e.g., we use the MSN query log 2006.
Domain schema D, e.g., (#category, #location, #title) with vocabulary. Incomplete schema can be handled, too.
Seed knowledge (queries, sites, templates, or a mix), e.g., 5 queries; or 2 sites; or 2 templates.
Output: “Good” templates T* = {t1, t2, …}
t1 = #location jobs
t2 = #location #category positions
……..
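To make the inputs concrete, here is a small sketch of how candidate templates might be enumerated from a log and a schema vocabulary before any quality scoring. The schema contents, the toy log, and the function names are assumptions for illustration, and only single-word attribute values are handled.

```python
from collections import Counter
from itertools import product

# Hypothetical schema vocabulary and toy log (the talk itself uses the MSN 2006 query log).
SCHEMA = {
    "#location": {"chicago", "boston"},
    "#company": {"motorola", "microsoft"},
    "#category": {"marketing", "accounting"},
}
LOG = [
    "jobs in chicago",
    "jobs in boston",
    "jobs in motorola",
    "marketing jobs in motorola",
    "401k plans",
]

def candidate_templates(query, schema):
    """Yield every template made by replacing at least one token with a matching attribute."""
    options = []
    for token in query.split():
        attrs = [attr for attr, values in sorted(schema.items()) if token in values]
        options.append([token] + attrs)  # keep the keyword, or abstract it to an attribute
    for choice in product(*options):
        if any(tok.startswith("#") for tok in choice):
            yield " ".join(choice)

def template_counts(log, schema):
    """Count, for each candidate template, how many log queries instantiate it."""
    counts = Counter()
    for query in log:
        counts.update(set(candidate_templates(query, schema)))
    return counts

counts = template_counts(LOG, SCHEMA)
for template, n in counts.most_common():
    print(n, template)
```

Raw counts like these only capture popularity; deciding which candidates are “good” is what the quality metrics and the inference steps are for.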
Step 1: Define quality metrics.
How to measure quality of templates?
Some templates are more “popular,” e.g., “#city1 #city2”, “#make #model”.
Some templates are more “accurate,” e.g., “#city1 #city2 flights”, “#location #make used cars”.
Precision:
Recall:
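The slide's formulas did not survive extraction, so the following is only a sketch under the usual set-based reading: precision is assumed to be the fraction of a template's matching queries that belong to the domain, and recall the fraction of the domain's queries that the template matches. The toy queries and labels are hypothetical.

```python
def precision_recall(template_instances, domain_queries):
    """Set-based precision and recall of one template against labeled in-domain queries.

    template_instances: set of queries that instantiate the template.
    domain_queries: set of queries labeled as belonging to the domain.
    """
    hits = template_instances & domain_queries
    precision = len(hits) / len(template_instances) if template_instances else 0.0
    recall = len(hits) / len(domain_queries) if domain_queries else 0.0
    return precision, recall

# Hypothetical labeled queries for a Flight domain.
FLIGHT_QUERIES = {
    "chicago boston",
    "chicago boston flights",
    "boston denver",
    "cheap flights to denver",
}
# Queries instantiating "#city1 #city2": popular, but not all are about flights.
POPULAR = {"chicago boston", "boston denver", "portland seattle", "austin dallas"}
# Queries instantiating "#city1 #city2 flights": accurate, but it covers less.
ACCURATE = {"chicago boston flights"}

print(precision_recall(POPULAR, FLIGHT_QUERIES))
print(precision_recall(ACCURATE, FLIGHT_QUERIES))
```

On this toy data the “popular” template trades precision for coverage, while the “accurate” one does the opposite, which is the tension the two metrics are meant to expose.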
Step 2: From seeds, infer templates with good quality.
1) Can P and R be “inferred”? (or, estimated.)
Probabilistic Recall:
Probabilistic Precision:
2) What relationships can we use to infer? Log → QST (“Quest”) Graph
Sites S: s1: monster.com; s2: motorola.com; s3: us401k.com
Queries Q: q1: jobs in chicago; q2: jobs in boston; q3: jobs in microsoft; q4: jobs in motorola; q5: marketing jobs in motorola; q6: 401k plans; q7: illinois employment statistics
Templates T: t1: jobs in #location; t2: jobs in #company; t3: #category jobs in #company; t4: #location employment statistics
[Figure: Quest graph connecting templates t1 to t4, queries q1 to q7, and sites s1 to s3 with weighted edges; the edge weights cannot be matched to edges in this extraction.]
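A graph of this shape can be assembled from a log roughly as follows. This is a sketch under assumptions: query-site edges are taken to be weighted by clicks, the click counts are invented for illustration, and `build_quest_graph` is a hypothetical helper name.

```python
from collections import defaultdict

# Query -> site click counts. These counts are hypothetical placeholders.
CLICKS = {
    "jobs in chicago": {"monster.com": 10},
    "jobs in boston": {"monster.com": 8},
    "jobs in motorola": {"monster.com": 2, "motorola.com": 5},
    "401k plans": {"us401k.com": 4},
}
# Query -> templates it instantiates, following the slide's examples.
INSTANCES = {
    "jobs in chicago": ["jobs in #location"],
    "jobs in boston": ["jobs in #location"],
    "jobs in motorola": ["jobs in #company"],
    "401k plans": [],
}

def build_quest_graph(clicks, instances):
    """Return an undirected weighted adjacency map over ('q' | 's' | 't', name) nodes."""
    graph = defaultdict(dict)

    def add_edge(u, v, weight):
        graph[u][v] = graph[u].get(v, 0) + weight
        graph[v][u] = graph[v].get(u, 0) + weight

    for query, sites in clicks.items():
        for site, count in sites.items():
            add_edge(("q", query), ("s", site), count)
    for query, templates in instances.items():
        for template in templates:
            add_edge(("q", query), ("t", template), 1)
    return dict(graph)

graph = build_quest_graph(CLICKS, INSTANCES)
for node, neighbors in graph.items():
    print(node, "->", neighbors)
```

Queries sit in the middle: they link to sites through clicks and to templates through instantiation, so evidence about any one kind of seed can flow to the other two.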
3) How to infer on this graph?
Duality of Random Walk:
When we walk back and forth, we are inferring precision and recall, respectively.
R(t) is forward random walk from seeds.
P(t) is backward random walk to seeds.
Recall is forward random walk from seeds.
[Figure: forward random walk over the Quest graph; labels include nodes x, q, t and R0(x).]
Recall is just like (personalized) PageRank.
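As an illustration of the analogy, here is a generic personalized PageRank computed by power iteration on a toy graph. It is a sketch of the standard technique, not the exact Quest update rules (which are not recoverable from the slides); the graph, `alpha`, and the node names are assumptions.

```python
def forward_walk(graph, seeds, alpha=0.85, iterations=50):
    """Personalized PageRank by power iteration.

    graph: {node: {neighbor: weight}}, every node having at least one neighbor.
    seeds: nodes where the walk restarts; they share the restart mass equally.
    Returns a score per node; the scores sum to 1.
    """
    nodes = list(graph)
    restart = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    score = dict(restart)
    for _ in range(iterations):
        nxt = {n: (1 - alpha) * restart[n] for n in nodes}
        for u in nodes:
            total = sum(graph[u].values())
            for v, w in graph[u].items():
                # Node u passes its score forward in proportion to edge weight.
                nxt[v] += alpha * score[u] * w / total
        score = nxt
    return score

# Toy undirected graph: queries q*, sites s*, templates t*.
GRAPH = {
    "q1": {"s1": 1, "t1": 1},
    "q2": {"s1": 1, "t1": 1},
    "q3": {"s1": 1, "s2": 1, "t2": 1},
    "q6": {"s3": 1},
    "s1": {"q1": 1, "q2": 1, "q3": 1},
    "s2": {"q3": 1},
    "s3": {"q6": 1},
    "t1": {"q1": 1, "q2": 1},
    "t2": {"q3": 1},
}

scores = forward_walk(GRAPH, seeds={"q1"})
for node in ("t1", "t2", "s3"):
    print(node, round(scores[node], 4))
```

Mass that starts at the seed query spreads forward through sites and queries to templates, so templates closer to the seeds collect more of it, and nodes unreachable from the seeds collect none.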
Precision is backward random walk to seeds.
Precision is harmonic energy minimization.
[Figure: backward random walk over the Quest graph; labels include nodes x, q, t and P0(x).]
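Harmonic energy minimization can be sketched as repeated neighbor averaging with some node values held fixed. The version below is the generic harmonic-function iteration, not necessarily the talk's exact formulation; pinning one out-of-domain query to 0.0 is an assumption made here so that the solution is not trivially 1 everywhere.

```python
def backward_walk(graph, fixed, iterations=200):
    """Harmonic solution: fixed nodes keep their value, others average their neighbors.

    graph: {node: {neighbor: weight}}; fixed: {node: value in [0, 1]}.
    The resulting value of a node can be read as the chance that a random walk
    starting there reaches a node fixed at 1 before a node fixed at 0.
    """
    value = {n: fixed.get(n, 0.5) for n in graph}
    for _ in range(iterations):
        for n in graph:
            if n in fixed:
                continue
            total = sum(graph[n].values())
            value[n] = sum(w * value[m] for m, w in graph[n].items()) / total
    return value

# Toy graph: q1..q3 are in-domain seed queries, q4 is a known out-of-domain query,
# q5 is unlabeled and instantiates both templates.
GRAPH = {
    "q1": {"#city1 #city2 flights": 1},
    "q2": {"#city1 #city2 flights": 1},
    "q3": {"#city1 #city2": 1},
    "q4": {"#city1 #city2": 1},
    "q5": {"#city1 #city2 flights": 1, "#city1 #city2": 1},
    "#city1 #city2 flights": {"q1": 1, "q2": 1, "q5": 1},
    "#city1 #city2": {"q3": 1, "q4": 1, "q5": 1},
}
FIXED = {"q1": 1.0, "q2": 1.0, "q3": 1.0, "q4": 0.0}

values = backward_walk(GRAPH, FIXED)
for node in ("#city1 #city2 flights", "#city1 #city2", "q5"):
    print(node, round(values[node], 4))
```

The more specific template ends up with the higher value because every labeled query that instantiates it is in-domain, mirroring the “accurate” versus “popular” contrast from Step 1.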
Experimental results
Quest finds templates effectively using the inferred P and R, achieving 90% on the actual F-measure.
Top results:
Thank You!
And they did the real work…
Ganesh Agarwal, Govind Kabra