Click Models for Web Search
Lecture 3
Aleksandr Chuklin§,¶ Ilya Markov§ Maarten de Rijke§
a.chuklin@uva.nl i.markov@uva.nl derijke@uva.nl
§University of Amsterdam    ¶Google Research Europe
Course overview
Basic Click Models
Parameter Estimation Evaluation
Data and Tools
Results
Applications
Advanced Models
Recent Studies
Future Research
This lecture
Basic Click Models
Parameter Estimation Evaluation
Data and Tools
Results
Applications
Advanced Models
Recent Studies
Future Research
What do click models give us?
General:
Understanding of user behavior
Specific:
Conditional click probabilities
Full click probabilities
Attractiveness and satisfactoriness for query-document pairs
Lecture outline
1 Evaluation: Likelihood, Perplexity, Ranking evaluation
2 Data and tools
3 Results
Evaluation summary
Click model’s output             Evaluation
Conditional click probabilities  Log-likelihood
Full click probabilities         Perplexity
Parameter values                 Ranking evaluation
Lecture outline
1 Evaluation: Likelihood, Perplexity, Ranking evaluation
Likelihood
Likelihood measures how well a click model estimates conditional click probabilities given observed clicks.
$$\mathrm{LL}(M) = \frac{1}{|S|} \sum_{s \in S} \log P_M\!\left(C_1 = c_1^{(s)}, \ldots, C_n = c_n^{(s)}\right)$$

$C_r$ – binary random variable denoting a click at rank $r$
$c_r^{(s)}$ – observed click at rank $r$ in a search session $s$
$P(C_r = c_r^{(s)})$ – probability of observing $c_r^{(s)}$ in session $s$
$P(C_1 = c_1^{(s)}, \ldots, C_n = c_n^{(s)})$ – probability of observing the sequence $c_1^{(s)}, \ldots, c_n^{(s)}$ in session $s$
Likelihood
$$\begin{aligned}
P_M\!\left(C_1 = c_1^{(s)}, \ldots, C_n = c_n^{(s)}\right)
&= P_M\!\left(C_1 = c_1^{(s)}\right) \cdot P_M\!\left(C_2 = c_2^{(s)}, \ldots, C_n = c_n^{(s)} \mid C_1 = c_1^{(s)}\right) \\
&= P_M\!\left(C_1 = c_1^{(s)}\right) \cdot P_M\!\left(C_2 = c_2^{(s)} \mid C_1 = c_1^{(s)}\right) \cdot P_M\!\left(C_3 = c_3^{(s)}, \ldots, C_n = c_n^{(s)} \mid C_1 = c_1^{(s)}, C_2 = c_2^{(s)}\right) \\
&= \prod_{r=1}^{n} P_M\!\left(C_r = c_r^{(s)} \mid C_{<r} = c_{<r}^{(s)}\right)
\end{aligned}$$
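To make the factorization concrete, here is a minimal Python sketch. The `cond_click_prob(r, prev_clicks)` interface is a hypothetical stand-in for a click model's conditional click probability $P_M(C_r = 1 \mid C_{<r})$:

```python
from typing import Callable, Sequence


def joint_click_prob(
    cond_click_prob: Callable[[int, Sequence[int]], float],
    clicks: Sequence[int],
) -> float:
    """Chain rule: P(C_1 = c_1, ..., C_n = c_n) equals the
    product over r of P(C_r = c_r | C_<r = c_<r).

    clicks is the observed 0/1 click vector of one session;
    cond_click_prob(r, prev_clicks) returns P_M(C_r = 1 | C_<r).
    """
    prob = 1.0
    for r, c in enumerate(clicks):
        p = cond_click_prob(r, clicks[:r])
        # The factor is p for an observed click, 1 - p for a skip.
        prob *= p if c == 1 else 1.0 - p
    return prob
```

For a model whose clicks are independent given its parameters (e.g., PBM), `cond_click_prob` simply ignores `prev_clicks`.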
Likelihood: summary
$$\mathrm{LL}(M) = \frac{1}{|S|} \sum_{s \in S} \log P_M\!\left(C_1 = c_1^{(s)}, \ldots, C_n = c_n^{(s)}\right) = \frac{1}{|S|} \sum_{s \in S} \sum_{r=1}^{n} \log P_M\!\left(C_r = c_r^{(s)} \mid C_{<r} = c_{<r}^{(s)}\right)$$

Likelihood measures how well a click model estimates conditional click probabilities given observed clicks.

$$\mathrm{LL}(M) \in [-\infty, 0]$$
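The per-rank form maps directly onto code. A sketch under the same hypothetical `cond_click_prob` interface as above; note how a single zero-probability observation drives the likelihood to $-\infty$:

```python
import math
from typing import Callable, List, Sequence


def log_likelihood(
    cond_click_prob: Callable[[int, Sequence[int]], float],
    sessions: List[Sequence[int]],
) -> float:
    """LL(M) = (1/|S|) * sum over sessions s and ranks r of
    log P_M(C_r = c_r^(s) | C_<r = c_<r^(s))."""
    total = 0.0
    for clicks in sessions:
        for r, c in enumerate(clicks):
            p = cond_click_prob(r, clicks[:r])
            p_observed = p if c == 1 else 1.0 - p
            # math.log(0) raises, so handle impossible observations explicitly.
            total += math.log(p_observed) if p_observed > 0 else float("-inf")
    return total / len(sessions)
```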
Lecture outline
1 Evaluation: Likelihood, Perplexity, Ranking evaluation
Perplexity
Perplexity measures how well a click model estimates full click probabilities (i.e., when clicks are not observed).
$$p_r(M) = 2^{-\frac{1}{|S|} \sum_{s \in S} \log_2 P_M\left(C_r^{(s)} = c_r^{(s)}\right)}$$

$$p_r(M) \in [1, 2]$$
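As a sketch, perplexity at a single rank can be computed from the model's full (unconditional) click probabilities on the test sessions; both argument names are illustrative:

```python
import math
from typing import Sequence


def perplexity_at_rank(
    predicted: Sequence[float],  # P_M(C_r = 1) for each session at this rank
    observed: Sequence[int],     # the 0/1 click for each session at this rank
) -> float:
    """p_r(M) = 2 ** (-(1/|S|) * sum_s log2 P_M(C_r^(s) = c_r^(s))).

    A perfect model scores 1.0; predicting 0.5 everywhere scores 2.0.
    """
    log_sum = sum(
        math.log2(p if c == 1 else 1.0 - p)
        for p, c in zip(predicted, observed)
    )
    return 2.0 ** (-log_sum / len(observed))
```

Overall perplexity is then commonly reported as the average of $p_r(M)$ over all ranks.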
Lecture outline
1 Evaluation: Likelihood, Perplexity, Ranking evaluation
Ranking evaluation
Documents are ranked by the parameter values a click model has learned (here, the attractiveness parameters $\alpha_{u_i q}$ serve as the estimated relevance $\widehat{\mathrm{Rel}}_i$), and the resulting ranking is scored against editorial relevance labels $\mathrm{Rel}_i$:

$\widehat{\mathrm{Rel}}_i$    $\mathrm{Rel}_i$
$\alpha_{u_1 q}$              4
$\alpha_{u_2 q}$              2
$\alpha_{u_3 q}$              1
$\alpha_{u_4 q}$              4
$\alpha_{u_5 q}$              2

$$\mathrm{DCG} = \sum_{i=1}^{n} \frac{2^{\mathrm{Rel}_i} - 1}{\log_2(i + 1)}$$
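A minimal sketch of the computation behind this protocol (the NDCG variant, used in the large-scale results later, normalizes DCG by the ideal ordering):

```python
import math
from typing import List


def dcg(ranked_labels: List[int]) -> float:
    """DCG = sum_i (2^Rel_i - 1) / log2(i + 1), with ranks starting at 1."""
    return sum(
        (2 ** rel - 1) / math.log2(i + 1)
        for i, rel in enumerate(ranked_labels, start=1)
    )


def ndcg_at_k(ranked_labels: List[int], k: int) -> float:
    """NDCG@k: DCG of the produced ranking divided by the ideal DCG."""
    ideal = sorted(ranked_labels, reverse=True)
    return dcg(ranked_labels[:k]) / dcg(ideal[:k])
```

For example, if sorting by the estimated $\alpha_{uq}$ values puts the labels above in the order [4, 4, 2, 2, 1], then `ndcg_at_k([4, 4, 2, 2, 1], 5)` is 1.0, since the ordering is ideal.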
Evaluation summary
Click model’s output             Evaluation
Conditional click probabilities  Log-likelihood
Full click probabilities         Perplexity
Parameter values                 Ranking evaluation
Lecture outline
1 Evaluation
2 Data and tools
3 Results
Datasets
AOL2006: raw queries and clicked documents (no SERPs)
MSN2006: contains only clicked documents (no SERPs)
Workshop on Web Search Click Data (WSCD)
WSCD2012: predict document relevance
WSCD2013: detect search engine switch
WSCD2014: search personalization
SogouQ
Tsinghua University: eye fixation
Dataset statistics
Dataset      Queries     URLs         Users      Sessions
AOL 2006     10,154,742  1,632,788    657,426    21,011,340
MSN 2006     8,831,280   4,975,897    –          7,470,915
SogouQ 2012  8,939,569   15,095,269   9,739,704  25,530,711
WSCD 2012    30,717,251  117,093,258  –          146,278,823
WSCD 2013    10,139,547  49,029,185   956,536    17,784,583
WSCD 2014    21,073,569  70,348,426   5,736,333  65,172,853
Software
Click model packages
clickmodels project by Aleksandr Chuklin
PyClick by Ilya Markov et al.
Infer.NET
General-purpose languages
Octave
Matlab
Lecture outline
1 Evaluation
2 Data and tools
3 Results: Log-likelihood, Perplexity, Training time, Large-scale evaluation
Experimental setup
Data
first 1M query sessions from the WSCD 2012 dataset
75% for training, 25% for testing
repeat 15 times, each time with the next 1M sessions
PyClick
50 iterations for EM
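The split-and-repeat protocol above is straightforward to express in code; a sketch, assuming `all_sessions` is the chronologically ordered list of query sessions:

```python
from typing import Iterator, Sequence, Tuple


def sliding_folds(
    all_sessions: Sequence,
    fold_size: int = 1_000_000,
    n_folds: int = 15,
    train_frac: float = 0.75,
) -> Iterator[Tuple[Sequence, Sequence]]:
    """Yield (train, test) pairs: each fold covers the next 1M query
    sessions, with the first 75% for training and the last 25% for testing."""
    for k in range(n_folds):
        fold = all_sessions[k * fold_size : (k + 1) * fold_size]
        split = int(len(fold) * train_frac)
        yield fold[:split], fold[split:]
```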
Studied click models
CTR models: counting clicks
Position-based model (PBM): examination and attractiveness
Cascade model (CM): previous examinations and clicks matter
Dynamic Bayesian network model (DBN): satisfactoriness
User browsing model (UBM): rank of previous click
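As a quick refresher on two of these models, here is a sketch of their full click probabilities (standard formulas; variable names are illustrative):

```python
from typing import List


def pbm_click_prob(gamma_r: float, alpha_uq: float) -> float:
    """PBM: P(C_r = 1) = gamma_r * alpha_uq.
    Examination (gamma) depends only on the rank; attractiveness (alpha)
    only on the query-document pair."""
    return gamma_r * alpha_uq


def cm_click_prob(alphas: List[float], r: int) -> float:
    """CM: the user scans top-down and stops at the first click, so
    P(C_r = 1) = alpha_r * prod_{j < r} (1 - alpha_j)."""
    prob_reached = 1.0
    for j in range(r):
        prob_reached *= 1.0 - alphas[j]  # no click at each higher rank
    return alphas[r] * prob_reached
```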
Lecture outline
3 Results: Log-likelihood, Perplexity, Training time, Large-scale evaluation
Log-likelihood
[Bar chart: test-set log-likelihood for RCM, RCTR, DCTR, PBM, CM, UBM, SDCM, CCM, DBN, SDBN]

Cascade model: LL = −∞ (CM assigns zero probability to sessions with more than one click)
Complex models (DBN, UBM) win over simple ones
Many examination parameters win over few: UBM > PBM
Satisfaction parameters help: DBN > PBM
Lecture outline
3 Results: Log-likelihood, Perplexity, Training time, Large-scale evaluation
Perplexity
[Bar chart: test-set perplexity for RCM, RCTR, DCTR, PBM, CM, UBM, SDCM, CCM, DBN, SDBN; values between 1.0 and 1.5]
Complex models win over simple ones
Most complex models have similar perplexity
Perplexity by rank
[Line chart: perplexity by rank (1–10) for GCTR, RCTR, DCTR, PBM, CM, UBM, DCM, CCM, DBN, SDBN]
Picture taken from A. Grotov, A. Chuklin, I. Markov, L. Stout, F. Xumara, and M. de Rijke. A comparative study of click models for web search. In CLEF. Springer, September 2015.
Lecture outline
3 Results: Log-likelihood, Perplexity, Training time, Large-scale evaluation
Training time
[Bar chart: training time in seconds for RCM, RCTR, DCTR, PBM, CM, UBM, SDCM, CCM, DBN, SDBN; up to ~4,000 seconds]
MLE is much faster than EM
PBM and UBM are fast enough compared to DBN
Lecture outline
3 Results: Log-likelihood, Perplexity, Training time, Large-scale evaluation
Experimental setup
Full WSCD 2012 dataset
146,278,823 query sessions
30,717,251 unique queries
117,093,258 unique URLs
41,275 relevance labels (for 4,991 queries)
50% for training, 50% for testing
PyClick
Log-likelihood and perplexity
Click model  Perplexity  Log-likelihood
DBN          1.3510      −0.2824
DCM          1.3627      −0.3613
CCM          1.3692      −0.3560
UBM          1.3431      −0.2646
UBM is the best in terms of predicting user click behavior
UBM has the largest number of examination parameters (55)
Ranking evaluation
             NDCG
Click model  @1     @3     @5     @10
DBN          0.717  0.725  0.764  0.833
DCM          0.736  0.746  0.780  0.844
CCM          0.741  0.752  0.785  0.846
UBM          0.724  0.737  0.773  0.838
CCM is the best in terms of ranking
Not covered in this course (but covered in the book)
Lecture 3 summary
Click model’s output             Evaluation          Best model
Conditional click probabilities  Log-likelihood      UBM
Full click probabilities         Perplexity          UBM
Parameter values                 Ranking evaluation  CCM
–                                Training time       MLE-based
Course overview
Basic Click Models
Parameter Estimation Evaluation
Data and Tools
Results
Applications
Advanced Models
Recent Studies
Future Research
Up next
Practical Session 1