activity ranking in linked in feed

20
Presented By : Ranjith Kumar Bodla 1 Authors: Deepak Agarwal Bee-Chung Chen Liang Zhang et al Activity Ranking in LinkedIn Feed

Upload: bodla-kumar

Post on 13-Apr-2017

32 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: Activity ranking in linked in feed

1

Presented By : Ranjith Kumar Bodla

Authors: Deepak Agarwal

Bee-Chung Chen

Liang Zhang et al

Activity Ranking in LinkedIn Feed

Page 2: Activity ranking in linked in feed

2 LinkedIn Feed Straw man approaches Activities on LinkedIn Feed Freshness System Architecture Model and Features Desktop Bucket Test Results

Outline

Page 3: Activity ranking in linked in feed

3 Professional network Heterogeneous updates

More than 40 types. Shared articles, job changes

connection updates etc.

Challenges Large scale (313M+ members) Relevance & Personalization Freshness, diversity, user

fatigue

How do we rank activities?

LinkedIn Feed

Page 4: Activity ranking in linked in feed

4 Reverse chronological

ranking(recency) Fresh but not relevant

Ranking by social popularity Likes, a useful signal CTR not monotonically

related Not all activities have

likes

Straw man approaches

Page 5: Activity ranking in linked in feed

5 Taxonomy

Each activity represented as a triple (actor type, verb type, object type)

Connection : (member, connect, member) Opinion: (member, like, article)

What happens if we simply rank by CTR?

Activities on LinkedIn Feed

Page 6: Activity ranking in linked in feed

6 Connection activities

Symmetric connection (Member connection) Asymmetric connection (Following)

Informational activities (Messages, Articles, Pictures) Profile activities (Profile pictures, job position, contact

info) Opinion activities (like and comment) Site-specific activities

Unique about LinkedIn - job anniversaries, endorsements of users’ skills and job recommendations.

Activities on LinkedIn Feed

Page 7: Activity ranking in linked in feed

7

Taxonomy of Activities on LinkedIn Feed

Page 8: Activity ranking in linked in feed

8

Two aspects of freshness - First, the age of an activity since its

creation. Second, the number of times a user has

seen a particular item in the past. Differ significantly for different users.

For example, a heavy user might see an activity several times within the first hour of its lifetime.

Freshness

Page 9: Activity ranking in linked in feed

9

LinkedIn are connected via a professional network, we can study how the connection relationship between a viewer and an actor affects whether the viewer would click on an activity of the actor.

Demographic similarities based on age, gender and education, etc.

Experience similarities based on the common companies, job positions, etc.

Skill similarities based on the common skills and similar skills.

Geo similarities based on the locations of the viewer and the actor at different resolutions (e.g., city, state and country).

Social network similarities based on connections of the viewer and the actor.

Connection Relationship

Page 10: Activity ranking in linked in feed

10

Page 11: Activity ranking in linked in feed

11 Training data collection

Requires randomization

Personalization features E.g., viewer type affinity, viewer actor affinity

Large scale logistic regression via Alternating Direction Method of Mulitpliers (ADMM) Scalable, distributed algorithm

Offline evaluation Unbiased estimation via replay

Relevance via CTR prediction

Page 12: Activity ranking in linked in feed

12

System Architecture

Page 13: Activity ranking in linked in feed

13 A score is required to rank the activities in the

candidate pool. To predict the CTR of each activity, learn a logistic

regression model. y denote the click on the activity described by

feature vector x then the corresponding Bernoulli random variable Y is modeled as

θ is the parameter vector which we want to learn.

Model and Features

Page 14: Activity ranking in linked in feed

14

Large Scale Logistic Regression Via ADMM

Page 15: Activity ranking in linked in feed

15

Large Scale Logistic Regression Via ADMM

Page 16: Activity ranking in linked in feed

16

Large Scale Logistic Regression Via ADMM

Page 17: Activity ranking in linked in feed

This approach typically require greedily

picking the next activity based on the current set of items.

It is also important to keep the feed fresh. we perform an additional exponential decay on

the score based on the age of activities. Repeated impressions of the same activity is

also discouraged by introducing an additional decay factor that takes into account past impression counts of the activity.

17Reranking

Page 18: Activity ranking in linked in feed

18 Effect of freshness. Impression Discounting. Model Personalization. Combining All The Features.

Desktop Bucket Test Results

Page 19: Activity ranking in linked in feed

19

Good ranking model are found by continuously testing of different models with different features.

The models learned from data may not have all the characteristics we desire

Different platforms (such as mobile vs desktop) have different characteristics.

Personalizing our models for user’s tastes and behavior often gives large improvements in performance.

Conclusion

Page 20: Activity ranking in linked in feed

20

Questions ?