effect of heuristics on serendipity in path-based storytelling with linked data
TRANSCRIPT
Effect of Heuristics on Serendipity in Path-Based Storytelling with Linked Data
Laurens De VochtChristian Beecks, Ruben Verborgh, Erik Mannens, Thomas Seidl, Rik Van de Walle
BA
Introduction
Pathfinding
Semantic Distance
Evaluation
Conclusions & Next Steps
BA
Introduction
Pathfinding
Semantic Distance
Evaluation
Conclusions & Next Steps
BA
?
How to consistently improve and tailor existing pathfinding approaches? [pathfinding]
How well do heuristics effect user expectations so users are able to discover feeling confident about the story facts relevance? [serendipity]
Is semantic distance between facts a good criterion for optimizing the paths forming a story? [user judgments]
6
- trivial
randomness -
familiarity
surprise
sense making
+ discovery
Serendipity
BA
Introduction
Pathfinding
Semantic Distance
Evaluation
Conclusions & Next Steps
8
Original Core Algorithm A* based
A*
h =Jaccard
Distance
w = Common
Node Degree
Optimizations
9
Improved Algorithm Wraps Core
Algorithm
h
w
Domain Delineation
Iterative Refinementto increase semantic relatedness
10
Heuristics [h]
Jaccard
NormalizedDBpedia Distance
Confidence
11
Weights [w]
Jaccard
Jiang-ConrathDistance (JCW)
Common NodeDegree (CND)
BA
Introduction
Pathfinding
Semantic Distance
Evaluation
Conclusions & Next Steps
13
Semantic Distance0.62 via Physics 0.45 via Hume
EinsteinNewton
Physics
Hume
:influences
:discipline
:birthPlace :deathPlace
Semantic Distances
14
Normalized Web Search Distancee.g. Google Distance, Bing Distance…
15
Motivating Example
16
17
18
Semantic Distances (continued)
BA
Introduction
Pathfinding
Semantic Distance
Evaluation
Conclusions & Next Steps
20
Serendipity – Semantic Distance
21
Serendipity – User Judgments
22
Serendipity – User Judgments
Least agreement (high standard deviation): Carl Linnaeus and Albert Einstein [JCWJaccard]Carl Linnaeus and Baruch Spinoza are Expert, Intellectual and Scholar Baruch Spinoza’s and Albert Einstein’s are both Pantheists Intellectuals and Jewish Philosophers
Most relevant and consistent: Charles Darwin and Carl Linnaeus [CNDJaccard]Copley Medal’s the award of Alfred Russel Wallace and Charles Darwin Alfred Russel Wallace’s and Charles Darwin’s awards are Royal Medal and Copley Medal Alfred Russel Wallace and Charles Darwin are known for their Natural selection Carl Linnaeus and Alfred Russel Wallace have as subject ‘Fellows of the Royal Society’ Carl Linnaeus and Alfred Russel Wallace are Biologists and Colleagues
BA
Introduction
Pathfinding
Semantic Distance
Evaluation
Conclusions & Next Steps
24
Conclusions
Reducing the number of arbitrary resources/facts revealed for a story.
Dbpedia example: telling a story with better link estimation, in cases where the original algorithm did not make optimal choices of links.
The most consistent output was generated with the Jaccard distance used both as weight and heuristic; or as heuristic in combination with the Jiang-Conrath distance as weight.
The most arbitrary facts occur in a story when using the combined node degree as weight with the Jaccard distance as heuristic, both in the optimized and the original algorithm.
User judgments confirm the findings for the Jiang-Conrath weight, original algorithm and for the Jaccard distance used as weight and heuristic in terms of discovery.
25
Validate the correlation between the effect of the link estimation on the arbitrariness as perceived by users and computational semantic relatedness measures such as SemRank.
Measure the scalability of the approach by implementing the algorithms: (i) solely on the client, (ii) completely on the sever, and (iii) in a distributed client/server architecture.
Next Steps
Additional questions?@[email protected]://slideshare.net/laurensdv