Comparing Personalized Pagerank and Activation Spreading in Wikipedia Diagram-Based Search

Published In

2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)

Document Type

Citation

Publication Date

12-2021

Abstract

Diagram Navigation (DN) is based on using existing diagrams for a domain as maps to navigate and query a collection from different perspectives. With a relatively small number of manual connections, such as ones between diagram concepts and related documents, a domain expert can integrate their perspective of a domain (depicted in a diagram) into the navigation system of a collection. DN utilizes the abundance of internal connections in a collection, such as Wikipedia hyperlinks to access the entire collection. In a Diagram-to-Content (D2C) query, an end user selects a diagram concept to retrieve a ranked list of related collection documents. In a Content-to-Diagram (C2D) query, DN highlights related concepts in a diagram based on document(s) selected by the user. To increase D2C ranking performance, we study and tune Personalized PageRank and an energy-spreading algorithm. We report key differences in how the algorithms rank D2C queries. We show that the tested algorithms are affected differently by Wikipedia graph structures, such as categories and hyperlinks from article templates. We also show that diagrams not only can provide overviews, but they also positively bias the ranking of D2C queries.

Rights

Copyright 2021 IEEE

DOI

10.1109/JCDL52503.2021.00016

Persistent Identifier

https://archives.pdx.edu/ds/psu/36883

Publisher

IEEE

Share

COinS