What should I cite?
cross-collection reference recommendation of patents and papers
pp. 40-46
Abstrakt
Research results manifest in large corpora of patents and scientific papers. However, both corpora lack a consistent taxonomy and references across different document types are sparse. Therefore, and because of contrastive, domain-specific language, recommending similar papers for a given patent (or vice versa) is challenging.We propose a recommender system that leverages topic distributions and keywords to recommend related work despite these challenges. As a case study, we evaluate our approach on patents and papers of two fields: medical and computer science. We find that topic-based recommenders complement word-based recommenders for documents with collection-specific language and increase mean average precision by up to 27%. As a result of our work, publications from both corpora form a joint digital library, which connects academia and industry.
Publication details
Published in:
Kamps Jaap, Tsakonas Giannis, Manolopoulos Yannis, Iliadis Lazaros, Karydis Ioannis (2017) Research and advanced technology for digital libraries: 21st international conference on theory and practice of digital libraries, TPDL 2017, Thessaloniki, Greece, September 18-21, 2017. Dordrecht, Springer.
Seiten: 40-46
DOI: 10.1007/978-3-319-67008-9_4
Referenz:
Risch Julian, Krestel Ralf (2017) „What should I cite?: cross-collection reference recommendation of patents and papers“, In: J. Kamps, G. Tsakonas, Y. Manolopoulos, L. Iliadis & I. Karydis (eds.), Research and advanced technology for digital libraries, Dordrecht, Springer, 40–46.