IEICE Transactions on Information and Systems
Online ISSN : 1745-1361
Print ISSN : 0916-8532
Augmenting Low-Resource Language Wikipedia through Hyperlink Type Recommendation
Nhu NGUYENHideaki TAKEDA
著者情報
ジャーナル フリー 早期公開

論文ID: 2024EDP7258

詳細
抄録

Wikipedia stands out as a globally utilized linguistic resource available in over 330 languages, attracting contributions from a diverse group of editors on a global scale. Despite its widespread use, significant disparities persist among language publications, including variations in the number of articles, the spectrum of topics covered, and even the number of contributing community editors. In this paper, we aim to alleviate this gap in the coverage of low-resource languages. Although previous work has focused on multilingual interoperability efforts, the potential of hyperlinks has not been fully realized. Therefore, this study introduces a novel approach focused on hyperlinks, specifically emphasizing hyperlink types derived from Wikidata. We extract and analyze patterns related to these hyperlink types across different languages, using them as recommended solutions to connect the topics of various languages, particularly low-resource languages. Collaborative filtering experiments suggest that using combined languages leads to good overall results while preserving the uniqueness of each language.

著者関連情報
© 2025 The Institute of Electronics, Information and Communication Engineers
前の記事 次の記事
feedback
Top