人工知能学会第二種研究会資料
Online ISSN : 2436-5556
Semantic labeling for quantitative data using Wikidata
NGUYEN PhucTAKEDA Hideaki
著者情報
研究報告書・技術報告書 フリー

2018 年 2018 巻 SWO-045 号 p. 04-

詳細
抄録

Semantic labeling for numerical attributes is a process of matching numerical at- tributes in tabular resources to properties and classes in knowledge bases. It can be used in many applications such as table search, table extension, and knowledge augmentation. One of the chal- lenges of this tasks is to distinguish numerical attributes expressed in various scales or units of measurement. Indeed, how to distinguish the similar attributes of "human height - centimeters" and "human height - feet" and the dissimilar attribute "population - million". Previous studies assume the similar attributes expressed in the same scale. In fact, the similar attributes could be expressed differently since the data resource is constructed by different people in different back- ground and context. In this paper, we propose a novel method to improve the performance of semantic labeling for numerical attributes in various scales. We use an external knowledge about unit conversion taken from Wikidata to generate more data resources for the numerical background knowledge bases (WBKB). Our empirical experiments show that using the WBKB can improve the performance of semantic labeling expressed in various scales.

著者関連情報
© 2018 著作者
前の記事 次の記事
feedback
Top