IEEJ Transactions on Electronics, Information and Systems (電気学会論文誌C)
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
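<Information Processing and Software>
Identification and Classification of Research Data Cited in Scholarly Papers
Masaya Tsunokake (角掛 正弥), Shigeki Matsubara (松原 茂樹)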

2020, Vol. 140, No. 12, pp. 1357-1364

Abstract

This paper proposes a method for identifying and classifying the research data cited in scholarly papers, with the aim of automatically generating metadata for data repositories. The study focuses on URL citations in scholarly papers: the goal is to identify the URLs that refer to research data and to classify them as either tools or data. The task is formulated as multi-class classification (tool/data/others). The method acquires distributed representations of URLs from their surrounding context and uses them as input features, which has the advantage that the meaning of a URL can be derived from the words around it. The representation of an entire URL is computed from the representations of its components. To evaluate the proposed method, URL classification experiments were conducted on scholarly papers from the proceedings of an international conference. The results show that the proposed method is effective for identifying and classifying URLs that refer to research data.
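
The idea of computing a representation for a whole URL from its components can be illustrated with a minimal sketch. The abstract does not state how components are delimited or how their vectors are combined, so the splitting rule (host labels and path segments) and the averaging step below are assumptions, and the embedding values and example URL are invented for illustration. In the proposed method the component vectors would be learned from the context around URL citations; here they are hard-coded.

```python
from urllib.parse import urlparse


def url_components(url):
    """Split a URL into lowercase host labels and path segments (assumed rule)."""
    parsed = urlparse(url)
    parts = parsed.netloc.split(".") + parsed.path.split("/")
    return [p.lower() for p in parts if p]


def compose(components, embeddings, dim):
    """Average the vectors of known components (assumed composition function).

    Returns a zero vector when no component has a vector.
    """
    known = [embeddings[c] for c in components if c in embeddings]
    if not known:
        return [0.0] * dim
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


# Hypothetical 2-dimensional component vectors, not taken from the paper.
toy_embeddings = {
    "github": [1.0, 0.0],
    "com": [0.0, 0.0],
    "corpus": [0.0, 1.0],
}

# Hypothetical URL used only to exercise the functions.
components = url_components("https://github.com/example/corpus")
vector = compose(components, toy_embeddings, dim=2)
```

The resulting `vector` could then serve as the input feature of any three-class (tool/data/others) classifier. Components without a vector, such as `example` here, are simply skipped.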

© 2020 The Institute of Electrical Engineers of Japan