人工知能学会論文誌
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
原著論文
生命科学の複数LODの統合による目的別タンパク質分子間相互作用Linked Open Dataの構築
込山 悠介番野 雅城鑓水 優行加藤 文彦大向 一輝武田 英明清水 謙多郎
著者情報
ジャーナル フリー

2014 年 29 巻 4 号 p. 356-363

詳細
抄録

Researchers of agriculture, life science and drug design of the need to acquire information that combines two or more life science databases for problem solving. Semantic Web technologies are already necessary for data integration between those databases. This study introduces a technique of utilizing RDF (Resource Description Framework) and OWL (Web Ontology Language) as a data set for development of a machine learning predictor of interactomics. Also, for SPARQL (SPARQL Protocol and RDF Query Language) we sketched the implementing method of interactomics LOD (Linked Open Data) in the graph database. Interactomics LOD has included the pairs of protein--protein interactions of tyrosine kinase, the pairs of amino acid residues of sugar (carbohydrate) binding proteins, and cross-reference information of the protein chain among an entry of major bioscience databases since 2013. Finally, we designed three RDF schema models and made access possible using AllegroGraph 4.11 and Virtuoso 7. The number of total triples was 1,824,859,745 in these databases. It could be combined with public LOD of the life science domain of 28,529,064,366 triples and was able to be searched. We showed that it was realistic to deal with large-scale LOD on a comparatively small budget by this research. The cost cut by LOD decreased not only expense but development time. Especially RDF-SIFTS (Structure Integration with Function, Taxonomy and Sequence) that is an aggregate of 10 small LOD was constructed in the short period of BioHackathon 2013 or was developed in one week. We could say that we can obtain quickly a data set required for the machine learning of interactomics by using LOD. We set up the interactomics LOD for application development as a database. SPARQL endpoints of these databases are exhibited on the portal site UTProt (The University of Tokyo Protein, http://utprot.net).

著者関連情報
© 人工知能学会 2014
次の記事
feedback
Top