電気学会論文誌C(電子・情報・システム部門誌)
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<ソフトウェア・情報処理>
蛋白質立体構造データを利用した文献からの蛋白質相互作用記述文抽出方式
兼田 佳和Md. Ahaduzzaman Munna大川 剛直
著者情報
ジャーナル フリー

2005 年 125 巻 5 号 p. 690-697

詳細
抄録
Because a protein expresses its function through interaction with other substrates, it is vital to create a database of protein interaction. Since the total volume of information on protein interaction is described in terms of thousands of literatures, it is nearly impossible to extract all this information manually. Although extraction systems for interaction information based on the template matching method have already been developed, it is not possible to match all the sentences with interaction information due to the extent of sentence complexity.
We propose a method of extracting sentences with interaction information independent of sentence structure. In a protein-compound complex structure, the interacting residue is near to its partner. The distance between them can be calculated by using the structure data in the PDB database, with a short distance indicating that the sentences associated with them might describe the interaction information. In a free-protein structure, the distance cannot be calculated because the coordinates of the protein's partner are not registered in the structure data. Hence, we use the homology protein structure data, which is complexed with the protein's parter.
The proposed method was applied to seven literatures written about protein-compound complexes and four literatures written about free proteins, obtaining F-measures of 71% and 72%, respectively.
著者関連情報
© 電気学会 2005
前の記事 次の記事
feedback
Top