Transactions of the Japanese Society for Artificial Intelligence

Regular Papers

Original Paper

Improving Link Prediction Accuracy by Summing Similarities between Vertices

元田剛史, 村田剛志

2011, Volume 26, Issue 3, pp. 427-439
Published: 2011
Released: 2011/03/01

DOI: https://doi.org/10.1527/tjsai.26.427

Journal, Free Access

Network analysis has recently been investigated intensively in several fields of science. Link prediction, the problem of predicting whether a link exists between two entities based on observed links, is one of the most popular link mining tasks. Many link prediction methods have been proposed, each with its own merits and demerits. This paper addresses two topics. 1) To derive a strategy for selecting the best link prediction method, we evaluate six methods (Common Neighbors (CN), Jaccard's Coefficient (JC), Adamic/Adar (AA), Shortest Path (SP), Preferential Attachment (PA), and Hierarchical Random Graph (HRG)) on 39 real networks. 2) We propose a new similarity that is a sum of similarities, based on logistic regression. We use 10-fold cross validation and bagging for model selection in the proposed method, and we evaluate the accuracy and computation time of HRG, the proposed method (bagging), and the proposed method (10-fold cross validation) on 28 data sets. For 1), CN, JC, and AA perform well on networks whose clustering coefficient is higher than 0.4. SP performs well on networks whose average shortest path length is greater than 3. PA underperforms the random predictor on networks whose variance of degrees is lower than 0.5. HRG performs consistently well. For 2), both variants of the proposed method (bagging and 10-fold cross validation) achieve higher accuracy than HRG on 17 data sets and finish the calculation faster than HRG. The proposed methods achieve good accuracy on social, citation, dictionary, biological, and transfer (journey) networks, and underperform on trade, circuit, and food web networks. The proposed method (bagging) sometimes achieves higher accuracy than the proposed method (10-fold cross validation), whereas the proposed method (10-fold cross validation) finishes the calculation faster than the proposed method (bagging). In conclusion, the proposed methods finish the calculation faster than HRG and achieve higher accuracy than HRG.
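
The neighborhood-based similarities named in this abstract (CN, JC, AA, PA) and the idea of summing them through a logistic model can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the feature order, and the weights passed to combined_score are hypothetical, and in practice the weights would be fitted by logistic regression on observed links rather than chosen by hand.

```python
import math


def build_adjacency(edges):
    """Build {node: set of neighbors} from a list of undirected edges."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def common_neighbors(adj, u, v):
    """CN: number of neighbors shared by u and v."""
    return len(adj[u] & adj[v])


def jaccard(adj, u, v):
    """JC: shared neighbors divided by the size of the combined neighborhood."""
    union = adj[u] | adj[v]
    return len(adj[u] & adj[v]) / len(union) if union else 0.0


def adamic_adar(adj, u, v):
    """AA: shared neighbors weighted by 1 / log(degree).

    A shared neighbor of two distinct nodes has degree >= 2, so log() > 0.
    """
    return sum(1.0 / math.log(len(adj[z])) for z in adj[u] & adj[v])


def preferential_attachment(adj, u, v):
    """PA: product of the two degrees."""
    return len(adj[u]) * len(adj[v])


def combined_score(adj, u, v, weights, bias=0.0):
    """Weighted sum of the four similarities passed through a logistic function.

    `weights` and `bias` are placeholders (assumption); a real model would
    learn them from training pairs labeled as linked or not linked.
    """
    features = [
        common_neighbors(adj, u, v),
        jaccard(adj, u, v),
        adamic_adar(adj, u, v),
        preferential_attachment(adj, u, v),
    ]
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


# Toy graph: a triangle a-b-c with a pendant node d attached to c.
toy_adj = build_adjacency([("a", "b"), ("a", "c"), ("b", "c"), ("c", "d")])
toy_score = combined_score(toy_adj, "a", "d", weights=[0.5, 1.0, 0.5, 0.1])
```

The score is a value between 0 and 1 that can be used to rank unobserved node pairs; SP and HRG are omitted here because they need more than local neighborhoods.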

Incremental Lexical Knowledge Acquisition from Unsegmented Text Using Graph Kernels

萩原正人, 小川泰弘, 外山勝彦

2011, Volume 26, Issue 3, pp. 440-450
Published: 2011
Released: 2011/04/01

DOI: https://doi.org/10.1527/tjsai.26.440

Journal, Free Access

Extraction of named entity classes and their relationships from large corpora often involves morphological analysis of target sentences and tends to suffer from out-of-vocabulary words. In this paper we propose a semantic category extraction algorithm called Monaka and its graph-based extension g-Monaka, both of which use character n-gram based patterns as context to directly extract semantically related instances from unsegmented Japanese text. These algorithms also use "bidirectional adjacent constraints," which state that reliable instances should be placed between reliable left and right context patterns, in order to improve proper segmentation. The Monaka algorithms use iterative induction of instances and patterns, similarly to the bootstrapping algorithm Espresso. The g-Monaka algorithm further formalizes the adjacency relation of character n-grams as a directed graph and applies the von Neumann kernel and the Laplacian kernel so that the negative effect of semantic drift, i.e., the phenomenon of semantically unrelated general instances being extracted, is reduced. The experiments show that g-Monaka substantially improves the performance of semantic category acquisition compared to conventional methods, including distributional similarity, bootstrapping-based Espresso, and its graph-based extension g-Espresso, in terms of the F-value on the NE category task from unsegmented Japanese newspaper articles.

Solving the Coalition Structure Generation Problem Using Compact Representations of Characteristic Functions

大田直樹, Vincent Conitzer, 一村良, 櫻井祐子, 岩崎敦, 横尾真

2011, Volume 26, Issue 3, pp. 451-460
Published: 2011
Released: 2011/04/01

DOI: https://doi.org/10.1527/tjsai.26.451

Journal, Free Access

This paper presents a new way of formalizing the Coalition Structure Generation problem (CSG), so that we can apply constraint optimization techniques to it. Forming effective coalitions is a major research challenge in AI and multi-agent systems. CSG involves partitioning a set of agents into coalitions so that social surplus is maximized. Traditionally, the input of the CSG problem is a black-box function called a characteristic function, which takes a coalition as an input and returns the value of the coalition. As a result, applying constraint optimization techniques to this problem has been infeasible. However, characteristic functions that appear in practice can often be represented concisely by a set of rules, rather than a single black-box function. In that case, we can solve the CSG problem more efficiently by applying constraint optimization techniques to the compact representation directly. We present new formalizations of the CSG problem by utilizing recently developed compact representation schemes for characteristic functions. We first characterize the complexity of the CSG problem under these representation schemes. In this context, the complexity is driven more by the number of rules than by the number of agents. Furthermore, as an initial step towards developing efficient constraint optimization algorithms for solving the CSG problem, we develop mixed integer programming formulations and show that an off-the-shelf optimization package can perform reasonably well, i.e., it can solve instances with a few hundred agents, while the state-of-the-art algorithm (which does not make use of compact representations) can solve instances with up to 27 agents.
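
To make the problem statement concrete, the following is a minimal sketch of CSG with a rule-based characteristic function. It is an exhaustive search over all partitions, usable only for a handful of agents, and it is not the mixed integer programming formulation from the paper. The rule format (a set of required agents plus a value, summed over every rule a coalition satisfies) is a simplifying assumption made for illustration, and all names are hypothetical.

```python
def partitions(agents):
    """Yield every partition of the list `agents` as a list of coalitions."""
    if not agents:
        yield []
        return
    first, rest = agents[0], agents[1:]
    for part in partitions(rest):
        # Place `first` into each existing coalition in turn ...
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]
        # ... or let it form a new singleton coalition.
        yield [[first]] + part


def coalition_value(coalition, rules):
    """Sum the values of all rules whose required agents are all members.

    `rules` is a list of (required_agents, value) pairs (assumed format).
    """
    members = set(coalition)
    return sum(value for required, value in rules if required <= members)


def best_structure(agents, rules):
    """Return (coalition structure, social surplus) maximizing total value."""
    def surplus(structure):
        return sum(coalition_value(c, rules) for c in structure)

    best = max(partitions(list(agents)), key=surplus)
    return sorted(sorted(c) for c in best), surplus(best)


# Toy instance: agents 1 and 2 work well together, but adding 3 is costly.
toy_rules = [
    (frozenset({1, 2}), 5),
    (frozenset({2, 3}), 4),
    (frozenset({3}), 1),
    (frozenset({1, 2, 3}), -6),
]
toy_structure, toy_surplus = best_structure([1, 2, 3], toy_rules)
```

The number of partitions grows as the Bell numbers, which is why the abstract's point about exploiting the rule structure, rather than enumerating coalitions, matters for larger instances.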

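Building and Using an Ontology to Support Sharing of Practical Knowledge in Medical Services

Modeling Medical Workflows Based on Clinical Paths and Its Use in Interviews for Acquiring Practical Knowledge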

小川泰右, 山崎友義, 池田満, 鈴木斎王, 荒木賢二, 橋田浩一

2011, Volume 26, Issue 3, pp. 461-472
Published: 2011
Released: 2011/04/07

DOI: https://doi.org/10.1527/tjsai.26.461

Journal, Free Access

Ideally, medical services should be patient-oriented. Medical staff members share the final goal of helping patients recover. Toward this goal, each staff member has practical knowledge for achieving patient-oriented medical services. However, each staff member has his or her own sense of values, which comes from his or her expertise, so this practical knowledge sometimes conflicts. The aim of this research is to develop an intelligent system that supports externalizing practical knowledge and sharing it among medical staff members. In this paper, the authors propose a method to model each staff member's sense of values as his or her understanding of the medical service workflow, and to obtain practical knowledge using the models. The method was tested by implementing a knowledge-sharing system based on it and by trial use of that system at Miyazaki University Hospital.

A Tree Kernel Based on Subpaths

木村大翼, 久保山哲二, 渋谷哲朗, 鹿島久嗣

2011, Volume 26, Issue 3, pp. 473-482
Published: 2011
Released: 2011/04/19

DOI: https://doi.org/10.1527/tjsai.26.473

Journal, Free Access

Kernel methods are a promising approach to learning with tree-structured data, and various efficient tree kernels have been proposed to capture informative structures in trees. In this paper, we propose a new tree kernel function based on "subpath sets" to capture vertical structures in tree-structured data, since tree structures are often used to encode hierarchical information in data. We also propose a simple and efficient algorithm for computing the kernel by extending the multikey quicksort algorithm used for sorting strings. The time complexity of the algorithm is O((|T_1|+|T_2|) log(|T_1|+|T_2|)) on average, and the space complexity is O(|T_1|+|T_2|), where |T_1| and |T_2| are the numbers of nodes in the two trees T_1 and T_2. We apply the proposed kernel to two supervised classification tasks: XML classification in web mining and glycan classification in bioinformatics. The experimental results show that the predictive performance of the proposed kernel is competitive with that of the existing efficient tree kernel proposed by Vishwanathan et al., and that the proposed kernel is also empirically faster than the existing kernel.
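
The idea of comparing trees through their vertical paths can be sketched naively as follows. This sketch assumes a subpath is the label sequence along any downward path from a node to one of its descendants (including the node itself) and that the kernel is the plain inner product of subpath counts; the paper's exact definition (for example, any length-dependent weighting) may differ. It enumerates subpaths explicitly, so it does not reach the near-linear complexity of the multikey-quicksort-based algorithm described in the abstract. The (label, children) tuple encoding and the function names are hypothetical.

```python
from collections import Counter


def subpath_counts(tree):
    """Count every vertical label sequence in a tree.

    A tree is a (label, [child, child, ...]) tuple (assumed encoding).
    """
    counts = Counter()

    def visit(node, ancestors):
        label, children = node
        path = ancestors + (label,)
        # Each suffix of the root-to-node path is one downward path
        # that ends at this node.
        for start in range(len(path)):
            counts[path[start:]] += 1
        for child in children:
            visit(child, path)

    visit(tree, ())
    return counts


def subpath_kernel(t1, t2):
    """Inner product of the two subpath count vectors."""
    c1, c2 = subpath_counts(t1), subpath_counts(t2)
    return sum(n * c2[p] for p, n in c1.items())


# Two small trees over the labels a, b, c.
tree_flat = ("a", [("b", []), ("c", [])])   # a has children b and c
tree_chain = ("a", [("b", [("c", [])])])    # a -> b -> c
toy_kernel = subpath_kernel(tree_flat, tree_chain)
```

Because the value is an inner product of count vectors, the function is symmetric and positive semidefinite, which is what allows it to be used with kernel-based classifiers.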
