Transactions of the Japanese Society for Artificial Intelligence (人工知能学会論文誌)
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
Volume 24, Issue 6
Showing articles 1-17 of 17 from the selected issue
Regular Papers
Original Papers
  • 秋本 洋平, 永田 裕一, 佐久間 淳, 小野 功, 小林 重信
    2009, Volume 24, Issue 6, pp. 446-458
    Published: 2009
    Released: 2009/08/04
    Journal Free Access
    Once premature convergence occurs, evolutionary algorithms for function optimization can no longer explore the search space and fail to find the optimum, so this notorious drawback must be addressed. This paper proposes two novel approaches to overcoming premature convergence in real-coded genetic algorithms (RCGAs). The first idea is to control the sampling region of the crossover by adapting an expansion rate. The second idea is to accelerate the movement of the population by descending the mean of the crossover. Finally, we propose a crossover, called AREX (adaptive real-coded ensemble crossover), that combines the expansion-rate adaptation technique and the crossover mean descent technique. The performance of a real-coded GA using AREX is evaluated on several benchmark functions, including functions whose landscapes form ridge or multi-peak structures, both of which are likely to lead to premature convergence. The experimental results show not only that the proposed method can locate the global optima of functions on which existing GAs have difficulty discovering them, but also that our approach outperforms the existing one in the number of function evaluations on all the functions. Our approach enlarges the class of functions that real-coded GAs can solve.
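    As a rough illustration only (not the authors' exact AREX formulation), the following Python sketch combines the two ideas from the abstract: a multi-parent crossover whose sampling region is scaled by an expansion rate adapted from recent success, and a sampling mean shifted toward the better parents. All names and update rules here are simplified assumptions.

      # Minimal sketch of the two ideas behind AREX (expansion-rate adaptation
      # and crossover mean descent); the rules below are simplified assumptions.
      import numpy as np

      def arex_like_crossover(parents, fitness, alpha, n_children, rng):
          """parents: (mu, n) array; fitness: (mu,) array (lower is better)."""
          mu, n = parents.shape
          order = np.argsort(fitness)              # best parent first
          # Rank-based weights pull the sampling mean toward better parents
          # (the "crossover mean descent" idea, as a simple linear weighting).
          w = np.linspace(2.0 / mu, 0.0, mu)       # weights sum to 1
          mean = w @ parents[order]
          centered = parents - parents.mean(axis=0)
          children = []
          for _ in range(n_children):
              xi = rng.normal(0.0, 1.0 / np.sqrt(mu), size=mu)
              children.append(mean + alpha * xi @ centered)   # alpha = expansion rate
          return np.array(children)

      def adapt_alpha(alpha, n_improved, n_children):
          """Expand the sampling region when many children improved on the parents,
          shrink it otherwise (a crude stand-in for the paper's adaptation)."""
          return alpha * (1.2 if n_improved > n_children / 4 else 0.9)

      rng = np.random.default_rng(0)
      parents = rng.normal(size=(6, 3))
      fitness = (parents ** 2).sum(axis=1)         # sphere function as a toy target
      kids = arex_like_crossover(parents, fitness, alpha=1.0, n_children=10, rng=rng)
      alpha = adapt_alpha(1.0, n_improved=4, n_children=10)
      print(kids.shape, alpha)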
  • Frameworks and a Survey
    申 吉浩
    2009, Volume 24, Issue 6, pp. 459-468
    Published: 2009
    Released: 2009/08/07
    Journal Free Access
    This paper proposes two frameworks to be used in engineering tree kernels. One ensures that the resulting tree kernels are positive semidefinite, while the other provides efficient algorithms to compute the kernels based on dynamic programming. The first framework provides a method to construct tree kernels using primitive kernels for simpler structures (e.g., labels and strings) as building blocks, together with an easy-to-check sufficient condition for the resulting tree kernels to be positive semidefinite. The second framework provides a set of algorithm templates for calculating a wide range of tree kernels in O(|X|^3)- or O(|X|^2)-time, where |X| denotes the number of vertices of the trees.
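    As a concrete, self-contained example of the kind of dynamic-programming computation the second framework targets (not one of the paper's own templates), the sketch below counts common co-rooted subtrees of two labeled ordered trees by memoizing over vertex pairs.

      # Illustrative DP tree kernel: count common co-rooted subtrees of two
      # labeled ordered trees by recursing over vertex pairs with memoization.
      class Node:
          def __init__(self, label, children=()):
              self.label = label
              self.children = list(children)

      def collect(t):
          out, stack = [], [t]
          while stack:
              n = stack.pop()
              out.append(n)
              stack.extend(n.children)
          return out

      def tree_kernel(t1, t2):
          memo = {}

          def c(a, b):   # common subtrees co-rooted at vertices a and b
              key = (id(a), id(b))
              if key not in memo:
                  if a.label != b.label or len(a.children) != len(b.children):
                      memo[key] = 0.0
                  else:
                      prod = 1.0
                      for ca, cb in zip(a.children, b.children):
                          prod *= 1.0 + c(ca, cb)
                      memo[key] = prod
              return memo[key]

          return sum(c(a, b) for a in collect(t1) for b in collect(t2))

      t1 = Node("S", [Node("NP", [Node("dog")]), Node("VP", [Node("runs")])])
      t2 = Node("S", [Node("NP", [Node("dog")]), Node("VP", [Node("barks")])])
      print(tree_kernel(t1, t2))   # 10.0 matching co-rooted subtree pairs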
  • 乾 孝司, 村上 浩司, 橋本 泰一, 内海 和夫, 石川 正道
    2009, Volume 24, Issue 6, pp. 469-479
    Published: 2009
    Released: 2009/08/07
    Journal Free Access
    This paper presents a method for boosting the performance of organization name recognition, which is a part of named entity recognition (NER). Although gazetteers (lists of NEs) are known to be one of the effective features for supervised machine learning approaches to the NER task, previous methods applied gazetteers to NER in a very simple way: the gazetteers were used only to search for exact matches between the input text and the NEs they contain. The proposed method generates regular expression rules from gazetteers, and with these rules it realizes high-coverage searches based on looser matches between the input text and NEs. To generate these rules, we focus on two well-known characteristics of NE expressions: 1) most NE expressions can be divided into two parts, a class-reference part and an instance-reference part, and 2) for most NE expressions the class-reference part is located in the suffix position. A pattern mining algorithm is run on the set of NEs in the gazetteers to find frequent word sequences from which NEs are constructed. Then, only the word sequences that have a class-reference part in the suffix position are adopted as suffix rules. Experimental results showed that our proposed method improved the performance of organization name recognition and achieved an F-value of 84.58 on the evaluation data.
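    The sketch below illustrates the general idea only (not the authors' exact algorithm): frequent suffix word sequences are mined from a gazetteer of organization names as class-reference parts, then turned into loose-match regular expressions.

      # Illustrative sketch: mine frequent suffix sequences (class-reference
      # parts such as 大学 or 株式会社) from a gazetteer, then build regexes
      # that also match unlisted organization names sharing those suffixes.
      import re
      from collections import Counter

      def mine_suffix_rules(gazetteer, tokenize, min_count=2, max_len=2):
          counts = Counter()
          for name in gazetteer:
              tokens = tokenize(name)
              for k in range(1, max_len + 1):
                  if len(tokens) > k:
                      counts[tuple(tokens[-k:])] += 1
          return [suffix for suffix, c in counts.items() if c >= min_count]

      def rules_to_regex(suffix_rules):
          patterns = []
          for suffix in suffix_rules:
              tail = "".join(re.escape(t) for t in suffix)
              # some instance-reference part followed by the mined suffix
              patterns.append(re.compile(r"\w{1,10}" + tail))
          return patterns

      gazetteer = ["東北 大学", "京都 大学", "日本 電気 株式会社", "豊田 自動車 株式会社"]
      rules = mine_suffix_rules(gazetteer, tokenize=str.split)   # toy whitespace tokenizer
      for pattern in rules_to_regex(rules):
          print(pattern.pattern)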
  • 西原 陽子, 佐藤 圭太, 砂山 渡
    2009, Volume 24, Issue 6, pp. 479-487
    Published: 2009
    Released: 2009/08/18
    Journal Free Access
    There are many opportunities to transmit text information on the Web. Since texts on the Web are not always written by professional writers, they may not be coherent and may be hard to comprehend. Readers therefore need considerable time and effort to grasp the topic relevance of a text.
    This paper describes the HINATA system, which visualizes texts using light and shadow based on topic relevance. A topic is defined as a set of words, such as the nouns contained in the title of a text. Light marks sentences related to the topic, and shadow marks sentences unrelated to it. This visualization method efficiently supports users in finding the parts related to a topic and in grasping the relations between the sentences of a text and the topic. Experimental results showed that the proposed system could support users in understanding how a text is related to a topic.
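    A minimal sketch of the underlying scoring idea (the relevance measure and threshold here are assumptions, not the HINATA internals): each sentence is scored by its overlap with the topic words taken from the title, and the score decides whether it is rendered as light or shadow.

      # Assumed, simplified version of topic-relevance shading: sentences that
      # share enough words with the title's topic set are shown in "light".
      def topic_relevance(sentence_words, topic_words):
          if not topic_words:
              return 0.0
          return len(set(sentence_words) & set(topic_words)) / len(topic_words)

      def shade(sentences, title, threshold=0.3):
          topic = set(title.split())
          for s in sentences:
              score = topic_relevance(s.split(), topic)
              tag = "LIGHT " if score >= threshold else "SHADOW"
              print(f"[{tag}] ({score:.2f}) {s}")

      shade(
          ["topic relevance guides the shading of each sentence",
           "the weather was pleasant that day"],
          title="visualizing topic relevance of text",
      )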
Short Papers
  • 多田 知道, 岩沼 宏治, 鍋島 英知
    2009, Volume 24, Issue 6, pp. 488-493
    Published: 2009
    Released: 2009/08/18
    Journal Free Access
    This paper presents a new method for extracting important words from newspaper articles based on time-sequence information. This word extraction method plays an important role in event sequence mining. TF-IDF is a well-known method for ranking the importance of words in a document. However, the TF-IDF method does not consider the time information embedded in sequential textual data, which is peculiar to newspapers. In this research, we propose a new word-extraction method, called the TF-IDayF method, which considers time-sequence information and can extract important or characteristic words expressing sequential events. The TF-IDayF method does not rely on the so-called burst phenomenon of topic word occurrences, which has been studied by many researchers. The TF-IDayF method is quite simple, yet effective and easy to compute in sequential text mining. We evaluate the proposed method from three points of view, i.e., semantic, statistical, and data mining viewpoints, through several experiments.
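    The abstract does not give the TF-IDayF formula, so the sketch below is only one plausible reading for illustration: the document-frequency term of TF-IDF is replaced by a "day frequency", the number of distinct publication days on which a word appears.

      # Assumed contrast between TF-IDF and a TF-IDayF-style weight; the exact
      # definition used in the paper may differ.
      import math

      def tf_idf(word, doc, docs):
          tf = doc.count(word)
          df = sum(1 for d in docs if word in d)
          return tf * math.log(len(docs) / df) if df else 0.0

      def tf_idayf(word, doc, dated_docs):
          """dated_docs: list of (date, token_list) pairs from a newspaper stream."""
          tf = doc.count(word)
          n_days = len({date for date, _ in dated_docs})
          day_f = len({date for date, d in dated_docs if word in d})
          return tf * math.log(n_days / day_f) if day_f else 0.0

      articles = [
          ("2009-06-01", "earthquake hits northern region".split()),
          ("2009-06-01", "markets react to earthquake".split()),
          ("2009-06-02", "weather stays calm in the capital".split()),
      ]
      docs = [d for _, d in articles]
      print(tf_idf("earthquake", docs[0], docs), tf_idayf("earthquake", docs[0], articles))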
Original Papers
  • 戸田 浩之, 安田 宜仁, 奥村 学, 松浦 由美子, 片岡 良治
    2009, Volume 24, Issue 6, pp. 494-506
    Published: 2009
    Released: 2009/08/18
    Journal Free Access
    Geographic information retrieval (GIR) aims at the retrieval of geography-related documents based not only on keyword relevance but also on geographic relationships between the query and the geographic information in the texts. However, how to present search results in GIR has not been studied well, especially with regard to generating snippets that reflect the geographic part of the query. This paper proposes a novel snippet generation method. Our method first converts geographic phrases in the target text into geographic coordinates and then scores each of them according to its distance from the query using those coordinates. Next, it extracts fragments of the target text based on the distribution of the query keywords and the geographic scores, and presents the combined fragments as a snippet. Evaluations are conducted with regard to two different aspects, both of which confirm the effectiveness of our method.
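    A simplified sketch of the scoring step only (the decay function, weights, and pre-resolved coordinates are assumptions for illustration): each text fragment gets a keyword score plus a geographic score that decays with the distance between the query location and the coordinates of geocoded phrases in the fragment.

      # Assumed fragment scoring for geographic snippets: keyword hits plus a
      # distance-decayed score for phrases already converted to coordinates.
      import math

      def haversine_km(lat1, lon1, lat2, lon2):
          rlat1, rlat2 = math.radians(lat1), math.radians(lat2)
          dlat, dlon = math.radians(lat2 - lat1), math.radians(lon2 - lon1)
          a = math.sin(dlat / 2) ** 2 + math.cos(rlat1) * math.cos(rlat2) * math.sin(dlon / 2) ** 2
          return 2 * 6371.0 * math.asin(math.sqrt(a))

      def geo_score(fragment_coords, query_coord, scale_km=50.0):
          return sum(math.exp(-haversine_km(lat, lon, *query_coord) / scale_km)
                     for lat, lon in fragment_coords)

      def fragment_score(tokens, coords, query_terms, query_coord, w_geo=1.0):
          keyword = sum(tokens.count(t) for t in query_terms)
          return keyword + w_geo * geo_score(coords, query_coord)

      tokens = "a quiet cafe near the station in Kichijoji".split()
      coords = [(35.703, 139.580)]                     # coordinates for "Kichijoji"
      print(fragment_score(tokens, coords, ["cafe"], query_coord=(35.690, 139.700)))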
  • 柴田 雅博, 冨浦 洋一, 西口 友美
    2009, Volume 24, Issue 6, pp. 507-519
    Published: 2009
    Released: 2009/09/04
    Journal Free Access
    We propose an open-ended dialog system that generates an appropriate sentence in response to a user's utterance, using the abundant documents on the World Wide Web as sources. Existing knowledge-based dialog systems give meaningful information to a user, but they are unsuitable for open-ended input. The Eliza system can handle open-ended input, but it gives no meaningful information. Our system lies between these two types of dialog systems: it converses on various topics and gives meaningful information related to the user's utterances. The system selects an appropriate sentence as a response from documents gathered from the Web, on the basis of surface cohesion and shallow semantic coherence. The surface cohesion follows centering theory, and the semantic coherence is calculated on the basis of the conditional distribution and inverse document frequency of content words (nouns, verbs, and adjectives). We developed a trial system that converses about movies and experimentally found that the proposed method generated appropriate responses 66% of the time.
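    A hedged sketch of the response-selection step: candidate Web sentences are ranked by a shallow coherence score built from content-word co-occurrence counts weighted by inverse document frequency. The cohesion check based on centering theory is omitted here, and the formula is a simplification rather than the paper's definition.

      # Assumed, simplified ranking of candidate responses by shallow coherence
      # with the user's utterance; the statistics are toy stand-ins for counts
      # gathered from Web documents.
      import math

      def coherence(utterance_words, candidate_words, cooc, df, n_docs):
          score = 0.0
          for w in candidate_words:
              idf = math.log(n_docs / df[w]) if df.get(w) else 0.0
              cond = sum(cooc.get((u, w), 0) for u in utterance_words)
              score += idf * cond
          return score

      def select_response(utterance, candidates, cooc, df, n_docs):
          u = utterance.split()
          return max(candidates, key=lambda c: coherence(u, c.split(), cooc, df, n_docs))

      cooc = {("movie", "actor"): 5, ("movie", "director"): 4}
      df = {"actor": 30, "director": 20, "weather": 80}
      print(select_response("I watched a movie yesterday",
                            ["the actor gave a strong performance",
                             "the weather will be sunny tomorrow"],
                            cooc, df, n_docs=100))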
  • 吉川 克正, リーデル セバスチャン, 浅原 正幸, 松本 裕治
    2009, Volume 24, Issue 6, pp. 521-530
    Published: 2009
    Released: 2009/10/20
    Journal Free Access
    Recent work on temporal relation identification has focused on three types of relations between events: temporal relations between an event and a time expression, between a pair of events, and between an event and the document creation time. These types of relations have mostly been identified in isolation by pairwise comparison. However, this approach neglects logical constraints between temporal relations of different types that we believe to be helpful. We therefore propose a Markov Logic model that jointly identifies relations of all three types. By evaluating our model on the TempEval data, we show that this approach leads to about 2% higher accuracy for all three types of relations, and to the best results for the task among machine-learning-based systems.
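    As a toy illustration of the kind of cross-type constraint a joint model can exploit (this is not a Markov Logic implementation), the check below flags an inconsistent triple of predictions that three isolated pairwise classifiers could easily produce.

      # Assumed simplified labels; a transitivity constraint across the three
      # relation types (event-time, time-DCT, event-DCT).
      def violates_transitivity(event_time, time_dct, event_dct):
          if event_time == "BEFORE" and time_dct == "BEFORE" and event_dct != "BEFORE":
              return True
          if event_time == "AFTER" and time_dct == "AFTER" and event_dct != "AFTER":
              return True
          return False

      # Isolated classifiers could output this inconsistent triple; a joint model
      # is pushed away from it by such constraints.
      print(violates_transitivity("BEFORE", "BEFORE", "AFTER"))   # True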
  • 酒井 浩之, 野中 尋史, 増山 繁
    2009, Volume 24, Issue 6, pp. 531-540
    Published: 2009
    Released: 2009/10/20
    Journal Free Access
    We propose a method for extracting information on technical effects from patent documents. The information on technical effects extracted by our method is useful for automatically generating patent maps (see, e.g., Figure 1 of the paper) or for analyzing technical trends from patent documents. Our method extracts expressions containing information on technical effects by using frequent expressions and clue expressions that are effective for extracting them. The frequent expressions and clue expressions are extracted automatically by using statistical information and initial clue expressions. Our method extracts expressions containing information on technical effects without predetermined hand-crafted patterns, and it is expected to be applicable to other tasks that acquire expressions with a particular meaning (e.g., information on the means for solving a problem), not limited to information on technical effects. Our method achieves not only high precision (78.0%) but also high recall (77.6%) by acquiring such clue expressions automatically from patent documents.
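    The sketch below illustrates the overall flow only (seed clue expressions, statistically promoted candidates, then extraction of effect sentences); the promotion score is a simple frequency ratio, not the authors' statistics.

      # Assumed bootstrap: promote words that mostly occur in sentences already
      # matched by the seed clues, then use the enlarged clue set for extraction.
      from collections import Counter

      def find_clue_candidates(sentences, seed_clues, min_ratio=0.5, min_count=2):
          in_effect, overall = Counter(), Counter()
          for s in sentences:
              words = s.split()
              overall.update(words)
              if any(c in s for c in seed_clues):
                  in_effect.update(words)
          return [w for w, c in in_effect.items()
                  if c >= min_count and c / overall[w] >= min_ratio]

      def extract_effect_sentences(sentences, clues):
          return [s for s in sentences if any(c in s for c in clues)]

      seeds = ["improve"]
      sents = ["this invention can improve durability significantly",
               "the coating can reduce corrosion significantly",
               "figure 2 shows the apparatus",
               "this invention can improve yield significantly"]
      clues = seeds + find_clue_candidates(sents, seeds)
      print(extract_effect_sentences(sents, clues))   # also catches the second sentence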
Special Papers: "Near-Future Challenge"
Original Papers
  • 西山 莉紗, 竹内 広宜, 渡辺 日出雄, 那須川 哲哉
    2009, Volume 24, Issue 6, pp. 541-548
    Published: 2009
    Released: 2009/10/20
    Journal Free Access
    It is important for R&D managers, consultants, and other people seeking broad knowledge of technology fields to survey technical literature such as research papers, white papers, and technology news articles. One important kind of information for such readers concerns the effectiveness of new technologies in their own businesses. General search engines are good at selecting documents that reveal the details of a specific technology or technology field, but it is hard to obtain useful information about how a technology would apply to individual business cases from such search results. There is a need for a technology-survey assistance tool that helps users find technologies with suitable capabilities. In this paper, two technical tasks were tackled to develop a prototype of this assistance tool: extraction of advantage phrases, and scoring of the advantage phrases to find novel applications in the target technology field. We describe a new method for identifying advantage phrases in technical documents and a scoring function that gives higher scores to novel applications of a technology. The evaluation results showed that our phrase identification method, which uses only a few phrasal patterns, performs almost as well as human annotators, and that the proposed scoring conforms better to the decisions made by professionals than a random ordering.
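    A hedged sketch of the two tasks: a couple of hand-written phrasal patterns stand in for the paper's advantage-phrase patterns, and "novelty" is approximated by how rarely the extracted application appears in the target field's documents; both are assumptions for illustration.

      # Assumed patterns and a rarity-based novelty proxy, not the paper's method.
      import re
      from collections import Counter

      PATTERNS = [re.compile(r"can be used for ([\w ]+)"),
                  re.compile(r"enables ([\w ]+)")]

      def extract_advantages(sentences):
          found = []
          for s in sentences:
              for p in PATTERNS:
                  found.extend(m.strip() for m in p.findall(s))
          return found

      def novelty_scores(phrases, field_corpus_phrases):
          counts = Counter(field_corpus_phrases)
          # Rarer in the field corpus -> higher score.
          return sorted(((1.0 / (1 + counts[p]), p) for p in phrases), reverse=True)

      docs = ["the new membrane can be used for water purification",
              "the alloy enables lightweight aircraft frames"]
      field_corpus = ["water purification", "water purification", "battery electrodes"]
      print(novelty_scores(extract_advantages(docs), field_corpus))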
  • Near-Future Challenge Kickoff Edition
    中山 浩太郎, 伊藤 雅弘, Maike ERDMANN, 白川 真澄, 道下 智之, 原 隆浩, 西尾 章治郎
    2009, Volume 24, Issue 6, pp. 549-557
    Published: 2009
    Released: 2009/10/20
    Journal Free Access
    Wikipedia, a collaborative Wiki-based encyclopedia, has become a huge phenomenon among Internet users. It covers a huge number of concepts in various fields such as arts, geography, history, science, sports, and games. As a corpus for knowledge extraction, Wikipedia's impressive characteristics are not limited to its scale; they also include its dense link structure, URL-based word sense disambiguation, and concise anchor texts. Because of these characteristics, Wikipedia has become a promising corpus and a new frontier for research. In the past few years, a considerable number of studies have been conducted in areas such as semantic relatedness measurement, bilingual dictionary construction, and ontology construction. Extracting machine-understandable knowledge from Wikipedia to enhance the intelligence of computational systems is the main goal of "Wikipedia Mining," a project under CREP (Challenge for Realizing Early Profits) in JSAI. In this paper, we take a comprehensive, panoramic view of Wikipedia Mining research and the current status of our challenge. We then discuss the future vision of this challenge.
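    As one small, concrete example of the kind of computation Wikipedia Mining research builds on, the sketch below measures the semantic relatedness of two articles as the Jaccard overlap of their incoming links in a toy link graph; this is a simplification, not a specific published measure.

      # Assumed toy relatedness measure over Wikipedia-style incoming links.
      def relatedness(article_a, article_b, inlinks):
          a, b = inlinks.get(article_a, set()), inlinks.get(article_b, set())
          return len(a & b) / len(a | b) if (a | b) else 0.0

      inlinks = {
          "Machine learning": {"Artificial intelligence", "Statistics", "Data mining"},
          "Data mining":      {"Artificial intelligence", "Statistics", "Databases"},
          "Sushi":            {"Japanese cuisine", "Rice"},
      }
      print(relatedness("Machine learning", "Data mining", inlinks))   # 0.5
      print(relatedness("Machine learning", "Sushi", inlinks))         # 0.0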
  • 坪井 利樹, 北村 光司, 西田 佳史, 本村 陽一, 高野 太刀雄, 山中 龍宏, 溝口 博
    2009, Volume 24, Issue 6, pp. 558-568
    Published: 2009
    Released: 2009/10/20
    Journal Free Access
    This paper proposes a new technology, a "bodygraphic injury surveillance system (BISS)," that not only accumulates accident situation data but also represents injury data in a standardized, multilayered way based on a human body coordinate system. Standardized, multilayered representation of injuries enables the accumulation, retrieval, sharing, statistical analysis, and causal modeling of injuries across different fields such as medicine, engineering, and industry. To confirm the effectiveness of the developed system, the authors collected data on 3,685 childhood injuries in cooperation with a hospital. As new analyses based on the developed BISS, this paper presents bodygraphic statistical analysis and childhood injury modeling using the BISS and Bayesian network technology.
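    The data model below is only a guess at what a standardized, body-coordinate-based injury record could look like; all field names and the coordinate convention are assumptions, added to make the idea of cross-field accumulation and statistics concrete.

      # Hypothetical standardized injury record keyed to a body coordinate system.
      from dataclasses import dataclass

      @dataclass
      class InjuryRecord:
          age_months: int
          product: str          # object involved in the accident
          injury_type: str      # e.g., "laceration", "burn"
          body_part: str        # standardized body-part label
          body_xy: tuple        # normalized (x, y) on a body-surface map
          situation: str = ""   # free-text accident situation

      def count_by_body_part(records):
          counts = {}
          for r in records:
              counts[r.body_part] = counts.get(r.body_part, 0) + 1
          return counts

      records = [
          InjuryRecord(18, "table corner", "laceration", "forehead", (0.50, 0.08)),
          InjuryRecord(30, "kettle", "burn", "right hand", (0.78, 0.55)),
      ]
      print(count_by_body_part(records))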
  • A Conversation Support Service That Generates Opportunities for Social Interaction through the Coimagination Method
    大武 美保子
    2009, Volume 24, Issue 6, pp. 569-576
    Published: 2009
    Released: 2009/10/20
    Journal Free Access
    The purpose of this study is to explore a sustainable service in which continuous validation is possible, through the development of a support service for the prevention of and recovery from dementia, towards a science of lethe. We designed and implemented a conversation support service based on the coimagination method, using the multiscale service design method, both of which were proposed by the author. Interactive conversation supported by the coimagination method generates social interaction so as to prevent the progression of dementia. The multiscale service model consists of tool, event, human, network, style, and rule. Service elements at the different scales of tool, event, and human were developed according to the model. First, we developed a conversation interactivity measuring method in order to measure the intensity of cognitive activities for the prevention of dementia (event). Second, an education program for learning the coimagination method was designed and provided in order to bring out the social intelligence of participants and instructors (human). Third, the relationship between social intelligence and the prevention of dementia is discussed based on the experimental data (tool).
Errata