Information and Media Technologies
Online ISSN : 1881-0896
ISSN-L : 1881-0896
Computing
Mining Sequential Patterns More Efficiently by Reducing the Cost of Scanning Sequence Databases
Jiahong WangYoshiaki AsanumaEiichiro KodamaToyoo TakataJie Li
著者情報
ジャーナル フリー

2007 年 2 巻 1 号 p. 163-177

詳細
抄録
Sequential pattern mining is a useful technique used to discover frequent subsequences as patterns in a sequence database. Depending on the application, sequence databases vary by number of sequences, number of individual items, average length of sequences, and average length of potential patterns. In addition, to discover the necessary patterns in a sequence database, the support threshold may be set to different values. Thus, for a sequential pattern-mining algorithm, responsiveness should be achieved for all of these factors. For that purpose, we propose a candidate-driven pattern-growth sequential pattern-mining algorithm called FSPM (Fast Sequential Pattern Mining). A useful property of FSPM is that the sequential patterns concerning a user-specified item can be mined directly. Extensive experimental results show that, in most cases FSPM outperforms existing algorithms. An analytical performance study shows that it is the inherent potentiality of FSPM that makes it more effective.
著者関連情報
© 2007 by Information Processing Society of Japan
前の記事 次の記事
feedback
Top