Text Data Analysis Using an Inner Product Space Based on Fuzzy Neighborhoods and Kernel Methods

河崎 佑一; 宮本 定明

doi:10.3156/jsoft.21.461

Abstract

A fuzzy neighborhood model for analyzing text data is proposed. The model can represent sequential structure in a set of texts, which traditional methods such as the vector space model cannot, because they simply count the occurrences of words in a text. Moreover, the fuzzy neighborhood model generalizes both the vector space model and fuzzy equivalence relations. An advantage of the model is that it yields a positive definite kernel for data analysis. Accordingly, we apply it to text analysis using kernel c-means clustering and kernel principal component analysis. Two examples are shown: analyses of newspaper articles and of medical incident reports.
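To illustrate why a positive definite kernel is enough for both analyses named in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes only that a Gram matrix K is available. The abstract does not define the fuzzy neighborhood kernel, so the sketch substitutes a linear kernel on toy word-count vectors (the plain vector space model, which the abstract describes as a special case). The function names `center_gram`, `kernel_pca`, and `kernel_c_means`, the toy data, and the initial labels are all hypothetical.

```python
import numpy as np

def center_gram(K):
    """Center a Gram matrix so the implicit feature vectors have zero mean."""
    n = K.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n
    return J @ K @ J

def kernel_pca(K, n_components=2):
    """Return the scores of the n items on the leading kernel principal components."""
    Kc = center_gram(K)
    eigvals, eigvecs = np.linalg.eigh(Kc)            # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]  # pick the largest ones
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # Training-point scores are sqrt(lambda) * eigenvector; clip tiny negatives.
    return eigvecs * np.sqrt(np.clip(eigvals, 0.0, None))

def kernel_c_means(K, init_labels, n_clusters, n_iter=100):
    """Hard kernel c-means: all distances are computed from K alone."""
    labels = np.asarray(init_labels).copy()
    n = K.shape[0]
    for _ in range(n_iter):
        dist = np.full((n, n_clusters), np.inf)
        for c in range(n_clusters):
            idx = np.flatnonzero(labels == c)
            if idx.size == 0:
                continue  # an empty cluster keeps infinite distance
            # ||phi(x_k) - m_c||^2 = K_kk - 2 * mean_j K_kj + mean_{j,l} K_jl
            dist[:, c] = (np.diag(K)
                          - 2.0 * K[:, idx].mean(axis=1)
                          + K[np.ix_(idx, idx)].mean())
        new_labels = dist.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels

# Hypothetical toy data: rows are texts, columns are word counts.
X = np.array([[2, 1, 0, 0],
              [1, 2, 0, 0],
              [0, 0, 1, 2],
              [0, 0, 2, 1]], dtype=float)
K = X @ X.T  # linear kernel, standing in for the fuzzy neighborhood kernel

scores = kernel_pca(K, n_components=2)
labels = kernel_c_means(K, init_labels=[0, 0, 1, 0], n_clusters=2)
```

Neither function ever touches the feature vectors themselves, so replacing `K` with any other positive definite Gram matrix, such as one derived from a fuzzy neighborhood model, would leave the rest of the sketch unchanged. Like ordinary c-means, the clustering result depends on the initial labels.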
