非階層クラスター分析におけるデータの前処理によるクラス分類結果の特徴

日野 良紀; 青木 真吾; 井上 和重

doi:10.14864/fss.40.0_178

40th Fuzzy System Symposium

Session ID : 1F2-2

DOI https://doi.org/10.14864/fss.40.0_178

Conference information

Host: Japan Society for Fuzzy Theory and Intelligent Info rmatics (SOFT)

Name : 40th Fuzzy System Symposium

Number : 40

Location : [in Japanese]

Date : September 02, 2024 - September 04, 2024

proceeding

Data Preprocessing in Non-Hierarchical Cluster Analysis for Characteristics of Classification Results

*Haruki Hino, Shingo Aoki, Kazushige Inoue

Author information

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

Data science is being increasingly emphasized in university education, and student are required to develop the ability to handle data correctly and address complex real-world challenges. In recent years, the use of tools such as generative AI has created an environment where even beginners can easily perform data analysis. Among these techniques, cluster analysis, which can reveal data patterns by grouping similar data points, is widely used. However, the choice of appropriate data preprocessing methods is difficult for beginners, as the results can vary significantly depending on the preprocessing method used. Therefore, this research focuses on the relationship between data preprocessing patterns and analysis results in cluster analysis. By clarifying the characteristics of this relationship, we aim to support beginners in selecting appropriate preprocessing methods. Specifically, we will use financial data from 80 listed companies, apply four different preprocessing patterns, and then perform cluster analysis to identify the differences in results and characteristics that emerge.

Corresponding author

Conference information

Register with J-STAGE for free!