Host: Japan SOciety for Fuzzy Theory and intelligent informatics
Co-host: The Korea Fuzzy Logic and Intelligent Systems Society, IEEE Computational Intelligence Society, The International Fuzzy Systems Association, 21th Century COE Program "Creation of Agent-Based Social Systems Sciences"
The feature selection is to select a set of attributes that is relevant for a given task. Due to the presence of irrelevant and redundant attributes, higher predictive accuracy can be expected by selecting only the relevant attributes from the dataset. Conventional feature selection was done after discretization. In this case, however, network size and computation would be increased drastically. In this paper, we introduced a rank-based feature selection (RFS) method using positive correlation learning (PCL). A higher-ranking attribute is the one that has a lower distance from a reference point. We consider an attribute to be relevant if its rank is higher. We empirically proved that if irrelevant features are removed, higher classification accuracy can be achieved. This is because PCL tries to create smaller coherent weights. We applied this approach to diabetes problem. We show empirically that RFS can easily remove the irrelevant features and produce better accuracy and generalization.