Host: The Japanese Society for Artificial Intelligence
Name : The 38th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 38
Location : [in Japanese]
Date : May 28, 2024 - May 31, 2024
Tabular data analysis is a crucial technique in various fields, including manufacturing and social infrastructure. In real-world scenarios, columns of tabular data may differ between samples due to factors such as variations in data collection sources or the inclusion of additional data contents. Most methods for tabular data analysis assume that the columns of all samples are identical. Consequently, a data analyst must choose between extracting columns that are available in all samples or selecting samples that contain the same columns. To address tabular data with different columns, a method called TransTab has been proposed. However, TransTab overlooks the relationship between column names and categorical values, making it challenging to address samples with the same categorical values but different column names. To mitigate above mentioned issue, we propose a novel approach that fuses features from column names and values. Our method has demonstrated a minimum improvement of 16.1 points in terms of AUROC compared to that of TransTab.