Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 2K5-GS-2-03
Conference information

Generating a Wide Variety of Categorical Data Using Diffusion Models
*Masane FUCHIAmar ZANASHIRHiroto MINAMITomohiro TAKAGI
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Diffusion Models, which have been frequently researched in the field of Computer Vision as a method that outperforms GANs, are not confined to that field, but are spreading to other fields as well. TabDDPM using Diffusion Models have also been proposed for tabular data generation, and its authors claimed it can generate highly accurate data. However, TabDDPM tends to generate similar data as the number of such categories increases, resulting in learning filures because it handles categorical data as one-hot vectors. In order to overcome that problem, in this paper, we propose Table BD (Table Bit Diffusion) by incorporating Bit Diffusion preprocessing method. In our experiments, Table BD can generate data with a larger number of categories than TabDDPM.

Content from these authors
© 2023 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top