Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 4K1-GS-9-04
Conference information

Loss function design considering the frequency domain for multidimensional time series data generation based on a diffusion model
*Yuya OKADOMEYutaka NAKAMURA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

In the generation task of multi-dimensional time series data such as human behavior, the spatial loss function like L1 loss is used for training the generative model. If the diffusion probabilistic model is applied to generate time series data, the model generates the data by iterative denoising. The meaningful slight vibration in the data is considered to be denoised. In this study, we propose the loss function which includes spatial and frequency information for training the diffusion model. In the proposed loss function, the original and generated data are projected onto the frequency domain, and the coherence between these frequencies is calculated. We apply the proposed loss function to train the diffusion model for the generation of human motion during dyadic conversation. The result suggests that Frechet Inception Distance is improved by using the frequency property.

Content from these authors
© 2024 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top