Proceedings of the Annual Conference of Biomedical Fuzzy Systems Association
Online ISSN : 2424-2586
Print ISSN : 1345-1510
ISSN-L : 1345-1510
Conference information
Research on building database for multimodal AI
*Shinji Mochida*Shuichi Enokida
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Pages 88-91

Details
Abstract
This paper introduces research on building multimodal AI. Multimodal AI is an AI that can uniformly handle images, voice, text, and other environmental information. To build a multimodal AI, it is necessary to build a database and interface that can uniformly handle multimodal data. Multimodal data includes voice, text, and other environmental information data. Multimodal AI has experiences like humans and can make the same judgments and take the same actions as humans. Multimodal AI can make autonomous decisions in accordance with environmental conditions and objectives and makes decisions based on experience. To realize human multimodal AI, it is first necessary to create a database that can handle multimodal data.
Content from these authors
© 2025 Biomedical Fuzzy Systems Association
Previous article Next article
feedback
Top