Host: Japan Society for Fuzzy Theory and Intelligent Info rmatics (SOFT)
Name : 41th Fuzzy System Symposium
Number : 41
Location : [in Japanese]
Date : September 03, 2025 - September 05, 2025
In recent years, retrieval-augmented generation (RAG) using large language models (LLMs) has attracted significant attention. However, many real-world documents contain structured content such as tables and images, which present challenges for conventional RAG systems. This study proposes a method for generating unified structured documents by combining OCR, layout analysis, and LLMs to preserve structural information. Experimental evaluations using university AI guidelines demonstrate that our approach improves the accuracy of RAG-generated outputs.