Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 3S1-OS-7b-03
Conference information

Challenges and Applications of Artificial Intelligence in Corpus Creation Using Case Reports of Rare and Intractable Diseases
*Eisuke DOHIYuka TATEISHIToyofumi FUJIWARAYasunori YAMAMOTO
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

For effective diagnostic support using Artificial Intelligence (AI), an accurate case-based corpus is essential, yet it presents several challenges. This abstract proposes solutions to these challenges utilizing AI.Sharing case reports is difficult due to privacy concerns, and the main challenges include text extraction from PDFs, variability in disease name notation, structuring clinical data, normalizing text data, and extracting and annotating information. Particularly, text extraction from PDFs is technically challenging, and variability in disease name notation is common. Systems like CaseSharing have proven effective for structuring clinical data, and normalization of text data has been somewhat resolved using Large Language Models (LLMs). Furthermore, LLMs enable extraction of information following a timeline, but annotation remains a challenge.From these experiences, it is believed that the application of AI plays a crucial role in dataset creation. Moving forward, we aim to deepen the discussion on more effective utilization of these technologies.

Content from these authors
© 2024 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top