Proceedings of the Annual Conference of Biomedical Fuzzy Systems Association (バイオメディカル・ファジィ・システム学会大会講演論文集)
Online ISSN : 2424-2586
Print ISSN : 1345-1510
ISSN-L : 1345-1510

Automatic Detection of Personal Information in Medical Data Using a Large Language Model
井上 愛, 盛田 健人, 藤井 武宏, 佐野 龍樹, 土肥 薫, 若林 哲史
Conference proceedings and abstracts, free access

p. 71-74

Abstract
This study proposes a method for automatically detecting personally identifiable information (PII) in medical data of various formats using a locally run large language model (LLM). Text is extracted with a procedure tailored to each data format and then passed to Llama 3 for detection. We also fine-tuned the model with LoRA and compared its performance with that of the base model. The base model achieved a high detection rate on text data, but the rate dropped on image data because of text misrecognition by EasyOCR. Fine-tuning improved the output format, but the detection rate for patient IDs decreased significantly.
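
As a rough illustration of the flow the abstract describes (format-specific text extraction, then LLM-based detection), the following Python sketch may help. It is not the authors' implementation: the label set `PII_LABELS`, the prompt wording, the JSON output format, and the function names are all assumptions made for this example, and the model is passed in as a generic `generate` callable rather than an actual Llama 3 call.

```python
import json
from pathlib import Path
from typing import Callable, Dict, List

# Hypothetical label set; the abstract names only patient IDs explicitly.
PII_LABELS = ("name", "patient_id", "date_of_birth", "address", "phone")


def extract_plain_text(path: Path) -> str:
    """Read a UTF-8 text file as-is."""
    return path.read_text(encoding="utf-8")


# Map file suffix to an extractor. An image entry (for example ".png")
# would call an OCR engine here; that step is omitted in this sketch.
EXTRACTORS: Dict[str, Callable[[Path], str]] = {
    ".txt": extract_plain_text,
    ".csv": extract_plain_text,
}


def build_prompt(text: str) -> str:
    """Assumed prompt format asking for a JSON array of detections."""
    return (
        "List every piece of personal information in the text below as a "
        'JSON array of objects with keys "label" and "text". '
        f"Allowed labels: {', '.join(PII_LABELS)}.\n\n{text}"
    )


def parse_detections(raw: str) -> List[dict]:
    """Parse model output, tolerating extra prose around the JSON array."""
    start, end = raw.find("["), raw.rfind("]")
    if start == -1 or end <= start:
        return []
    try:
        items = json.loads(raw[start:end + 1])
    except json.JSONDecodeError:
        return []
    if not isinstance(items, list):
        return []
    # Keep only well-formed entries with a known label.
    return [
        d for d in items
        if isinstance(d, dict)
        and d.get("label") in PII_LABELS
        and isinstance(d.get("text"), str)
    ]


def detect_pii(path: Path, generate: Callable[[str], str]) -> List[dict]:
    """Extract text by file format, query the model, and parse its answer."""
    extractor = EXTRACTORS.get(path.suffix.lower())
    if extractor is None:
        raise ValueError(f"no extractor registered for {path.suffix!r}")
    return parse_detections(generate(build_prompt(extractor(path))))
```

Here `generate` stands in for whatever interface serves the local model. The tolerant parser reflects the abstract's remark that output format was an issue for the base model: a reply that wraps the JSON in extra prose is still usable, while a malformed reply yields an empty list rather than an exception.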
© 2025 Biomedical Fuzzy Systems Association (バイオメディカル・ファジィ・システム学会)