Host: Japan Society for Fuzzy Theory and Intelligent Info rmatics (SOFT)
Name : 41th Fuzzy System Symposium
Number : 41
Location : [in Japanese]
Date : September 03, 2025 - September 05, 2025
In this study, we extract technical terms from text using GiNZA, a Python-based natural language processing library, and NPYLM, an unsupervised language model capable of learning word segmentation. Sentences are then vectorized using λ-fuzzy measures, and their similarity is calculated using the Choquet integral.Our method enables effective extraction of domain-specific terminology, which is difficult with conventional morphological analysis. By incorporating these terms into similarity computation, we propose a more accurate similarity metric compared to existing methods.