医薬品情報学
Online ISSN : 1883-423X
Print ISSN : 1345-1464
ISSN-L : 1345-1464
資料
潜在意味の解析を用いた添付文書に基づく類似医薬品提案法の探索
菊池 美沙伊東 理絵田中 優太島田 洋輔後藤 了尾関 理恵小茂田 昌代
著者情報
ジャーナル フリー

2018 年 20 巻 2 号 p. 111-119

詳細
抄録

Objective:The topic model is a well-known method used in the field of natural language processing (NLP)that defines adocument as constructed of topics that combine specific t erms. This method is used to model topic co-occurrencemathematically. In this study,we extracted topics from featu re vectors of explicit documents called medical package insertsby using cluster analysis. Methods:We counted the terms(nouns)recognized by the morphological analysis engine MeCab and created a documentterm matrix. A value of“tf・idf”was calculated in this matrix for term weighting to avoid the effect of term frequency. We reduced the dimensionality of the matrix using singular v alue decomposition,which removed unnecessary data,and weextracted feature vectors attributed to each medical package insert. The distance between feature vectors was calculatedusing cosine distance,and cluster analysis was performed based on the distance between the vectors.

Results:Cluster analysis on our document-term matrix show ed that medical package inserts of drugs that have the sameefficacy or active ingredient were included in the same cl uster. Moreover, using term weighting and dimensionalityreduction,we could extract topics from medical package inserts.

Conclusion:We obtained a foothold to apply our findings t o the recommendation of similar drugs. Cluster analysis ofmedical package inserts using NLP can contribute to the pro per application of drugs. In addition,our study revealed thesimilarities of drugs and suggested possibilities for new applications from several points of view.

著者関連情報
© 2018 日本医薬品情報学会
前の記事 次の記事
feedback
Top