Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Paper
Enrichment of a Dictionary with Images from the Internet
Sanae FujitaHirotoshi TairaMasaaki Nagata
Author information
JOURNAL FREE ACCESS

2013 Volume 20 Issue 2 Pages 223-250

Details
Abstract
The Internet is an immense resource storehouse for images. Establishing a connection between images and dictionary definitions would enable the creation of rich dictionary resources with multimedia information. Therefore, this study aims at providing several suitable images for dictionary definitions. In this study, we targeted 25,481 words, including nouns, verbs, adjectives, and adverbs, split into 39,251 senses by querying an image search engine. The results showed that 94% of word senses could be defined as suitable images. Then, we analyzed the relationship between the visualization of each word sense and parts of speech or semantic classes. To obtain images for each word sense, we expanded the query by appending queries extracted from definitions for each word sense. Second, we analyzed both manually-selected and fixed queries and examined query expansion methods more deeply. This paper proposes a method to set queries in priority order depending on the primary word sense. Third, we show the suitability of our method through two types of evaluations because in the application of new dictionaries or new target senses, it is valuable to obtain images automatically using high-priority queries.
Content from these authors
© 2013 The Association for Natural Language Processing
Previous article Next article
feedback
Top