Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
A Comparison of Measures for Extracting Domain-Specific Lexicons for English Education
MASAO UTIYAMA, KIYOMI CHUJO, EIKO YAMAMOTO, HITOSHI ISAHARA

2004 Volume 11 Issue 3 Pages 165-197

Abstract
Mastery of domain-specific vocabulary in specialized English texts is essential. To identify a cost-effective and efficient means of extracting domain-specific vocabulary, eight individual statistical measures, and combinations of those measures, were applied to corpora, and the resulting lists were compared to an existing specialized vocabulary control list. The results showed that a list of specialized vocabulary could be produced efficiently, and that a combination of measures produced the list most comparable to the control list. Since combinations of measures are complex to apply, it is worth noting that individual measures were also found to be effective and useful for both English teachers and researchers. The complementary similarity measure ranked as the most effective individual measure. Moreover, each measure produced a distinct type of word list with specific pedagogical applications for different student proficiency levels and lexicons.
© The Association for Natural Language Processing