Abstract
This paper proposes a method that disambiguates verb senses using co-occurrence-based likelihood parameters whose sample spaces are extended according to a thesaurus. The method selects the most plausible sense if its likelihood is significantly greater than that of the second most plausible one. If not, the sample space is extended and the significance test is tried again. If the sample space cannot be extended any further, the system gives up disambiguation. The method was applied to 74 polysemous verbs (about 89,000 instances) extracted from the EDR Japanese Corpus. When the most frequent sense was always selected, the precision was 0.65 and the applicability, i.e., the ratio of the disambiguated verbs to the treated verbs, was 1.00. The proposed method was compared with a class-based method. With Bunruigoihyou, both methods achieved a precision of 0.71, but the applicabilities of the proposed method and the class-based method were 0.73 and 0.68, respectively. With the EDR Concept Classification Dictionary, both methods achieved a precision of 0.70, but the applicabilities of the proposed method and the class-based method were 0.87 and 0.76, respectively. The applicability of the proposed method is significantly higher than that of the class-based method, which shows the plausibility of the proposed method.
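The select, extend, or give-up loop described above can be illustrated with a minimal Python sketch. It is not the paper's implementation: the toy thesaurus `PARENT`, the counts in `COUNTS`, the three senses, and the function names are all hypothetical, raw co-occurrence counts stand in for the likelihood parameters, and a simple ratio threshold stands in for the significance test, whose exact form the abstract does not specify.

```python
from collections import Counter

# Hypothetical toy thesaurus: each word or class maps to its parent class.
PARENT = {
    "beer": "beverage", "wine": "beverage", "tea": "beverage",
    "medicine": "substance", "beverage": "substance",
}

# Hypothetical training counts: (verb sense, co-occurring noun) -> frequency.
COUNTS = Counter({
    ("drink", "beer"): 8, ("drink", "wine"): 5,
    ("take", "beer"): 5, ("take", "medicine"): 6,
})


def ancestors(word):
    """Yield the word itself, then successively broader thesaurus classes."""
    node = word
    while node is not None:
        yield node
        node = PARENT.get(node)


def sense_scores(cls):
    """Sum the counts of every training noun that falls under class `cls`."""
    scores = Counter()
    for (sense, noun), n in COUNTS.items():
        if cls in ancestors(noun):
            scores[sense] += n
    return scores


def disambiguate(noun, ratio=2.0):
    """Return a sense, or None when the system gives up.

    ASSUMPTION: the best sense is accepted when its count is at least
    `ratio` times the runner-up's count (with a floor of 1); this is only
    a stand-in for the paper's significance test.
    """
    for cls in ancestors(noun):          # start narrow, widen step by step
        ranked = sense_scores(cls).most_common()
        if not ranked:
            continue                     # no evidence at this level
        best, best_n = ranked[0]
        second_n = ranked[1][1] if len(ranked) > 1 else 0
        if best_n >= ratio * max(second_n, 1):
            return best
    return None                          # sample space cannot be extended


if __name__ == "__main__":
    for noun in ["beer", "tea", "medicine", "water"]:
        print(noun, "->", disambiguate(noun))
```

In this toy data, "beer" is ambiguous at the word level (8 versus 5), so the sample space is widened to the "beverage" class, where the evidence for "drink" (13 versus 5) passes the threshold. A noun absent from the thesaurus, such as "water", leaves nothing to extend, so the sketch returns `None`; such cases are what lower the applicability below 1.00.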