Mathematical Linguistics
Online ISSN : 2433-0302
Print ISSN : 0453-4611
Special Issue 2020 on the "Mathematical Analysis of Linguistic Data and Japanese Language Teaching for Foreigners"
A Study of Methods of Analysis Suitable for Learner Corpora
Hideaki Mori
Author information
JOURNAL OPEN ACCESS

2020 Volume 32 Issue 7 Pages 436-446

Details
Abstract
Unlike balanced corpora, numerous learner corpora are not satisfactorily large-scale or representative. However, because previous analyses were conducted fundamentally by using the same method as balanced corpora, the present study examined the statistical validity of the existing method. According to the findings, linguistic units (e.g., misuse expressions, sentences, and morphemes) are consistently used by each learner. For this reason, there is no satisfactory assumption of independence for statistical analyses, and there are cases in which the outliers distort the results. Thus, the method of collecting each learner’s frequency and analyzing learners as observation units is a valid approach to replace the existing method.
Content from these authors
© 2020 The Mathematical Linguistic Society of Japan

この記事はクリエイティブ・コモンズ [表示 - 非営利 - 改変禁止 4.0 国際]ライセンスの下に提供されています。
https://creativecommons.org/licenses/by-nc-nd/4.0/deed.ja
Previous article Next article
feedback
Top