Mathematical Linguistics
Online ISSN : 2433-0302
Print ISSN : 0453-4611
Invited Paper (A) to the Special Issue
The Concept, Types and Utility of Web Corpora
Web Corpora as a Source of Information for Etymological Studies
Tadaharu Tanomura
Author information
JOURNAL OPEN ACCESS

2016 Volume 30 Issue 6 Pages 326-343

Details
Abstract
The defining condition of a Web corpus will be that it is a huge amount of text data collected from the Internet. Although Websites such as Google Books, National Diet Library Digital Collections and newspaper archives do not satisfy the condition, they nevertheless cannot be clearly distinguished from typical Web corpora, and thus it may not be groundless to regard them as a type of Web corpus. This article, drawing upon two case studies, will demonstrate that we can easily enhance the level of the description of the history of Japanese as well as Chinese terms of the modern era with the help of information obtainable from those Websites.
Content from these authors
© The Mathematical Linguistic Society of Japan

この記事はクリエイティブ・コモンズ [表示 - 非営利 - 改変禁止 4.0 国際]ライセンスの下に提供されています。
https://creativecommons.org/licenses/by-nc-nd/4.0/deed.ja
Previous article Next article
feedback
Top