Host: The Japanese Society for Artificial Intelligence
Name : 34th Annual Conference, 2020
Number : 34
Location : Online
Date : June 09, 2020 - June 12, 2020
Legal documents contain legal terms that have similar meaning or pronunciation each other. Japanese legislation defines their usage on the basis of traditional customs and rules. In accordance with the definition, we need to use these legal terms properly and strictly in a statute. We are also encouraged to follow the definition in writing broad-sense legal documents, such as contracts and terms of use. To assist in writing legal documents, we propose a method that locates inappropriate legal terms in Japanese statutory sentences and suggests corrections. We solve this task with a classifier by regarding the task as a sentence completion test. Our classifier is based on a pretrained BERT model trained by using a large amount of general sentences. To raise performance, we apply three training techniques: domain adaptation, undersampling, classifier unification. Our experiments show that our classifier achieved better performance than Random Forest-based ones and language model-based ones.