Journal of Information Processing
Online ISSN : 1882-6652
ISSN-L : 1882-6652
NMT-Based Code Generation for Coding Assistance with Natural Language
Yuka AkinobuTeruno KajiuraMomoka ObaraKimio Kuramitsu
著者情報
ジャーナル フリー

2022 年 30 巻 p. 443-450

詳細
抄録

This paper proposes an attempt to realize coding assistance that generates Python code from natural language descriptions using neural machine translation. Although coding assistance with deep learning has recently become a major concern, few applications have used neural machine translation models. One of the major barriers is the shortage of a parallel corpus of natural language descriptions and source code. To overcome the shortage of parallel corpora, we propose a method for synthesizing parallel corpora that utilizes the formal nature of programming languages. We aim to establish a new method using an abstract syntax tree (AST) and a corpus of code fragments. Using the proposed synthesis method, we successfully constructed tens of thousands of parallel corpora and trained PyNMT models to generate Python code from Japanese input sentences. The trained PyNMT models successfully predicted the Python code from user input sentences with an accuracy of 28%. In this study, we propose a synthetic method for a parallel corpus and summarize the results of the evaluation experiments conducted on PyNMT models.

著者関連情報
© 2022 by the Information Processing Society of Japan
前の記事 次の記事
feedback
Top