集合分割問題に基づく系列アラインメントのモデル化

西野 正彬; 鈴木 潤; 梅谷 俊治; 平尾 努; 永田 昌明

doi:10.5715/jnlp.23.175

Paper

Sequence Alignment as a Set Partitioning Problem

Masaaki Nishino, Jun Suzuki, Shunji Umetani, Tsutomu Hirao, Masaaki Nagata

Author information

Keywords: Sequence Alignment, Combinatorial Optimization, Column Generation

JOURNAL FREE ACCESS

2016 Volume 23 Issue 2 Pages 175-194

DOI https://doi.org/10.5715/jnlp.23.175

Details

Abstract

Sequence alignment, which involves aligning elements of two given sequences, occurs in many natural language processing (NLP) tasks such as sentence alignment. Previous approaches for solving sequence alignment problems in NLP can be categorized into two groups. The first group assumes monotonicity of alignments; the second group does not assume monotonicity or consider the continuity of alignments. However, for example, in aligning sentences of parallel legal documents, it is desirable to use a sentence alignment method that does not assume monotonicity but can consider continuity. Herein, we present a method to align sequences where block-wise changes in the order of sequence elements exist. Our method formalizes a sequence alignment problem as a set partitioning problem, a type of combinatorial optimization problem, and solves the problem to obtain an alignment. We also propose an efficient algorithm to solve the optimization problem by applying column generation.

Corresponding author

Register with J-STAGE for free!