2002 Volume 9 Issue 2 Pages 91-110
Diff is a software program that detects differences between two data sets and is useful for natural language processing.This paper shows several example applications of how Diff can be used to detect differences, extract rewriting rules, merge two different datasets, and matching two different data sets optimally.Since Diff can be applied to a normal UNIX system, it is very easy and convenient to use.Our studies showed that Diff is a practical tool for researching natural language processing.