Our research objective is to judge document similarity using orthogonal transformations. Based on the results of previous experiments, this paper focuses on the Fourier system. In those experiments, text in which single-byte and double-byte characters were intermixed was a problem. We therefore propose a character-code conversion that standardizes all character codes to double-byte characters. This conversion improved the precision of the similarity judgment. Furthermore, we determined a threshold for each sentence length.
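As an illustration of the character-code conversion described above, the following is a minimal sketch, not the authors' implementation. It assumes the input is a Unicode string and that "double-byte" corresponds to the full-width forms of ASCII and katakana. The function name `to_fullwidth` is hypothetical. Note that NFKC normalization also folds some other compatibility characters, which may or may not match the conversion used in the paper.

```python
import unicodedata


def to_fullwidth(text: str) -> str:
    """Map half-width ASCII and katakana to their full-width forms (illustrative only)."""
    # NFKC folds half-width katakana to full-width katakana.
    # It also folds full-width ASCII back to plain ASCII, which the loop below re-widens.
    folded = unicodedata.normalize("NFKC", text)
    out = []
    for ch in folded:
        code = ord(ch)
        if code == 0x20:
            # ASCII space -> ideographic (full-width) space
            out.append("\u3000")
        elif 0x21 <= code <= 0x7E:
            # Printable ASCII -> full-width block (fixed offset of 0xFEE0)
            out.append(chr(code + 0xFEE0))
        else:
            # Everything else (kanji, hiragana, full-width katakana, ...) is kept
            out.append(ch)
    return "".join(out)
```

With such a step applied before the transform, a sentence written with half-width characters and the same sentence written with full-width characters yield the same character sequence, so the mixed encoding no longer affects the comparison.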