Abstract
As typified by World Wide Web, a lot of information
became accumulated on the Internet. However, most of the
currently distributed information is occupied by written document. Compared with it, spoken document is hardly distributed. Therefore, if the mechanism for distributing them can be built, our human society will be able to share much more information. This paper proposes a technique for editing a sentence in spoken document for the purpose of converting it into the Internet contents equipped with the accessibility and readability. By aligning the recorded video data or speech data with the edited text on a fine level, it can be utilized as the multimedia contents equipped with the accessibility. Our technique consists
of the following three sentence technologies: (1)paraphrase, (2) division, and (3) structuration. We implemented a spoken document edit system based on our techniques. We conducted an edit experiment by using lecture speech data and our technique could achieve high accuracy. From the results, we confirmed the availability of our technique.