Automatic Editing of Spoken Document for Intelligent Speech Archive

Masashi Ito; Tomohiro Ohno; Shigeki Matsubara

doi:10.14864/softscis.2008.0.364.0

Abstract

As the World Wide Web typifies, a vast amount of information has accumulated on the Internet. However, most of the information currently distributed consists of written documents; by comparison, spoken documents are rarely distributed. If a mechanism for distributing spoken documents can be built, society will be able to share much more information. This paper proposes a technique for editing the sentences of a spoken document in order to convert it into Internet content that is both accessible and readable. By aligning the recorded video or speech data with the edited text at a fine-grained level, the result can be used as accessible multimedia content. Our technique consists of the following three sentence-level operations: (1) paraphrase, (2) division, and (3) structuration. We implemented a spoken document editing system based on these techniques. In an editing experiment using lecture speech data, our technique achieved high accuracy. These results confirm the effectiveness of our technique.
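The abstract names three sentence-level stages (paraphrase, division, structuration) without giving their rules. As a rough illustration only, the following Python sketch chains three toy stand-ins for those stages. The filler list, the split rule, the bullet layout, and all function names are assumptions made for this example and are not taken from the paper.

```python
import re
from typing import List

# Hypothetical filler words; the paper's actual paraphrase rules are not given.
FILLERS = {"um", "uh", "well"}


def paraphrase(sentence: str) -> str:
    """Drop spoken-style fillers (a toy stand-in for the paraphrase stage)."""
    words = [w for w in sentence.split()
             if w.lower().strip(",.") not in FILLERS]
    return " ".join(words)


def divide(sentence: str) -> List[str]:
    """Split at ', and ' or ', but ' (an assumed boundary rule)."""
    parts = re.split(r",\s+(?:and|but)\s+", sentence)
    return [p.strip().rstrip(".") + "." for p in parts if p.strip()]


def structure(sentences: List[str]) -> str:
    """Lay the divided sentences out as a plain-text itemized list."""
    return "\n".join(f"- {s}" for s in sentences)


def edit(sentence: str) -> str:
    """Run the three toy stages in the order the abstract lists them."""
    return structure(divide(paraphrase(sentence)))


if __name__ == "__main__":
    print(edit("Um, we record the lecture, and uh we align it with the text."))
```

A real system would presumably rely on linguistic analysis rather than word lists and regular expressions; the sketch only shows how the three stages could compose.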
