Abstract
A method for accurate scene segmentation based on video structure analysis using sequences of continuous shots in audiovisual materials is proposed. In the proposed method, for efficient video structure analysis, similarities between the sequences of continuous shots are calculated by utilizing Dynamic Time Warping (DTW), one of the methods for sequence alignment. In this calculation, by applying Probabilistic Latent Semantic Analysis (PLSA) to audiovisual features extracted from the audiovisual materials, new features of the shots are obtained, and the proposed method newly introduces these features in definition of the costs used in DTW. Consequently, the proposed method can realize efficient video structure analysis based on the similarities between the sequences of continuous shots, and thereby accurate scene segmentation becomes feasible.