In this paper, we examine effective methods of displaying video learning materials to learners of Japanese. We conducted an experiment with thirty foreign students and nine native Japanese speakers, showing them three types of display that combine images and transcriptions in different ways. We hypothesized that the synthetic display configuration would be the most effective. The results, however, show no significant difference among the three types. We therefore analyze the reasons for this outcome and some possible contributing factors from several points of view, such as the learners' level, the adequacy of the questions, and the testing methodology.