Abstract
Recently, we can easily obtain quite a few movies, therefore, we need a method to effectively manage those movies. At this moment, it is difficult for us to find a particular scene in a movie. If we want to do so, we have to watch through all the parts of a movie, we try to retrieve a scene where a human interacts with objects in a room. To achieve this, we firstly apply image processing techniques to analyze a movie and identify human motions in the movie, and then verbalize human behaviors in a room by observing how the human interaction with obects with a model of Bayesian network for identifying human actions.