Abstract
Digital cameras and web cameras have recently become common in everyday life, so a large number of movies can be obtained easily. However, it is difficult to find a particular part of the obtained movies. In this paper, we therefore apply image processing to the obtained movies and propose a method to explain human behavior in a particular space by observing how a person interacts with the objects in that space. With this method, we aim to develop a system that can retrieve a particular human behavior from a query given in words.