Abstract
An interactive identification scheme is presented for remotely located objects. In the identification process, a multi-modal description is generated based on operator's instruction of the type and the image of object. The a priori information, a set of symbolic descriptions of object knowledge, is installed in the computer system on which the scheme is implemented. Supported by this a priori knowledge, the operator specifies the object in the observed image captured by monocular TV camera. The instruction is matched with observed imagery to complete the as-is object description. Through a series of experiments, the scheme has been demonstrated to be applicable to the identification of extended classes of practical scene.