精密工学会誌

This paper introduces a fast image matching method using coarse to fine search. For reliable and high-speed matching, it is important to decide on optimal interval on coarse search. In the proposed method, efficient search is realized by making the coarse search interval non-uniform in order to make the performance of rotation matching uniform. Through experiments using 4 types×121 test images, it was confirmed that the proposed method can achieve a speedup of up to 60% compared to the conventional method.

抄録全体を表示

In anomaly detection using deep learning, normal models based on pretrained CNNs using only normal data have become the mainstream. This framework can only use normal data for training and discards valuable information even when abnormal data is available. In addition, PaDiM, one of the representative models in this framework, creates a normal model for each position and thus cannot consider the relationship between each pixel. In this paper, we propose a method to generate a normal model by considering the information of anomalous data and neighborhood information, and achieve an image-level AUROC: 0.984 on MVTec AD.

抄録全体を表示

Once-for-All (OFA) is an AI model development method that allows a model (Supernet), a redundant representation of a base AI model (Base Model), to be trained only once to obtain models (Subnets) that are suitable for various devices in terms of accuracy, processing speed and number of parameters. In this paper, we address a road obstacle detection system consisting of multiple AI models, and apply OFA to each AI model. Finally, we succeed in obtaining the optimal Subnets for the entire system by considering the combination of the obtained Subnets.

抄録全体を表示

抄録を表示する抄録を非表示にする

We propose a method for temporally enhancing the high-frequency components of video sequences with no artifacts to observe the micromovement of the human body. The existing methods of video motion magnification cause severe artifacts in the enhanced video sequence because the temporal micromovement and the spatial appearance of a subject are not stably separated from the input video sequence. When the observer views the enhanced video sequence with severe artifacts, they cannot fully check the high-frequency components of body sway. Here, we assume that the temporal micromovement is the same among all pixels contained in the subject's head. Our method stably separates the video sequence into the temporal micromovement and the spatial appearance. Then, our method amplifies the high-frequency components of the temporal micromovement. The experimental results show that our method enhanced the video sequence of body sway with no artifacts. We confirmed that the high-frequency components were viewed in the enhanced video sequence. We also visualized the reason for success or failure in baggage weight classification using the video sequence of body sway as an application of our method.

抄録全体を表示

PDF形式でダウンロード (5210K)

J-STAGEへの登録はこちら（無料）