映像情報メディア学会誌
Online ISSN : 1881-6908
Print ISSN : 1342-6907
ISSN-L : 1342-6907
66 巻, 11 号
選択された号の論文の24件中1~24を表示しています
ふぉーかす
特 集
大規模データを活用した映像メディア処理
技術解説
話 題
講 座
拡張現実感技術の最前線(第11回)
番組制作ファイル(第8回)
メディアウォッチ(第11回)
知っておきたいキーワード(第82回)
ニュース
論文・研究速報
論文特集 画像処理・符号化とアプリケーション
特集論文
  • 滝本 裕則, 吉森 聖貴, 満倉 靖恵
    2012 年 66 巻 11 号 p. J399-J406
    発行日: 2012年
    公開日: 2012/10/25
    ジャーナル フリー
    In this paper, we propose a technique for automatic generation of pixel art from character images based on color-difference tolerance. Pixel art is one of image expressions for digital images and is based on the pixel level. Character images that are expressed by pixel art are composed of edge lines and a few colors. The proposed technique is for automatically generating pixel art from a photograph where there was only a single object. However, in the conventional method, it is difficult to determine several parameters of edge detection and decrease color for automatic generation of pixel art. In this paper, to create the optimal decreased color image for each target image, we propose a clustering method using a maximum distance algorithm (MDA) based on the human color-difference tolerance. Moreover, the complete automation of making pixel art is achieved by using the result obtained by clustering using the MDA for the threshold decision of the Canny edge detector. As a result, it is shown that several optimum parameters for pixel art are obtained by using the proposed method.
  • 北村 一真, 福水 洋平, 道関 隆国
    2012 年 66 巻 11 号 p. J407-J412
    発行日: 2012年
    公開日: 2012/10/25
    ジャーナル フリー
    We developed invisible code that is suitable for embedding into a natural image. Our code consists of a background layer, color adjustment layer and code layer. The code layer is composed of edgeless high-frequency blocks and edgeless low-frequency blocks. Two key technologies were devised to implement this idea. One is the color adjustment layer that adjusts the color of the background image for high-frequency blocks using an original blur filter. The other is the edgeless high-frequency block that increases the transparency of the periphery of the high-frequency block. To verify the effectiveness of our invisible code, we estimated the invisibility and reading accuracy for code embedded in four common types of images. The results demonstrate that the invisibility has been improved about 1.34 times and the reading accuracy has been improved about 1.38 times compared with the conventional invisible code.
  • 鬼頭 卓也, 阿知葉 征彦, 辻 拓実
    2012 年 66 巻 11 号 p. J413-J419
    発行日: 2012年
    公開日: 2012/10/25
    ジャーナル フリー
    This paper proposes a new method of correspondence point search for intermediate viewpoint image generation using Lagrangian interpolation from multi-view images obtained from the circular camera arrangement. Correspondence point search is not performed on the basis of the real camera image but is performed on the basis of interpolation domain. In the conventional method, if the camera interval exceeds 4°, this causes disturbance in the image. However, the proposed method was able to produce an image with fewer disturbances to the intermediate viewpoint with a camera interval up to 6°. In addition, it was able to suppress a significant increase in processing costs by varying the block size to be used for correspondence point search by focusing on the amount of edge that is included in the search range.
  • 江田 孝治
    2012 年 66 巻 11 号 p. J420-J425
    発行日: 2012年
    公開日: 2012/10/25
    ジャーナル フリー
    The purpose of this research is to propose a new super-resolution method using DCT. The proposed method enlarges a small block in an input image to a large block through DCT. A high-definition image is created by applying this method to the entire low-resolution input image. The enlargement of small blocks is carried out by expanding the DCT coefficient's frequency domain to the same size as that of the large blocks. First, the high-frequency component that is lacking in a small block is searched for from a database. Then, it is added to a small block and restores the high-frequency component. The high frequency component is searched for using a DCT sign index. The database is trained using multiple natural images beforehand. The proposed method made a high-speed search possible by using DCT sign index. From many simulation results, it was found that the proposed method was more effective than the traditional one.
論文
  • ~動的3次元モデルのテレビ番組への応用~
    久富 健介, 冨山 仁博, 片山 美和, 岩舘 祐一, 松永 孝治, 井藤 良幸, 石原 渉
    2012 年 66 巻 11 号 p. J426-J433
    発行日: 2012年
    公開日: 2012/10/25
    ジャーナル フリー
    This paper describes a method of producing scenes for TV program, using “dynamic 3D models”. Dynamic 3D models are 3D models generated from images shot by multiple cameras surrounding an actor for each frame using 3D-reconstruction and texture-mapping techniques. They can provide highly realistic images as well as natural motion without using a motion-capture system. In addition, they allow flexible production, so the positions of the models can be changed and the models can be duplicated. Dynamic 3D models are therefore suitable for producing crowd scenes with numerous people. Although techniques of 3D reconstruction and texture mapping have been discussed and several applications have been proposed, they have not been widely used into TV or movie production because of the size of data and insufficient textural quality. By establishing an efficient production flow with recent powerful processors, we succeeded in producing scenes with sufficient quality for a TV program, which was aired in December, 2009. The scenes involving one thousand soldiers could be produced by shooting several sequences with only two actors. We describe the production flow that we established and present some scenes that we produced using dynamic 3D models for the drama.
  • 松尾 琢也, 福嶋 慶繁, 石橋 豊
    2012 年 66 巻 11 号 p. J434-J443
    発行日: 2012年
    公開日: 2012/10/25
    ジャーナル フリー
    In this paper, we propose a refinement filter for depth maps. The filter convolutes an image and a depth map with a cross-computed kernel. We call the filter a weighted cross bilateral filter. The main advantages of the proposed method are the filter fits outlines of objects in the depth map to silhouettes in the image, while the filter reduces Gaussian noise in other areas. Additionally, its computational cost is independent of depth ranges. Thus, we can obtain accurate depth maps at the lower cost than the conventional approaches, which require Markov random field-based optimization methods. Experimental results show that the accuracy of the depth map in edge areas goes up and its running time is low.
  • 河合 吉彦, 住吉 英樹, 藤井 真人, 八木 伸行
    2012 年 66 巻 11 号 p. J444-J452
    発行日: 2012年
    公開日: 2012/10/25
    ジャーナル フリー
    News programs often have sudden brightness changes caused by electronic flashes from still cameras. This paper proposes a correction method for the video deterioration caused by flashes using the frame interpolation technique. The proposed method creates an interpolated frame from neighboring frames to replace the frame area altered by the flash luminescence. To estimate accurate motion vectors used for frame interpolation, the proposed method adopts a novel block-based cost function that takes into account not only the frames before and after the flash but the flash frame itself. In addition, the hierarchical motion estimation and a new vector refinement filter are used to improve the interpolation quality. Experimental results for broadcasting video show that the proposed method is superior to the conventional interpolation method.
  • 宋 洋, 張 暁林
    2012 年 66 巻 11 号 p. J453-J460
    発行日: 2012年
    公開日: 2012/10/25
    ジャーナル フリー
    A human-like active binocular vision system, inspired by binocular eye movements in animals, would help robots with automatic fast target switching, smooth target pursuing and efficient visual stabilization. In this paper, a control model that integrates saccadic eye movement, smooth pursuit eye movement, vestibulo-ocular reflex and optokinetic response is proposed. The control interface of the model has been simplified to one external saccadic command input. By Using this target selection command, target switching, target pursuing and visual stabilization of camera would run automatically. To implement the system with parallel processing, like the one used in neural network, the control model and multi-motor control are implemented in a FPGA chip. Finally, the proposed model was tested Using an image processing PC and a binocular robot head and the results show high efficiency of this control model.
  • 石戸谷 顕太朗, 山本 圭介, 大平 茂輝, 長尾 確
    2012 年 66 巻 11 号 p. J461-J470
    発行日: 2012年
    公開日: 2012/10/25
    ジャーナル フリー
    Reading as many technical documents as possible is important to improve our research. When we read these documents, we can understand their contents more easily by referring to related resources such as images, audio clips, and videos. Videos contain a variety of helpful information, and facilitate our understanding of technical documents. We propose a method to define video scenes and document elements, and to annotate them with additional information such as relationships. Based on these annotations and relationships, we developed a support system that uses videos and helps readers understand technical documents. We performed some experiments to confirm whether the system was usable.
研究速報
feedback
Top