The Journal of The Institute of Image Information and Television Engineers
Online ISSN : 1881-6908
Print ISSN : 1342-6907
ISSN-L : 1342-6907
Volume 66, Issue 11
Displaying 1-24 of 24 articles from this issue
Focus
Special Issue
Image Processing and Large-Scale Data
Technical Survey
Topics
Technical Guide
Technology Frontier of Augmented Reality (11)
Production File on Broadcast Program (8)
Media Watch (11)
Keywords you should know (82)
News
  • Hironori Takimoto, Seiki Yoshimori, Yasue Mitsukura
    2012 Volume 66 Issue 11 Pages J399-J406
    Published: 2012
    Released on J-STAGE: October 25, 2012
    JOURNAL FREE ACCESS
    In this paper, we propose a technique for automatically generating pixel art from character images on the basis of color-difference tolerance. Pixel art is a form of digital image expression defined at the pixel level; character images expressed as pixel art are composed of edge lines and a small number of colors. The proposed technique automatically generates pixel art from a photograph containing a single object. With the conventional method, however, it is difficult to determine the edge-detection and color-reduction parameters needed for automatic generation of pixel art. To create the optimal color-reduced image for each target image, we propose a clustering method that uses a maximum distance algorithm (MDA) based on human color-difference tolerance. Moreover, complete automation of pixel-art generation is achieved by using the MDA clustering result to decide the thresholds of the Canny edge detector. The results show that the optimal parameters for pixel art are obtained with the proposed method.
    Download PDF (1103K)
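The maximum distance algorithm described in the abstract can be sketched as follows: cluster centers are added greedily, always taking the color farthest from every existing center, until that distance drops below the color-difference tolerance. This is an illustrative sketch of the general MDA idea, not the authors' implementation; the tolerance value and the toy palette are hypothetical.

```python
import numpy as np

def mda_cluster_colors(pixels, tolerance):
    """Greedy maximum-distance clustering of pixel colors.
    A new center is the color farthest from all current centers;
    clustering stops when that distance <= tolerance."""
    centers = [pixels[0].astype(float)]
    while True:
        # distance of every pixel to its nearest current center
        d = np.min([np.linalg.norm(pixels - c, axis=1) for c in centers], axis=0)
        i = int(np.argmax(d))
        if d[i] <= tolerance:
            break
        centers.append(pixels[i].astype(float))
    return np.array(centers)

# toy palette: two well-separated colors with small jitter (hypothetical data)
pix = np.array([[0, 0, 0], [2, 1, 0], [250, 250, 250], [255, 255, 255]], dtype=float)
centers = mda_cluster_colors(pix, tolerance=30.0)
print(len(centers))  # 2 centers: near-black and near-white merge within tolerance
```

The number of resulting centers (and their separation) is the kind of statistic the paper reuses to set the Canny detector's thresholds automatically.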
  • Kazuma Kitamura, Yohei Fukumizu, Takakuni Douseki
    2012 Volume 66 Issue 11 Pages J407-J412
    Published: 2012
    Released on J-STAGE: October 25, 2012
    JOURNAL FREE ACCESS
    We developed an invisible code suitable for embedding in natural images. The code consists of a background layer, a color adjustment layer, and a code layer. The code layer is composed of edgeless high-frequency blocks and edgeless low-frequency blocks. Two key technologies were devised to implement this idea. One is the color adjustment layer, which adjusts the color of the background image for high-frequency blocks with an original blur filter. The other is the edgeless high-frequency block, which increases the transparency at the periphery of the high-frequency block. To verify the effectiveness of our invisible code, we evaluated the invisibility and reading accuracy of code embedded in four common types of images. The results demonstrate that invisibility improved about 1.34 times and reading accuracy about 1.38 times compared with the conventional invisible code.
    Download PDF (1952K)
  • Takuya Kito, Masahiko Achiha, Takumi Tuji
    2012 Volume 66 Issue 11 Pages J413-J419
    Published: 2012
    Released on J-STAGE: October 25, 2012
    JOURNAL FREE ACCESS
    This paper proposes a new correspondence-point search method for generating intermediate-viewpoint images by Lagrangian interpolation from multi-view images captured with a circular camera arrangement. The correspondence-point search is performed not in the real camera images but in the interpolation domain. With the conventional method, a camera interval exceeding 4° causes disturbances in the generated image, whereas the proposed method produces intermediate-viewpoint images with fewer disturbances at camera intervals of up to 6°. In addition, a significant increase in processing cost is suppressed by varying the block size used for the correspondence-point search according to the amount of edges included in the search range.
    Download PDF (1991K)
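Lagrangian interpolation, the mathematical core the abstract relies on, estimates a value at an intermediate viewpoint angle from samples at the real camera angles. A minimal sketch, with hypothetical camera angles and sample values (the paper applies this in the interpolation domain, which is omitted here):

```python
def lagrange_interpolate(xs, ys, x):
    """Evaluate the Lagrange interpolating polynomial through the
    points (xs[i], ys[i]) at position x."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                # basis polynomial: 1 at xi, 0 at every other xj
                term *= (x - xj) / (xi - xj)
        total += term
    return total

# value sampled at camera angles 0°, 6°, 12° (hypothetical samples),
# interpolated to an intermediate viewpoint at 3°
print(lagrange_interpolate([0, 6, 12], [100, 112, 130], 3))  # 105.25
```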
  • Takaharu Kouda
    2012 Volume 66 Issue 11 Pages J420-J425
    Published: 2012
    Released on J-STAGE: October 25, 2012
    JOURNAL FREE ACCESS
    The purpose of this research is to propose a new super-resolution method using the DCT. The proposed method enlarges a small block of an input image into a large block through the DCT, and a high-definition image is created by applying this operation to the entire low-resolution input image. A small block is enlarged by expanding the frequency domain of its DCT coefficients to the size of the large block. First, the high-frequency component missing from the small block is retrieved from a database; it is then added to the small block to restore the high-frequency component. The high-frequency component is retrieved with a DCT sign index, and the database is trained on multiple natural images beforehand. The DCT sign index makes a high-speed search possible. Many simulation results showed that the proposed method is more effective than the traditional one.
    Download PDF (687K)
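The frequency-domain block enlargement step can be sketched as zero-padding a block's 2-D DCT coefficients to the target size and inverting. This is a sketch of that step only; the paper additionally restores the missing high frequencies from a trained database via the DCT sign index, which is omitted here, and the amplitude scaling shown is exact only for the DC term.

```python
import numpy as np

def dct_matrix(n):
    # orthonormal DCT-II basis matrix (inverse is the transpose)
    k = np.arange(n)[:, None]
    x = np.arange(n)[None, :]
    m = np.cos(np.pi * (2 * x + 1) * k / (2 * n)) * np.sqrt(2.0 / n)
    m[0] /= np.sqrt(2.0)
    return m

def enlarge_block(block, big):
    """Enlarge a square block by zero-padding its 2-D DCT
    coefficients to big x big and applying the inverse DCT."""
    n = block.shape[0]
    d_small, d_big = dct_matrix(n), dct_matrix(big)
    coeff = d_small @ block @ d_small.T      # forward 2-D DCT
    padded = np.zeros((big, big))
    padded[:n, :n] = coeff * (big / n)       # rescale so mean intensity is preserved
    return d_big.T @ padded @ d_big          # inverse 2-D DCT

small = np.full((4, 4), 10.0)                # flat toy block
large = enlarge_block(small, 8)
print(large.shape, round(float(large.mean()), 3))  # (8, 8) 10.0
```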
  • ∼Application of Dynamic 3D Models to TV Programs∼
    Kensuke Hisatomi, Kimihiro Tomiyama, Miwa Katayama, Yuichi Iwadate, Ko ...
    2012 Volume 66 Issue 11 Pages J426-J433
    Published: 2012
    Released on J-STAGE: October 25, 2012
    JOURNAL FREE ACCESS
    This paper describes a method of producing scenes for a TV program using “dynamic 3D models”: 3D models generated frame by frame, with 3D-reconstruction and texture-mapping techniques, from images shot by multiple cameras surrounding an actor. They provide highly realistic images and natural motion without a motion-capture system. They also allow flexible production: the positions of the models can be changed and the models can be duplicated, so dynamic 3D models are well suited to producing crowd scenes with numerous people. Although 3D-reconstruction and texture-mapping techniques have been discussed and several applications proposed, they have not been widely used in TV or movie production because of the data size and insufficient texture quality. By establishing an efficient production flow with recent powerful processors, we succeeded in producing scenes of sufficient quality for a TV program, which aired in December 2009. Scenes involving one thousand soldiers could be produced by shooting several sequences with only two actors. We describe the production flow we established and present some scenes we produced with dynamic 3D models for the drama.
    Download PDF (1967K)
  • Takuya Matsuo, Norishige Fukushima, Yutaka Ishibashi
    2012 Volume 66 Issue 11 Pages J434-J443
    Published: 2012
    Released on J-STAGE: October 25, 2012
    JOURNAL FREE ACCESS
    In this paper, we propose a refinement filter for depth maps that convolves an image and a depth map with a cross-computed kernel. We call it a weighted cross bilateral filter. The main advantages of the proposed method are that the filter fits the outlines of objects in the depth map to silhouettes in the image while reducing Gaussian noise in other areas, and that its computational cost is independent of the depth range. Thus, we can obtain accurate depth maps at lower cost than conventional approaches, which require Markov-random-field-based optimization. Experimental results show that the accuracy of the depth map in edge areas increases while the running time remains low.
    Download PDF (1590K)
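The cross (joint) bilateral idea underlying the filter can be sketched as follows: the depth map is smoothed with weights computed from the *color image*, so smoothing stops at image silhouettes and depth edges snap to them. This is a minimal sketch of the general cross bilateral concept, not the authors' exact weighted formulation; the parameters and the step-edge test data are hypothetical.

```python
import numpy as np

def cross_bilateral_depth(depth, image, radius=2, sigma_s=2.0, sigma_c=10.0):
    """Smooth `depth` with a kernel whose range weights come from
    `image` (grayscale), so depth edges follow image silhouettes."""
    h, w = depth.shape
    out = np.zeros_like(depth, dtype=float)
    for y in range(h):
        for x in range(w):
            ys = slice(max(0, y - radius), min(h, y + radius + 1))
            xs = slice(max(0, x - radius), min(w, x + radius + 1))
            dy, dx = np.mgrid[ys, xs]
            # spatial Gaussian weight on pixel distance
            spatial = np.exp(-((dy - y) ** 2 + (dx - x) ** 2) / (2 * sigma_s ** 2))
            # range Gaussian weight on *image* intensity difference
            color = np.exp(-((image[ys, xs] - image[y, x]) ** 2) / (2 * sigma_c ** 2))
            wgt = spatial * color
            out[y, x] = np.sum(wgt * depth[ys, xs]) / np.sum(wgt)
    return out

# step edge shared by image and depth: the filter preserves it
depth = np.zeros((6, 6)); depth[:, 3:] = 10.0
image = np.zeros((6, 6)); image[:, 3:] = 100.0
ref = cross_bilateral_depth(depth, image)
print(round(float(ref[0, 0]), 3), round(float(ref[0, 5]), 3))  # 0.0 10.0
```

Because the kernel is computed from the image rather than the depth values, the cost per pixel does not depend on the number of depth levels, which mirrors the depth-range independence claimed in the abstract.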
  • Yoshihiko Kawai, Hideki Sumiyoshi, Mahito Fujii, Nobuyuki Yagi
    2012 Volume 66 Issue 11 Pages J444-J452
    Published: 2012
    Released on J-STAGE: October 25, 2012
    JOURNAL FREE ACCESS
    News programs often contain sudden brightness changes caused by electronic flashes from still cameras. This paper proposes a method of correcting the video deterioration caused by flashes using a frame interpolation technique. The proposed method creates an interpolated frame from neighboring frames to replace the frame area altered by the flash luminescence. To estimate accurate motion vectors for frame interpolation, the proposed method adopts a novel block-based cost function that takes into account not only the frames before and after the flash but also the flash frame itself. In addition, hierarchical motion estimation and a new vector refinement filter are used to improve the interpolation quality. Experimental results on broadcast video show that the proposed method is superior to the conventional interpolation method.
    Download PDF (1370K)
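The block-based motion estimation that frame interpolation builds on can be sketched as a SAD (sum of absolute differences) search between the frames before and after the flash. This illustrates plain block matching only; the paper's cost function additionally weighs the flash frame itself, which is omitted here, and the block size, search range, and test frames are hypothetical.

```python
import numpy as np

def block_match(prev, nxt, by, bx, bs=4, search=2):
    """Find the motion vector (vy, vx) minimizing the SAD between a
    bs x bs block of `prev` at (by, bx) and candidate blocks of `nxt`
    within +/- `search` pixels."""
    base = prev[by:by + bs, bx:bx + bs]
    best, best_v = np.inf, (0, 0)
    for vy in range(-search, search + 1):
        for vx in range(-search, search + 1):
            y, x = by + vy, bx + vx
            if y < 0 or x < 0 or y + bs > nxt.shape[0] or x + bs > nxt.shape[1]:
                continue  # candidate block falls outside the frame
            sad = np.abs(base - nxt[y:y + bs, x:x + bs]).sum()
            if sad < best:
                best, best_v = sad, (vy, vx)
    return best_v

# synthetic frames: nxt is prev shifted down-right by one pixel
prev = np.arange(144, dtype=float).reshape(12, 12)
nxt = np.roll(prev, (1, 1), axis=(0, 1))
print(block_match(prev, nxt, 4, 4))  # (1, 1)
```

An interpolated frame would then be synthesized by projecting blocks halfway along such vectors, which is the replacement step the abstract applies to flash-altered areas.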
  • Yang Song, Xiaolin Zhang
    2012 Volume 66 Issue 11 Pages J453-J460
    Published: 2012
    Released on J-STAGE: October 25, 2012
    JOURNAL FREE ACCESS
    A human-like active binocular vision system, inspired by binocular eye movements in animals, would help robots with automatic fast target switching, smooth target pursuit, and efficient visual stabilization. In this paper, a control model that integrates saccadic eye movement, smooth pursuit eye movement, the vestibulo-ocular reflex, and the optokinetic response is proposed. The control interface of the model is simplified to a single external saccadic command input. With this target selection command, target switching, target pursuit, and visual stabilization of the cameras run automatically. To implement the system with parallel processing, as in a neural network, the control model and multi-motor control are implemented on an FPGA chip. Finally, the proposed model was tested using an image-processing PC and a binocular robot head, and the results show the high efficiency of this control model.
    Download PDF (1799K)
  • Kentaro Ishitoya, Keisuke Yamamoto, Shigeki Ohira, Katashi Nagao
    2012 Volume 66 Issue 11 Pages J461-J470
    Published: 2012
    Released on J-STAGE: October 25, 2012
    JOURNAL FREE ACCESS
    Reading as many technical documents as possible is important for improving our research. When reading such documents, we can understand the content more easily by referring to related resources such as images, audio clips, and videos. Videos contain a variety of helpful information and facilitate our understanding of technical documents. We propose a method of defining video scenes and document elements and annotating them with additional information such as the relationships between them. On the basis of these annotations and relationships, we developed a support system that uses videos to help readers understand technical documents. We performed experiments to confirm that the system is usable.
    Download PDF (879K)