The Journal of The Institute of Image Information and Television Engineers

Focus

Mass Disaster Victim Identification and the Role of Information Technology

Takafumi Aoki

2011Volume 65Issue 12 Pages k20
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.k20

JOURNAL FREE ACCESS

Download PDF (412K)

Technical Guide

Lecture

Development and Evolution of Broadcast Technologies; That support the news on the Great East Japan Earthquake

Kenji Nagai

2011Volume 65Issue 12 Pages 1677-1684
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1677

JOURNAL FREE ACCESS

Download PDF (1470K)

3D that Everyone Understands(The Last Chapter)

The Future Directions of 3D and Interaction Techniques

Daisuke Iwai, Kosuke Sato

2011Volume 65Issue 12 Pages 1718-1722
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1718

JOURNAL FREE ACCESS

Download PDF (4805K)

Image, Audio, and Tactile Information Technology for Human Welfare

1. Research Subjects and Prospects of Image, Audio and Tactile Information Technology for Human Welfare; As an example of sensori-motor communication aids

Tohru Ifukube

2011Volume 65Issue 12 Pages 1685-1689
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1685

JOURNAL FREE ACCESS

Download PDF (3229K)
2. Tactile Interface Technology for the Blind and Deafblind Persons

Naoyuki Okouchi, Tadahiro Sakai

2011Volume 65Issue 12 Pages 1690-1695
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1690

JOURNAL FREE ACCESS

Download PDF (945K)
3. Universal Image Processing Technology for High Definition Television

Masato Nagasawa, Yoshitomo Nakamura, Kagari Kawakatsu, Takehiko Ishizu ...

2011Volume 65Issue 12 Pages 1696-1700
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1696

JOURNAL FREE ACCESS

Download PDF (750K)
4. Audio Signal Processing for Elderly People

Atsushi Imai

2011Volume 65Issue 12 Pages 1701-1704
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1701

JOURNAL FREE ACCESS

Download PDF (422K)

Technical Survey

Recent Trends in 100 Gigabit Ethernet and its Related Technologies

Osamu Ishida

2011Volume 65Issue 12 Pages 1705-1711
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1705

JOURNAL FREE ACCESS

Download PDF (625K)

Topics

Report on IFA2011 and IBC2011

Yoshikazu Yamada, Kazuhiro Kamimura

2011Volume 65Issue 12 Pages 1712-1717
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1712

JOURNAL FREE ACCESS

Download PDF (2371K)

Special Edition

Keywords you should know(71)

Color Space

Kenji Yokoi

2011Volume 65Issue 12 Pages 1723-1725
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1723

JOURNAL FREE ACCESS

Download PDF (661K)

My Recommendations on Research and Development Tools(54)

Efficient 3D Video Processing in MATLAB

Toshiaki Fujii

2011Volume 65Issue 12 Pages 1726-1728
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1726

JOURNAL FREE ACCESS

Download PDF (1072K)

Activity Notes on Standardization(18)

International Standardization of the Next Generation STB for Cable Television

Satoshi Miyaji

2011Volume 65Issue 12 Pages 1729-1732
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1729

JOURNAL FREE ACCESS

Download PDF (5198K)

Report

2011 ITE Annual Convention

Seishi Takamura

2011Volume 65Issue 12 Pages 1733-1736
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1733

JOURNAL FREE ACCESS

Download PDF (545K)

News

News

[in Japanese], [in Japanese]

2011Volume 65Issue 12 Pages 1737-1738
Published: 2011
Released on J-STAGE: December 01, 2013

DOIhttps://doi.org/10.3169/itej.65.1737

JOURNAL FREE ACCESS

Download PDF (366K)

Psychological Study of Cognitive Model of Movie Recognition and Comparison with Theoretical Movie-analysis Models

Junji Ohyama, Katsumi Watanabe

2011Volume 65Issue 12 Pages 1813-1816
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1813

JOURNAL FREE ACCESS

Show abstractHide abstract

We proposed a new cognitive visual model of movie recognition based on our previous findings of psychophysical phenomena. Our cognitive model suggested two important functions of movie recognition process. First, a continuous movie sequence was divided and perceived as serial event segments of short scenes. The movie would be coded for each of the segments and structuralized as a contextual association of each segments. Second, the knowledge structure of the context of previously viewed movies was used to predict the ongoing movie context and the online segmentation. We compared our cognitive model with a previously proposed theoretical model of movie processing. The results of our experiments supported our hypothesis: an adaptive learning mechanism of online movie segmentation would be effective for an intelligent knowledge-based structure of a future movie analysis system.

View full abstract

Download PDF (694K)
Coefficient-reversed Bilateral Filter for Image Halftoning with Error Diffusion

Xiaoyu Fu, Tao Wang, Kiichi Urahama

2011Volume 65Issue 12 Pages 1817-1820
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1817

JOURNAL FREE ACCESS

Show abstractHide abstract

We present a new method for halftoning images by scanning the pixels in a random order and diffusing the errors by using a coefficient-reversed bilateral filter. The random traversal of pixels avoids creating artificial dot patterns that usually appear in conventional halftone images by raster scanning the pixels. The coefficient-reversed bilateral filter is effective at enhancing the edges. We apply this halftoning method to the non-photorealistic rendering of stippling images.

View full abstract

Download PDF (2811K)

Numerical Image Formation by Lens for Wave-Optical Simulation of Reconstruction of Full Parallax Computer-Generated Hologram

Kazuya Murakami, Kyoji Matsushima

2011Volume 65Issue 12 Pages 1793-1800
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1793

JOURNAL FREE ACCESS

Show abstractHide abstract

A novel method is proposed to provide exact prediction of the optical reconstruction of full-parallax computer generated holograms (CGHs). The wave field emitted from the CGH is numerically propagated into a pupil, and then the image formation by a lens is simulated by a procedure based on wave optics. Two numerical techniques for free space propagation, the shifted-Fresnel diffraction and rotational transformation, are used for the numerical calculation. Our method thus enables the viewpoint to be moved and the gaze point to be changed. This technique allows us to confirm the effect of non-diffraction light and the conjugate image of full-parallax large-scaled CGHs prior to their fabrication.

View full abstract

Download PDF (2626K)
Time-Of-Flight 3D Range Camera Based on Code Modulating Technique

Yusuke Hashimoto, Kenichi Murakami, Kenji Taniguchi

2011Volume 65Issue 12 Pages 1801-1807
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1801

JOURNAL FREE ACCESS

Show abstractHide abstract

We developed a new signal processing technique for a 3D real-time imager that is based on the time-of-fight method. The use of a code-signal modulation technique enables us to solve some of the problems with the distance measurements. One problem is the ambiguous measurement caused by the cyclic repetition of the signal phase according to the propagation distance when using the conventional CW modulation technique. The second problem is caused by the mixture of signals reflected on the glass panel and at the object through the glass. The code-signal modulation technique with the noise filter excludes the ambiguous signals reflected from the background objects at long distances over the modulation period. In addition, the signal correction process extracts the intended signal reflected at the object through the glass. With this novel signal-processing the utility of the 3D real-time imager can be further extended.

View full abstract

Download PDF (2632K)
Roll-Type Optical Advanced Memory with High Recording Capacity

Masatoshi Tsuji, Wataru Inami, Yoshimasa Kawata, Masaharu Ito

2011Volume 65Issue 12 Pages 1808-1812
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1808

JOURNAL FREE ACCESS

Show abstractHide abstract

We propose a newly developed roll-type optical memory (RoCAM). This memory is a multilayered memory and has a cylindrical shape. It consists of a recording and transparent layers wound onto a shaft. A RoCAM has five advantages. First, the media is easily fabricated. Second, the groove structures in RoCAM are easily implemented. Third, it has parallel recording and reading. Fourth, there is stable rotation and, finally, constant linear velocity. We report these advantageous features and the one-dimensional parallel signal readout of RoCAM.

View full abstract

Download PDF (1359K)

Effects of Target Motion on Binocular Stereopsis and Monoptic Depth

Yasuaki Tamada, Takayuki Hino, Hitomi Ikeura, Kohei Miura, Masayuki Sa ...

2011Volume 65Issue 12 Pages 1776-1782
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1776

JOURNAL FREE ACCESS

Show abstractHide abstract

It is widely believed that excessive binocular disparity produces diplopia without clear depth perception. However, recent studies have reported that diplopic images with a very large disparity appear clearly in depth when they move. It is currently unclear whether this facilitation of stereoscopic depth caused by target movement in diplopic images requires the involvement of both eyes. A monocular target stimulating the nasal or temporal retina of either eye appears in depth as if it has uncrossed or crossed disparity, respectively (i.e. monoptic depth). In the present study we examined the dynamic properties of monoptic depth and binocular stereopsis. Two small circular targets were presented 5 degrees above and below a fixation point and oscillated horizontally in counter phase. With binocular stereopsis, disparities with the same magnitude and opposite polarity were applied to the two targets. With monoptic depth, targets were removed for either eye. The results revealed that target motion facilitated binocular stereopsis but not monoptic depth. These findings suggest that corresponding target images stimulating both eyes are necessary for a depth of large magnitude to be perceived in motion, in spite of diplopia.

View full abstract

Download PDF (667K)
Precise and Real-time Pupil Position Estimation in Color Camera Face Images Based on Near-infrared Pupil Detection Method

Keisuke Matsumura, Yoshinobu Ebisawa

2011Volume 65Issue 12 Pages 1783-1787
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1783

JOURNAL FREE ACCESS

Show abstractHide abstract

A recent finding showed that infants with autism do not tend to look into their mother's eyes. The disease can be diagnosed by examining the eccentricity of the infant's gaze distribution from the mother's eyes and showing a video of the mother's face on a display screen. In the present study, to cope with a lot of examination, we develop a system that immediately detects the pupil positions in the mother's face video provided by the color camera using our robust pupil detection technique. The system consists of one color camera and two monochrome cameras with near-infrared light sources. All cameras are calibrated. The monochrome camera determines the 3D positions of the mother's pupil centers, which are transformed into the coordinates in the color camera image. These processes were easily performed with 60 fps. In addition, the experimental results show the precise pupil center detection in the color face videos.

View full abstract

Download PDF (1397K)
Communication Aid for Foreigner and Hearing Impairment at Station

Kaoru Nakazono, Mari Kakuta, Yujji Nagashima

2011Volume 65Issue 12 Pages 1788-1792
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1788

JOURNAL FREE ACCESS

Show abstractHide abstract

We developed a communication aid, called VUTE, which aids people who have difficulty in spoken communication such as elderly people with hearing impairments, deaf people and foreigners. We present an overview of VUTE 2010 that can be used at railway stations. This system displays picture symbols on a portable data terminal and prompts the user to select the appropriate symbol. Finally, it outputs sentences which correspond to the answers given by the user. We also carried out an evaluation test on seven subjects in their fifties or sixties. We confirmed that VUTE can output sentences that correspond to the situations which had been arranged in advance, without using written letters/characters or voices.

View full abstract

Download PDF (1299K)

Influence of Network Delay on Quality of Experience in Free-Viewpoint Video Transmission

Ayano Tatematsu, Bohai Liu, Norishige Fukushima, Yutaka Ishibashi

2011Volume 65Issue 12 Pages 1742-1749
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1742

JOURNAL FREE ACCESS

Show abstractHide abstract

We investigated the effect of network delay on changing viewpoint in free-viewpoint video transmission by conducting a Quality of Experience (QoE) assessment. We address two transmission methods: synthesized image transmission and depth and image transmission. We assessed the image quality, interactivity of changing viewpoint, and comprehensive quality as QoE factors. The assessment results indicate that the image quality of the synthesized image transmission is higher than that of the depth and image transmission, which is advantageous in terms of interactivity. Also, because the inferior-to-superior relationship between the two methods depends on the characteristics of the video content and camera work used in the rendering process for the comprehensive quality, we should choose one suitable method from the two methods according to the situation.

View full abstract

Download PDF (1992K)
A Method for Displaying Timing between Speaker's Face and Captions for a Real-time Speech-to-Caption System

Hayato Kuroki, Shuichi Ino, Satoko Nakano, Kotaro Hori, Tohru Ifukube, ...

2011Volume 65Issue 12 Pages 1750-1757
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1750

JOURNAL FREE ACCESS

Show abstractHide abstract

We have been studying a real-time speech-to-caption system using speech recognition technology with a repeat speaking method. In this system, we used a repeat speaker who listens to a lecturer's voice and then speaks back the lecturer's utterances into a speech recognition computer. Our developing system showed that the accuracy of the captions is about 97% in Japanese-Japanese conversion, and the conversion time from voices to captions is about 4 seconds in English-English conversion in some international conferences. Of course it required a lot of costs to achieve these high performances. In human communications, speech understanding depends not only on verbal information but also on non-verbal information such as speaker's gestures and face and mouth movements. Therefore, we found a suitable way to display the information of captions and speaker's face movement images to achieve higher comprehension after briefly storing information once into a computer. In this paper, we investigated the relationship of the display sequence and display timing between captions that have speech recognition errors and the speaker's face movement images. The results showed that the sequence displaying the caption before the speaker's face image improved the comprehension of the captions. The sequence displaying both simultaneously showed an improvement of only a few percent higher than that of the question sentence, and the sequence displaying the speaker's face image before the caption showed almost no change. In addition, the sequence displaying the caption 1 second before the speaker's face showed the most significant improvement of all the conditions in the hearing-impaired.

View full abstract

Download PDF (1126K)
Static and Dynamic Characteristics of Accommodation and Vergence Responses while Viewing Stereoscopic Displays and Real Objects

Haruki Mizushina, Ippei Negishi, Hiroshi Ando, Shinobu Masaki

2011Volume 65Issue 12 Pages 1758-1767
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1758

JOURNAL FREE ACCESS

Show abstractHide abstract

The conflict between accommodation and vergence stimuli has been identified as a possible cause of visual fatigue from viewing stereoscopic images. We examined static and dynamic characteristics of accommodation and vergence responses while viewing stereoscopic displays and real objects to clarify the effect of stereoscopic images on the visual function. We used an instrument based on the Shack-Hartmann wavefront sensor to measure accommodation and vergence responses simultaneously. Accommodation responses to the static stereoscopic target with large binocular disparity were deviated from those to the real target, i.e., static characteristics. The step responses of the accommodation response showed considerable individual differences, i.e., dynamic characteristics. In addition, the asymmetries of step responses of accommodation were observed between the near-to-far and the far-to-near step directions. These results suggest that we need to examine both static and dynamic characteristics of accommodation and vergence responses to clarify the biological effect of stereoscopic images.

View full abstract

Download PDF (2735K)
Remote Eye-gaze Tracking System by One-point Gaze Calibration

Yoshinobu Ebisawa, Kazuki Abo, Kiyotaka Fukumoto

2011Volume 65Issue 12 Pages 1768-1775
Published: December 01, 2011
Released on J-STAGE: December 20, 2011

DOIhttps://doi.org/10.3169/itej.65.1768

JOURNAL FREE ACCESS

Show abstractHide abstract

Conventional gaze tracking systems are burdensome in that they require the user to gaze at several targets on the PC screen in the user-calibration process. The proposed calibration procedure requires the user to gaze at only one target. The implemented system consists of four camera-calibrated, wide-view video cameras arranged around the screen, with near-infrared light-emitting diode (LED) lights attached to each camera. The angle θ between the line of sight and the line connecting the center of the pupil and the camera (LED lights) is related to the vector from the center of the pupil to the corneal reflection detected from the video image. The user-calibration process makes it possible to determine three parameters, which can be achieved using three of the four cameras. Usually, the larger that angle θ is, the worse the gaze detection precision is. A weighted mean method is proposed to determine the final precise gaze point.

View full abstract

Download PDF (1614K)

Register with J-STAGE for free!