-
Kazuo Ishino
2003 Volume 57 Issue 7 Pages
772-777
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
-
A Possibility for High Presence Audio Reproduction on Broadcasting
Hiroyuki Okubo
2003 Volume 57 Issue 7 Pages
778-780
Published: July 01, 2003
Released on J-STAGE: August 17, 2011
JOURNAL
FREE ACCESS
-
How Do You Show the Cinema with Pleasure?
Masaki Morimoto
2003 Volume 57 Issue 7 Pages
781-785
Published: July 01, 2003
Released on J-STAGE: August 17, 2011
JOURNAL
FREE ACCESS
-
Koji Kondo
2003 Volume 57 Issue 7 Pages
786-788
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
-
Kazuyuki Ikedo
2003 Volume 57 Issue 7 Pages
789-791
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
-
Shinichi Oda
2003 Volume 57 Issue 7 Pages
792-795
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
-
How to Enjoy Video-Editing
Hiroshi Shimoda
2003 Volume 57 Issue 7 Pages
796-799
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
-
Shigeki Nakamura
2003 Volume 57 Issue 7 Pages
800-801
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
-
Manabu Ito, Haruo Hiki, Takuyo Kogure, Youichi Ishibashi, Fumio Hasega ...
2003 Volume 57 Issue 7 Pages
812-818
Published: July 01, 2003
Released on J-STAGE: August 17, 2011
JOURNAL
FREE ACCESS
In the broadband era, video distribution is being accelerated, and this requires metadata technology for retrieving desired video materials This paper describes a study of a profile for video retrieval based on a layered structure that uses MPEG-7 descriptors. The profiles purpose is to facilitate efficient retrieval of video materials in an archive by means of the internet
View full abstract
-
Hang Fu, Takayuki Nagai, Masahide Kaneko, Akira Kurematsu
2003 Volume 57 Issue 7 Pages
819-828
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
Recognizing characters on signboards in a scene would be very useful for translation, aquiring relevant information through the Internet, etc. This paper presents a method for extracting characters on signboards in arbitrary scene images that does not require any manual operations. First, we describe briefly the framework of an information handling system driven by characters on signboards in scene images. Next, a method for detecting signboard regions and individual character regions is investigated in consideration of signboards characteristics. The proposed method for detecting signboard regions is based on hierarchical clustering in the
L* a* b* color space. Features of contour shape are examined to narrow the candidates of signboard regions. In addition, merging of adjacent candidate regions and rough checking of character existence are performed. Finally, each character on the signboard region is extracted by using binarization followed by horizontal and vertical projections. Experimental results from applying the method to 1000 various scene images show the effectiveness of the proposed method.
View full abstract
-
Masahito Kumano, Yasuo Ariki, Kenji Shunto, Kiyoshi Tsukada
2003 Volume 57 Issue 7 Pages
829-839
Published: July 01, 2003
Released on J-STAGE: August 17, 2011
JOURNAL
FREE ACCESS
Video editing is used to produce a final version with a specific duration by finding and selecting appropriate shots from the raw video material and connecting them. Video editing process is generally conducted according to the special rules called “video grammar” in order to produce excellent and intelligible videos for broadcasting. However, this editing consumes a lot of human editor's working time. To solve this problem, an efficient and new video editing technique or system is required. The goal of this study is to develop an intelligent support system for video editing based on video grammar. This paper proposes a method of using camerawork density, camerawork instability, and cut point parameters to automatically segment the raw video materials into useful sections and useless sections. The method is based on video grammar as a part of the video editing support system.
View full abstract
-
Jun-ichiro Hayashi, Shuu Li, Hiroyasu Koshimizu
2003 Volume 57 Issue 7 Pages
840-846
Published: July 01, 2003
Released on J-STAGE: August 17, 2011
JOURNAL
FREE ACCESS
This paper proposes a Hough transform that improves performance and reduces fabrication costs as compared to the conventional Hough transform. It is called the Digital Template. Hough Transform (DTHT) and is capable of higher detectability of shorter edges that surpasses the original Hough line detection. Apart from the superiority of the direct line segment detection, since DTHT requires a 4-dimensional parameter space for line segment detection, its computation cost is high. We introduce several countermeasures to reduce computation costs, and demonstrate the practical performance of DTHT. Wrinkle detection by DTHT was executed to show the novel detectability of both longer and shorter edge segments.
View full abstract
-
Hitoshi Yamauchi, Hiromitsu Takahashi
2003 Volume 57 Issue 7 Pages
847-853
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
Recently, methods and techniques for Intelligent Transport Systems (ITS) are being researched actively. Many road environment detection systems that assist drivers' judgement and compensate misdetections of road conditions ahead are being proposed. In many of these systems, the recognition of road signs is achieved by resizing images and using template matching. However, resizing an image reduces its quality, and causes a system to fail in matching. In this paper, a recognition method based on vectors is proposed. After calculating vectors of each pixel, outline vector lists are generated by tracing the vectors. Then, signatures of the image are generated from previously detected outline vector lists. Recognition is processed by using string matching technique to match these signatures to template signatures.
View full abstract
-
Bin Chen, Mitsuhiko Meguro, Masahide Kaneko
2003 Volume 57 Issue 7 Pages
854-863
Published: July 01, 2003
Released on J-STAGE: August 17, 2011
JOURNAL
FREE ACCESS
The ability to interact with multiple users as well as to recognize the ways in which people are interacting is essential for a robot participating in society. Forming shared attention is regarded as fundamental in improving the flexibility of human-robot interaction. This paper proposes an algorithm that enables shared attention between a robot and speakers in alternative conversational situations. We firstly present a microphone array technique and a method to combine auditory and visual information for estimating the physical location of the sound source. Secondly, the speaker is detected according to the linear combination of the result of sound source localization and the likelihood map representing the distribution of pixels having skin color. Finally, in the conversational situations, the speaker's focus of attention is alternatively detected and shared by the robot for upcoming communication based on the proposed algorithm. Several experimental results are presented that demonstrate the effectiveness of the proposed method.
View full abstract
-
Megumi Takezawa, Miki Haseyama, Hideo Kitajima
2003 Volume 57 Issue 7 Pages
864-867
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
This paper proposes a fast search technique using a genetic algorithm (GA) including a simulated annealing (SA) algorithm for the optimal parameters of the iterated function system (IFS) utilized in fractal image coding. The heavy computational costs of the previous methods to find the optimal IFS parameters are a serious problem, and in order to overcome it, we have already proposed a GA-based technique to find them in short time. However, its reduction is not enough for the practical use. Therefore, the proposed method achieves further reduction by including an SA search in the GA search. Some simulation results show that the proposed method achieves more reduction in the computational costs than the only GA-based method does.
View full abstract
-
Asako Fujii, Mitsuhiko Meguro, Masahide Kaneko
2003 Volume 57 Issue 7 Pages
868-872
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
-
Seishi Takamura, Yoshiyuki Yashima
2003 Volume 57 Issue 7 Pages
873-877
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS
-
Katsuyuki Matsui, Takashi Tachibana, Masaaki Fujiyoshi, Hitoshi Kiya
2003 Volume 57 Issue 7 Pages
878-881
Published: July 01, 2003
Released on J-STAGE: August 17, 2011
JOURNAL
FREE ACCESS
-
Gosuke Ohashi, Sinji Ide, Yoshifumi Shimodaira
2003 Volume 57 Issue 7 Pages
882-885
Published: July 01, 2003
Released on J-STAGE: March 14, 2011
JOURNAL
FREE ACCESS