The 35th Special Issue on Advanced Image Input/Output Technologies for Large Volumes of Image Data
-
Masayuki TANAKA, Masatoshi OKUTOMI
2007Volume 36Issue 5 Pages
631-640
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
Many consumer digital color cameras have a single image sensor with a color filter array. The data captured by the single image sensor are called raw data. An effective compression of the raw data is highly demanded. This paper proposes a raw data compression method using existing image coding framework. The proposed coding method is performed to minimize error between the observed raw data and the decoded raw data. Experimental comparisons demonstrate that the proposed method has high performance compared to existing methods.
View full abstract
-
Shoji YAMAMOTO, Naotaka OBARA, Norimichi TSUMURA, Toshiya NAKAGUCHI, Y ...
2007Volume 36Issue 5 Pages
641-649
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
We have developed an efficient BTF compression and reproducing technique based on a dichromatic reflection model. For the reproduction of the gloss appearance of rough surface, Bi-directional Texture Function (BTF) rendering is thought to be used in this system. However, efficient compression and reproduction methods are required for real-time rendering. In this paper, we propose an efficient approach of BTF rendering by separating the reflected light to the specular, the cast shadow and the invariant diffuse elements. The efficient compression for the BTF rendering is performed by applying the Local Principal Component Analysis (LPCA) to the specular and the cast shadow images, since their appearance depends on the direction of the viewing or illumination. This method performs accurate reproduction by fewer principal components in comparison with the conventional method. The final projection image in an arbitrary direction of the viewing and illumination is synthesized from the specular reflection image, the cast shadow image and the invariant diffuse reflection image. Finally, the proposed method is implemented on the appearance-based display, and demonstrated high-speed BTF rendering which is two times faster than the conventional method.
View full abstract
-
Takeshi ITO, Sou NAKAMURA, Satoshi GOTO, Takeshi IKENAGA
2007Volume 36Issue 5 Pages
650-656
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
Since JPEG 2000 is selected as the standard of Digital Cinema, it attracts a great deal of attention. However, JPEG 2000 coding requires huge amount of computational complexity for a high definition image with more than 200 million pixels required in Digital Cinema standard. Especially, EBCOT is a bottleneck because of its difficulty in parallel processing. This paper proposes a parallel coefficient bit modeling (CBM) algorithm which can process a bit plane, a stripe and a pass in parallel and a CBM architecture based on it. Hardware evaluation results show that a 733k Gate (TSMC 0.18 μm) circuit can process the code block (block size 32×32, 30bit planes) which satisfies Digital Cinema standard in real time.
View full abstract
-
Tomonori KANAZAWA, Masanori IZUMIDA, Kenji MURAKAMI, Shun IDO
2007Volume 36Issue 5 Pages
657-664
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
The Internet has dramatically increased the number of its user in exchanging image data. The amount of image data is huge compared with that of text data. Even under these conditions, users would like to recognize the images as early as possible. Previously, Internet users have used Progressive JPEG which can be seen an approximation to the whole image very quickly, with gradual improvement of quality as one waits longer. However, the order of data transmission based on DCT coefficients has not been defined. As a result, it has caused problems that an unnatural improvement of displaying the image occurs depending on network conditions. In this paper, we propose a variable frequency band division, and apply our method to DWT coding for mobile devices, such as cellular phones and PDAs. Additionally, we attempt evaluation experiments for the methods availability.
View full abstract
-
Kohei ARAI, Kaname SETO
2007Volume 36Issue 5 Pages
665-673
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
Watermark method is proposed for improvement of data hiding performance. Digital watermarking in MRA: Multi Resolution Analysis is used abundantly. It is the method of embedding restricted data at the specific frequency component after wavelet decomposition, and since restricted data is also discoverable by exploring at what frequency component it is embedding, the subject is left behind to privacy. The method of giving eigenvalue decomposition as this preprocessing is proposed. Since only the author who holds the original picture image can know the eigenvalue of an original picture image, it is only an author to restore this. However, if this method permits a certain amount of error using the circulation picture embedding restricted data, since it can presume the approximate eigenvalue, its privacy is inadequate. Here, the preprocessing which gives oblique-coordinates conversion after eigenvalue decomposition is proposed further. Since the degree of slanting crossing angle of oblique coordinates can be set up arbitrarily, privacy improves. Thus, digital watermarking to propose is excellent in privacy. The effect was checked using the picture by which this is used abundantly as a standard picture of data compression. It is confirmed that data hiding performance of the proposed method is almost double compared to the conventional method based on MRA.
View full abstract
-
Kouhei SUGATA, Ryo OHTERA, Takahiko HORIUCHI
2007Volume 36Issue 5 Pages
674-679
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
The Retinex is a human perception-based image processing algorithm which provides color constancy and dynamic range compression. A real-time processing of the Retinex is required to meet the needs of various potential users. The Retinex processing contains a relatively large number of complex computations, thus to achieve real-time performance using current technologies requires specialized hardware and software. However, there is a limit to the image size that the Retinex can output an improved image in a short time even if any machines are used. As one of the solutions by approaching from software implementation, in this paper, we propose a serial Retinex algorithm for improving appearance of spatially localized images. Our algorithm does not output a whole image by the Retinex at a time, but the algorithm calculates it consecutively and output it sequentially. So, with progress of processing time, the appearance will be gradually improved and the processing region will be gradually spread out. We verified the effectiveness of the proposed algorithm by experiments.
View full abstract
-
Takahiro MORI, Motonobu TONOMURA, Yuuji OHSUMI, Satoshi GOTO, Takeshi ...
2007Volume 36Issue 5 Pages
680-687
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
This paper proposes an interpolation algorithm and LSI architecture for a camera with a fish-eye lens. Since the fish-eye lens has around 180 degree field of view and a deep depth of field, it's very promising for a surveillance camera, a video conference system, and so on. Combination of a cubic spline interpolation and a direct interpolation from a RAW image enable to obtain high quality image. A three-stage linear interpolation scheme and a dedicate operators boost up processing performance. Evaluation results base on a subjective test show that our proposed algorithm can generate high-quality interpolation image. Furthermore, a dedicated image processing core with 290k gates can process HDTV (1920×1080, 60fps) image by using 0.18 μm CMOS technology.
View full abstract
-
Miho KIKUCHI, Kazutoshi TAGAI, Teppei TAKANO, Hiroshi KANEKO
2007Volume 36Issue 5 Pages
688-700
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
This paper describes a method and experiment on online handprinted “Hiragana” character recognition. The recognition process is composed of a rough classification step by Procrustes analysis, which is a generalization of linear regression model, and a detailed recognition step using vector correlation. The Procrustes analysis is a permissive one for rotation changes. And vector correlation is a excellent measure for acquiring directional information about local strokes. The proposed method is corroborated by a computer simulation, which correctly classified 99.2% of 2300 patterns in 46 categories.
View full abstract
-
Koji ABE, Mituhiro NENDAI, Takahiro HAYASHI, Haruhiko KIMURA
2007Volume 36Issue 5 Pages
701-709
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
Contents of sports tickers in broadcasted sports videos could be efficient as queries in analyzing the video contents. However, since sports tickers are displayed with various colors and brightness, it is difficult to recognize inner objects of the sports tickers only according to the brightness distribution. Besides, in order to recognize what the objects represent, it is indispensable to investigate the change of these objects according to time traveling. In this paper, the proposed method can recognize locations of the out ticker, score tickers, and runner markers in the sports ticker of broadcasted baseball videos analyzing the brightness histograms obtained from every of grayscale and color images, and the change of the sports ticker with the correlation coefficient. Experimental results with four broadcasted baseball videos have shown that the proposed method can exactly work in the case when the out ticker is represented by a letter.
View full abstract
-
Jing WANG, Takeshi IKENAGA, Satoshi GOTO
2007Volume 36Issue 5 Pages
710-720
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
Contour detection is a fundamental step to scene analysis and interpretation. However, because contours often locate in rich texture background it is still a difficult task in realistic vision. Through multi-scale analysis, it becomes clear that edge responses of real object contours are relatively stable across scales, while those from noise or texture background are not. In this paper, a salient contour detection method is proposed based on the scale invariance of piecewise linear approximation of real object contours. Firstly, an image pyramid is efficiently constructed by repeatedly smoothing and sub-sampling the image. Secondly, the piecewise linear approximation of contours in multiple scales are extracted and then a collinear line grouping process is implemented to improve the connectivity of the contours. Thirdly, the new salient line segments are generated based on the analysis on the stability of line segments across scales. Experimental results show that the proposed method can effectively improve the connectivity and saliency of the contour detection compared with the former method.
View full abstract
-
Hiroshi IKEDA, Tomoyuki YAMAGUCHI, Shuji HASHIMOTO
2007Volume 36Issue 5 Pages
721-730
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
In the field of meteorology, the analysis of satellite images is important to understand current and future atmospheric states, as the satellite images have advantages on range and frequency of the observations. In this study, we introduce a system of cloud classification and detection of cirrus streak for comprehensive and structural understandings of atmospheric states. In the cloud classification, the proposed system classifies clouds into appropriate cloud classes using the past cloud classes and cloud velocity based on the strong correlation of successive satellite images. Moreover, our system detects cirrus streaks using cloud shape, velocity and cloud classes obtained by the cloud classification. In the experiments, the effectiveness of the proposed system is investigated with actual satellite images.
View full abstract
-
Hisae SHIBUYA, Takafumi OKABE, Yasuo NAKAGAWA
2007Volume 36Issue 5 Pages
731-737
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
Real-time defect classification on wafer inspection is important for semiconductor manufacturing. Rule-based classification is popularly employed because of its advantages that classification rules can be decided based on theories and experiences and can be easily understood by anyone. But it is difficult for non-experts to decide them. So we propose automatic rule generation algorithm based on teaching of feature data with correct defect classes. The proposed algorithm constructs a decision tree based on separability measures, which are defined using entropy and c-squared statistic. The algorithm was evaluated using a set of feature data obtained with a wafer inspection tool. The data set included 186 defects of six classes and its classification accuracy was 86.6%. This result shows the algorithm works usefully when features are appropriate for classification.
View full abstract
-
Joo Kooi TAN, Kensuke KOUNO, Seiji ISHIKAWA, hyoungSoup KIM, Takashi S ...
2007Volume 36Issue 5 Pages
738-746
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
This paper proposes a technique for high speed recognition of human motions. For an intelligent robot which will help human lives in various ways, it is indispensable to recognize human motions or behaviors irrespective of its observing direction. For this purpose, we propose a structured motion database provided by an eigenspace having a B-tree structure within it. In the motion database, every appearance of a human motion is to be registered. Another idea for speeding up the recognition is to describe a motion in a compressed form. We employ a JK method to represent a motion sequence in a single image. Performed experiment showed satisfactory results.
View full abstract
-
Yutaka MARUOKA, Shuhei TAKEMOTO, Yasuhiko WATANABE, Yoshihiro OKADA
2007Volume 36Issue 5 Pages
747-755
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
In this paper, we propose a method of surface details representation based on Level-of-Detail (LOD) of haptic feature. Previous studies did not care for LOD control of surface details: Surface details were generally compressed collision detection and their details were lost. To solve this problem, we propose a method of compressing surface details. Our method realizes two rendering methods: the rendering based on conservation of structure features and the rendering based on selection of partial shapes.
View full abstract
-
Masanori UNNO, Takanari TANABATA, Hiroko (NAKAMURA) MIYAMURA, Tomoko S ...
2007Volume 36Issue 5 Pages
756-761
Published: 2007
Released on J-STAGE: August 25, 2011
JOURNAL
FREE ACCESS
In this paper, we propose the image analysis for growth development of rice leaves movement. We propose the technique of imaging for growth of rice and of the visualization for growth of the leaf using temporal-spatial analysis and apply these techniques to growth analysis. To be established the visualization technique for leaf movement, we corrected image series of rice growing during some month and processed acquired image series coordinate conversion and projection space-time map. Observation of the physiological phenotypes in the growth of rice, such as change of the movement pattern, circumference, and growth movement, can be performed using the visualized result.
View full abstract