IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Online ISSN : 1745-1337
Print ISSN : 0916-8508
Volume E104.A, Issue 6
Displaying 1-13 of 13 articles from this issue
Special Section on Image Media Quality
  • Gosuke OHASHI
    2021 Volume E104.A Issue 6 Pages 845
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    JOURNAL FREE ACCESS
    Download PDF (242K)
  • Kensho HARA
    Article type: INVITED PAPER
    2021 Volume E104.A Issue 6 Pages 846-856
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: December 07, 2020
    JOURNAL FREE ACCESS

    The performance of video action recognition has improved significantly in recent decades. Current recognition approaches mainly utilize convolutional neural networks to acquire video feature representations. In addition to the spatial information of video frames, temporal information such as motions and changes is important for recognizing videos. Therefore, the use of convolutions in a spatiotemporal three-dimensional (3D) space for representing spatiotemporal features has garnered significant attention. Herein, we introduce recent advances in 3D convolutions for video action recognition.

    Download PDF (680K)
  • Hiroaki KUDO, Tetsuya MATSUMOTO, Kentaro KUTSUKAKE, Noritaka USAMI
    Article type: PAPER
    2021 Volume E104.A Issue 6 Pages 857-865
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: December 08, 2020
    JOURNAL FREE ACCESS

    In this paper, we evaluate a prediction method of regions including dislocation clusters which are crystallographic defects in a photoluminescence (PL) image of multicrystalline silicon wafers. We applied a method of a transfer learning of the convolutional neural network to solve this task. For an input of a sub-region image of a whole PL image, the network outputs the dislocation cluster regions are included in the upper wafer image or not. A network learned using image in lower wafers of the bottom of dislocation clusters as positive examples. We experimented under three conditions as negative examples; image of some depth wafer, randomly selected images, and both images. We examined performances of accuracies and Youden's J statistics under 2 cases; predictions of occurrences of dislocation clusters at 10 upper wafer or 20 upper wafer. Results present that values of accuracies and values of Youden's J are not so high, but they are higher results than ones of bag of features (visual words) method. For our purpose to find occurrences dislocation clusters in upper wafers from the input wafer, we obtained results that randomly select condition as negative examples is appropriate for 10 upper wafers prediction, since its results are better than other negative examples conditions, consistently.

    Download PDF (3335K)
  • Rintaro YANAGI, Ren TOGO, Takahiro OGAWA, Miki HASEYAMA
    Article type: PAPER
    2021 Volume E104.A Issue 6 Pages 866-875
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: November 30, 2020
    JOURNAL RESTRICTED ACCESS

    Various cross-modal retrieval methods that can retrieve images related to a query sentence without text annotations have been proposed. Although a high level of retrieval performance is achieved by these methods, they have been developed for a single domain retrieval setting. When retrieval candidate images come from various domains, the retrieval performance of these methods might be decreased. To deal with this problem, we propose a new domain adaptive cross-modal retrieval method. By translating a modality and domains of a query and candidate images, our method can retrieve desired images accurately in a different domain retrieval setting. Experimental results for clipart and painting datasets showed that the proposed method has better retrieval performance than that of other conventional and state-of-the-art methods.

    Download PDF (7423K)
  • Shiori YAMAGUCHI, Keita HIRAI, Takahiko HORIUCHI
    Article type: PAPER
    2021 Volume E104.A Issue 6 Pages 876-886
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: January 07, 2021
    JOURNAL FREE ACCESS

    In this study, we present a novel method for removing smoke from videos based on a single image sequence. Smoke is a significant artifact in images or videos because it can reduce the visibility in disaster scenes. Our proposed method for removing smoke involves two main processes: (1) the development of a smoke imaging model and (2) smoke removal using spatio-temporal pixel compensation. First, we model the optical phenomena in natural scenes including smoke, which is called a smoke imaging model. Our smoke imaging model is developed by extending conventional haze imaging models. We then remove the smoke from a video in a frame-by-frame manner based on the smoke imaging model. Next, we refine the appearance of the smoke-free video by spatio-temporal pixel compensation, where we align the smoke-free frames using the corresponding pixels. To obtain the corresponding pixels, we use SIFT and color features with distance constraints. Finally, in order to obtain a clear video, we refine the pixel values based on the spatio-temporal weightings of the corresponding pixels in the smoke-free frames. We used simulated and actual smoke videos in our validation experiments. The experimental results demonstrated that our method can obtain effective smoke removal results from dynamic scenes. We also quantitatively assessed our method based on a temporal coherence measure.

    Download PDF (39236K)
  • Lei YANG, Tingxiao YANG, Hiroki KIMURA, Yuichiro YOSHIMURA, Kumiko ARA ...
    Article type: PAPER
    2021 Volume E104.A Issue 6 Pages 887-896
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: January 18, 2021
    JOURNAL RESTRICTED ACCESS

    In medical fields, detecting traumatic bleedings has always been a difficult task due to the small size, low contrast of targets and large number of images. In this work we propose an automatic traumatic bleeding detection approach from contrast enhanced CT images via deep CNN networks, containing segmentation process and classification process. CT values of DICOM images are extracted and processed via three different window settings first. Small 3D patches are cropped from processed images and segmented by a 3D CNN network. Then segmentation results are converted to point cloud data format and classified by a classifier. The proposed pre-processing approach makes the segmentation network be able to detect small and low contrast targets and achieve a high sensitivity. The additional classification network solves the boundary problem and short-sighted problem generated during the segmentation process to further decrease false positives. The proposed approach is tested with 3 CT cases containing 37 bleeding regions. As a result, a total of 34 bleeding regions are correctly detected, the sensitivity reaches 91.89%. The average false positive number of test cases is 1678. 46.1% of false positive predictions are decreased after being classified. The proposed method is proved to be able to achieve a high sensitivity and be a reference of medical doctors.

    Download PDF (3501K)
  • Miho SHINOHARA, Yukina TAMURA, Shinya MOCHIDUKI, Hiroaki KUDO, Mitsuho ...
    Article type: LETTER
    2021 Volume E104.A Issue 6 Pages 897-901
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: December 15, 2020
    JOURNAL RESTRICTED ACCESS

    We investigated the function in the Lateral Geniculate Nucleus of avoidance behavior due to the inconsistency between binocular retinal images due to blue from vergence eye movement based on avoidance behavior caused by the inconsistency of binocular retinal images when watching the rim of a blue-yellow equiluminance column.

    Download PDF (643K)
  • Miho SHINOHARA, Reiko KOYAMA, Shinya MOCHIDUKI, Mitsuho YAMADA
    Article type: LETTER
    2021 Volume E104.A Issue 6 Pages 902-906
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: December 15, 2020
    JOURNAL RESTRICTED ACCESS

    We paid attention the amount of change for each resolution by specifying the gaze position of images, and measured accommodation and convergence eye movement when watching high-resolution images. Change of convergence angle and accommodation were like the actual depth composition in the image when images were presented in the high-resolution.

    Download PDF (1333K)
  • Misaki SHIKAKURA, Yusuke KAMEDA, Takayuki HAMAMOTO
    Article type: LETTER
    2021 Volume E104.A Issue 6 Pages 907-911
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: January 07, 2021
    JOURNAL FREE ACCESS

    This paper reports the evolution and application potential of image sensors with high-speed brightness gradient sensors. We propose an adaptive exposure time control method using the apparent motion estimated by this sensor, and evaluate results for the change in illuminance and global / local motion.

    Download PDF (1010K)
Regular Section
  • Kazuki MATSUYAMA, Toru TANZAWA
    Article type: PAPER
    Subject area: Circuit Theory
    2021 Volume E104.A Issue 6 Pages 912-926
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: November 24, 2020
    JOURNAL RESTRICTED ACCESS

    This paper formulates minimal word-line (WL) delay time with pre-emphasis pulses to design the pulse width as a function of the overdrive voltage for large memory arrays such as 3D NAND. Circuit theory for a single RC line only with capacitance to ground and that only with coupling capacitance as well as a general case where RC lines have both grounded and coupling capacitance is discussed to provide an optimum pre-emphasis pulse width to minimize the delay time. The theory is expanded to include the cases where the resistance of the RC line driver is not negligibly small. The minimum delay time formulas of a single RC delay line and capacitive coupling RC lines was in good agreement (i.e. within 5% error) with measurement. With this research, circuit designers can estimate an optimum pre-emphasis pulse width and the delay time for an RC line in the initial design phase.

    Download PDF (2394K)
  • Kwangjin JEONG, Masahiro YUKAWA
    Article type: PAPER
    Subject area: Algorithms and Data Structures
    2021 Volume E104.A Issue 6 Pages 927-939
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: December 11, 2020
    JOURNAL RESTRICTED ACCESS

    Multikernel adaptive filtering is an attractive nonlinear approach to online estimation/tracking tasks. Despite its potential advantages over its single-kernel counterpart, a use of inappropriately weighted kernels may result in a negligible performance gain. In this paper, we propose an efficient recursive kernel weighting technique for multikernel adaptive filtering to activate all the kernels. The proposed weights equalize the convergence rates of all the corresponding partial coefficient errors. The proposed weights are implemented via a certain metric design based on the weighting matrix. Numerical examples show, for synthetic and multiple real datasets, that the proposed technique exhibits a better performance than the manually-tuned kernel weights, and that it significantly outperforms the online multiple kernel regression algorithm.

    Download PDF (2130K)
  • Kazuhiro SATO, Shun-ichi AZUMA
    Article type: PAPER
    Subject area: Mathematical Systems Science
    2021 Volume E104.A Issue 6 Pages 940-948
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: December 01, 2020
    JOURNAL FREE ACCESS

    We address analysis and design problems of aggregate demand response systems composed of various consumers based on controllability to facilitate to design automated demand response machines that are installed into consumers to automatically respond to electricity price changes. To this end, we introduce a controllability index that expresses the worst-case error between the expected total electricity consumption and the electricity supply when the best electricity price is chosen. The analysis problem using the index considers how to maximize the controllability of the whole consumer group when the consumption characteristic of each consumer is not fixed. In contrast, the design problem considers the whole consumer group when the consumption characteristics of a part of the group are fixed. By solving the analysis problem, we first clarify how the controllability, average consumption characteristics of all consumers, and the number of selectable electricity prices are related. In particular, the minimum value of the controllability index is determined by the number of selectable electricity prices. Next, we prove that the design problem can be solved by a simple linear optimization. Numerical experiments demonstrate that our results are able to increase the controllability of the overall consumer group.

    Download PDF (1342K)
  • Zhen LI, Baojun ZHAO, Wenzheng WANG, Baoxian WANG
    Article type: LETTER
    Subject area: Image
    2021 Volume E104.A Issue 6 Pages 949-953
    Published: June 01, 2021
    Released on J-STAGE: June 01, 2021
    Advance online publication: December 01, 2020
    JOURNAL RESTRICTED ACCESS

    Hyperspectral images (HSIs) are generally susceptible to various noise, such as Gaussian and stripe noise. Recently, numerous denoising algorithms have been proposed to recover the HSIs. However, those approaches cannot use spectral information efficiently and suffer from the weakness of stripe noise removal. Here, we propose a tensor decomposition method with two different constraints to remove the mixed noise from HSIs. For a HSI cube, we first employ the tensor singular value decomposition (t-SVD) to effectively preserve the low-rank information of HSIs. Considering the continuity property of HSIs spectra, we design a simple smoothness constraint by using Tikhonov regularization for tensor decomposition to enhance the denoising performance. Moreover, we also design a new unidirectional total variation (TV) constraint to filter the stripe noise from HSIs. This strategy will achieve better performance for preserving images details than original TV models. The developed method is evaluated on both synthetic and real noisy HSIs, and shows the favorable results.

    Download PDF (2151K)
feedback
Top