IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

Special Section on Smart Multimedia & Communication Systems

FOREWORD

Hiroshi OCHI, Masayuki KUROSAKI

2022Volume E105.AIssue 4 Pages 611-612
Published: April 01, 2022
Released on J-STAGE: April 01, 2022

DOIhttps://doi.org/10.1587/transfun.2021SMF0001

JOURNAL FREE ACCESS

Download PDF (482K)
Enabling a MAC Protocol with Self-Localization Function to Solve Hidden and Exposed Terminal Problems in Wireless Ad Hoc Networks

Chongchong GU, Haoyang XU, Nan YAO, Shengming JIANG, Zhichao ZHENG, Ru ...

Article type: PAPER
Subject area: Mobile Information Network and Personal Communications
2022Volume E105.AIssue 4 Pages 613-621
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 19, 2021

DOIhttps://doi.org/10.1587/transfun.2021SMP0004

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

In a wireless ad hoc network (MANET), nodes can form a centerless, self-organizing, multi-hop dynamic network without any centralized control function, while hidden and exposed terminals seriously affect the network performance. Meanwhile, the wireless network node is evolving from single communication function to jointly providing a self precise positioning function, especially in indoor environments where GPS cannot work well. However, the existing medium access control (MAC) protocols only deal with collision control for data transmission without positioning function. In this paper, we propose a MAC protocol based on 802.11 standard to enable a node with self-positioning function, which is further used to solve hidden and exposed terminal problems. The location of a network node is obtained through exchanging of MAC frames jointly using a time of arrival (TOA) algorithm. Then, nodes use the location information to calculate the interference range, and judge whether they can transmit concurrently. Simulation shows that the positioning function of the proposed MAC protocol works well, and the corresponding MAC protocol can achieve higher throughput, lower average end-to-end delay and lower packet loss rate than that without self-localization function.

View full abstract

Download PDF (2974K)
Codebook Design for Vertical Beamforming by Multi-Cell Coordination

Kenji HOSHINO, Teruya FUJII

Article type: PAPER
Subject area: Digital Signal Processing, Mobile Information Network and Personal Communications
2022Volume E105.AIssue 4 Pages 622-630
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 11, 2021

DOIhttps://doi.org/10.1587/transfun.2021SMP0017

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

Fifth-generation (5G) mobile communication systems employ beamforming technology using massive multiple-input and multiple-output (MIMO) to improve the reception quality and spectrum efficiency within a cell. Meanwhile, coordinated beamforming among multiple base stations is an effective approach to improving the spectrum efficiency at the cell edges, in which massive MIMO is deployed at geographically distant base stations and beamforming control is conducted in a cooperative manner. Codebook-based beamforming is a method for realizing multi-cell coordinated beamforming, in which each base station selects one of multiple beams that are predefined in a codebook. In codebook-based beamforming, it is important to design an efficient codebook that takes into account the beam allocation and the number of beams. In general, the larger the number of beams defined in a codebook, the more finely tuned the beam control can be and a greater improvement in spectrum efficiency can be expected. However, it requires a huge signal processing to optimize the beam combinations with a large number of beams by coordinated beamforming. This paper proposes a novel codebook design that efficiently assigns beam directions and widths in a vertical plane. Computer simulations showed that the proposed codebook performs as well as the conventional method while requiring fewer beam combinations.

View full abstract

Download PDF (1410K)
Deep Gaussian Denoising Network Based on Morphological Operators with Low-Precision Arithmetic

Hikaru FUJISAKI, Makoto NAKASHIZUKA

Article type: PAPER
Subject area: Image, Digital Signal Processing
2022Volume E105.AIssue 4 Pages 631-638
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: November 08, 2021

DOIhttps://doi.org/10.1587/transfun.2021SMP0008

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

This paper presents a deep network based on morphological filters for Gaussian denoising. The morphological filters can be applied with only addition, max, and min functions and require few computational resources. Therefore, the proposed network is suitable for implementation using a small microprocessor. Each layer of the proposed network consists of a top-hat transform, which extracts small peaks and valleys of noise components from the input image. Noise components are iteratively reduced in each layer by subtracting the noise components from the input image. In this paper, the extensions of opening and closing are introduced as linear combinations of the morphological filters for the top-hat transform of this deep network. Multiplications are only required for the linear combination of the morphological filters in the proposed network. Because almost all parameters of the network are structuring elements of the morphological filters, the feature maps and parameters can be represented in short bit-length integer form, which is suitable for implementation with single instructions, multiple data (SIMD) instructions. Denoising examples show that the proposed network obtains denoising results comparable to those of BM3D [1] without linear convolutions and with approximately one tenth the number of parameters of a full-scale deep convolutional neural network [2]. Moreover, the computational time of the proposed method using SIMD instructions of a microprocessor is also presented.

View full abstract

Download PDF (1867K)
A Method for Generating Color Palettes with Deep Neural Networks Considering Human Perception

Beiying LIU, Kaoru ARAKAWA

Article type: PAPER
Subject area: Image, Vision, Neural Networks and Bioengineering
2022Volume E105.AIssue 4 Pages 639-646
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: September 30, 2021

DOIhttps://doi.org/10.1587/transfun.2021SMP0011

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

A method to generate color palettes from images is proposed. Here, deep neural networks (DNN) are utilized in order to consider human perception. Two aspects of human perception are considered; one is attention to image, and the other is human preference for colors. This method first extracts N regions with dominant color categories from the image considering human attention. Here, N is the number of colors in a color palette. Then, the representative color is obtained from each region considering the human preference for color. Two deep neural-net systems are adopted here, one is for estimating the image area which attracts human attention, and the other is for estimating human preferable colors from image regions to obtain representative colors. The former is trained with target images obtained by an eye tracker, and the latter is trained with dataset of color selection by human. Objective and subjective evaluation is performed to show high performance of the proposed system compared with conventional methods.

View full abstract

Download PDF (2418K)
Vector Quantization of Speech Spectrum Based on the VQ-VAE Embedding Space Learning by GAN Technique

Tanasan SRIKOTR, Kazunori MANO

Article type: PAPER
Subject area: Speech and Hearing, Digital Signal Processing
2022Volume E105.AIssue 4 Pages 647-654
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: September 30, 2021

DOIhttps://doi.org/10.1587/transfun.2021SMP0018

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

The spectral envelope parameter is a significant speech parameter in the vocoder's quality. Recently, the Vector Quantized Variational AutoEncoder (VQ-VAE) is a state-of-the-art end-to-end quantization method based on the deep learning model. This paper proposed a new technique for improving the embedding space learning of VQ-VAE with the Generative Adversarial Network for quantizing the spectral envelope parameter, called VQ-VAE-EMGAN. In experiments, we designed the quantizer for the spectral envelope parameters of the WORLD vocoder extracted from the 16kHz speech waveform. As the results shown, the proposed technique reduced the Log Spectral Distortion (LSD) around 0.5dB and increased the PESQ by around 0.17 on average for four target bit operations compared to the conventional VQ-VAE.

View full abstract

Download PDF (2802K)
A Data Augmentation Method for Cow Behavior Estimation Systems Using 3-Axis Acceleration Data and Neural Network Technology

Chao LI, Korkut Kaan TOKGOZ, Ayuka OKUMURA, Jim BARTELS, Kazuhiro TODA ...

Article type: PAPER
Subject area: Neural Networks and Bioengineering
2022Volume E105.AIssue 4 Pages 655-663
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: September 30, 2021

DOIhttps://doi.org/10.1587/transfun.2021SMP0003

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

Cow behavior monitoring is critical for understanding the current state of cow welfare and developing an effective planning strategy for pasture management, such as early detection of disease and estrus. One of the most powerful and cost-effective methods is a neural-network-based monitoring system that analyzes time series data from inertial sensors attached to cows. For this method, a significant challenge is to improve the quality and quantity of teaching data in the development of neural network models, which requires us to collect data that can cover various realistic conditions and assign labels to them. As a result, the cost of data collection is significantly high. This work proposes a data augmentation method to solve two major quality problems in the collection process of teaching data. One is the difficulty and randomicity of teaching data acquisition and the other is the sensor position changes during actual operation. The proposed method can computationally emulate different rotating states of the collar-type sensor device from the measured acceleration data. Furthermore, it generates data for actions that occur less frequently. The verification results showed significantly higher estimation performance with an average accuracy of over 98% for five main behaviors (feeding, walking, drinking, rumination, and resting) based on learning with long short-term memory (LSTM) network. Compared with the estimation performance without data augmentation, which was insufficient with a minimum of 60.48%, the recognition rate was improved by 2.52-37.05pt for various behaviors. In addition, comparison of different rotation intervals was investigated and a 30-degree increment was selected based on the accuracy performances analysis. In conclusion, the proposed data expansion method can improve the accuracy in cow behavior estimation by a neural network model. Moreover, it contributes to a significant reduction of the teaching data collection cost for machine learning and opens many opportunities for new research.

View full abstract

Download PDF (2814K)

Regular Section

Artificial Bandwidth Extension for Lower Bandwidth Using Sinusoidal Synthesis based on First Formant Location

Yuya HOSODA, Arata KAWAMURA, Youji IIGUNI

Article type: PAPER
Subject area: Engineering Acoustics
2022Volume E105.AIssue 4 Pages 664-672
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 12, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAP1044

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

The narrow bandwidth limitation of 300-3400Hz on the public switching telephone network results in speech quality deterioration. In this paper, we propose an artificial bandwidth extension approach that reconstructs the missing lower bandwidth of 50-300Hz using sinusoidal synthesis based on the first formant location. Sinusoidal synthesis generates sinusoidal waves with a harmonic structure. The proposed method detects the fundamental frequency using an autocorrelation method based on YIN algorithm, where a threshold processing avoids the false fundamental frequency detection on unvoiced sounds. The amplitude of the sinusoidal waves is calculated in the time domain from the weighted energy of 300-600Hz. In this case, since the first formant location corresponds to the first peak of the spectral envelope, we reconstruct the harmonic structure to avoid attenuating and overemphasizing by increasing the weight when the first formant location is lower, and vice versa. Consequently, the subjective and objective evaluations show that the proposed method reduces the speech quality difference between the original speech signal and the bandwidth extended speech signal.

View full abstract

Download PDF (3868K)
An Algorithm for Single Snapshot 2D-DOA Estimation Based on a Three-Parallel Linear Array Model

Shiwen LIN, Yawen ZHOU, Weiqin ZOU, Huaguo ZHANG, Lin GAO, Hongshu LIA ...

Article type: PAPER
Subject area: Digital Signal Processing
2022Volume E105.AIssue 4 Pages 673-681
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 05, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAP1074

JOURNAL FREE ACCESS

Show abstractHide abstract

Estimating the spatial parameters of the signals by using the effective data of a single snapshot is essential in the field of reconnaissance and confrontation. Major drawback of existing algorithms is that its constructed covariance matrix has a great degree of rank loss. The performance of existing algorithms gets degraded with low signal-to-noise ratio. In this paper, a three-parallel linear array based algorithm is proposed to achieve two-dimensional direction of arrival estimates in a single snapshot scenario. The key points of the proposed algorithm are: 1) construct three pseudo matrices with full rank and no rank loss by using the single snapshot data from the received signal model; 2) by using the rotation relation between pseudo matrices, the matched 2D-DOA is obtained with an efficient parameter matching method. Main objective of this work is on improving the angle estimation accuracy and reducing the loss of degree of freedom in single snapshot 2D-DOA estimation.

View full abstract

Download PDF (1390K)
Stability Analysis and Control of Decision-Making of Miners in Blockchain

Kosuke TODA, Naomi KUZE, Toshimitsu USHIO

Article type: PAPER
Subject area: Nonlinear Problems
2022Volume E105.AIssue 4 Pages 682-688
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 01, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAP1099

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

To maintain blockchain-based services with ensuring its security, it is an important issue how to decide a mining reward so that the number of miners participating in the mining increases. We propose a dynamical model of decision-making for miners using an evolutionary game approach and analyze the stability of equilibrium points of the proposed model. The proposed model is described by the 1st-order differential equation. So, it is simple but its theoretical analysis gives an insight into the characteristics of the decision-making. Through the analysis of the equilibrium points, we show the transcritical bifurcations and hysteresis phenomena of the equilibrium points. We also design a controller that determines the mining reward based on the number of participating miners to stabilize the state where all miners participate in the mining. Numerical simulation shows that there is a trade-off in the choice of the design parameters.

View full abstract

Download PDF (1274K)
Construction of Two Classes of Minimal Binary Linear Codes Based on Boolean Function

Jiawei DU, Xiaoni DU, Wengang JIN, Yingzhong ZHANG

Article type: PAPER
Subject area: Cryptography and Information Security
2022Volume E105.AIssue 4 Pages 689-693
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: September 30, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAP1062

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

Linear codes with a few-weight have important applications in combinatorial design, strongly regular graphs and cryptography. In this paper, we first construct a class of Boolean functions with at most five-valued Walsh spectra, and determine their spectrum distribution. Then, we derive two classes of linear codes with at most six-weight from the new functions. Meanwhile, the length, dimension and weight distributions of the codes are obtained. Results show that both of the new codes are minimal and among them, one is wide minimal code and the other is a narrow minimal code and thus can be used to design secret sharing scheme with good access structures. Finally, some Magma programs are used to verify the correctness of our results.

View full abstract

Download PDF (1502K)
An Efficient ARQ Scheme under IEEE 802.11ac Error Channel

Xueyan LI, Peng CHENG, Bin WU

Article type: PAPER
Subject area: Mobile Information Network and Personal Communications
2022Volume E105.AIssue 4 Pages 694-703
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 04, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAP1054

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

In this paper, an automatic retransmission request (ARQ) scheme for IEEE 802.11ac is presented, which can solve the problem of severe packet loss and greatly improve the performance in error-prone environments. The proposed solution only requires to be deployed on the sender and is compatible with the 802.11 protocol. The algorithm utilizes the basic strategy of sliding retransmission and then adds the method of copying frames. The media access control (MAC) protocol data unit (MPDU) lost in the transmission and the newly added data frame brought by the sliding window change are replicated. The scheme retransmits the duplicated aggregated packet and further improves the throughput by increasing the probability of successful transmission of sub-frames. Besides, we also establish a mathematical model to analyze the performance of the proposed scheme. We introduce the concept of average aggregated sub-frames and express the sliding retransmission strategy as the aggregated transmission of average aggregated sub-frames, thereby simplifying the model and effectively analyzing the theoretical throughput of the proposed algorithm. The simulation results of Network simulator 3 (NS-3) simulation results demonstrate that the performance of the proposed algorithm is better than the traditional sliding retransmission ARQ algorithm in error-prone channels with a higher physical layer rate.

View full abstract

Download PDF (859K)
Image Super-Resolution via Generative Adversarial Networks Using Metric Projections onto Consistent Sets for Low-Resolution Inputs

Hiroya YAMAMOTO, Daichi KITAHARA, Hiroki KURODA, Akira HIRABAYASHI

Article type: PAPER
Subject area: Image
2022Volume E105.AIssue 4 Pages 704-718
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: September 29, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAP1038

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

This paper addresses single image super-resolution (SR) based on convolutional neural networks (CNNs). It is known that recovery of high-frequency components in output SR images of CNNs learned by the least square errors or least absolute errors is insufficient. To generate realistic high-frequency components, SR methods using generative adversarial networks (GANs), composed of one generator and one discriminator, are developed. However, when the generator tries to induce the discriminator's misjudgment, not only realistic high-frequency components but also some artifacts are generated, and objective indices such as PSNR decrease. To reduce the artifacts in the GAN-based SR methods, we consider the set of all SR images whose square errors between downscaling results and the input image are within a certain range, and propose to apply the metric projection onto this consistent set in the output layers of the generators. The proposed technique guarantees the consistency between output SR images and input images, and the generators with the proposed projection can generate high-frequency components with few artifacts while keeping low-frequency ones as appropriate for the known noise level. Numerical experiments show that the proposed technique reduces artifacts included in the original SR images of a GAN-based SR method while generating realistic high-frequency components with better PSNR values in both noise-free and noisy situations. Since the proposed technique can be integrated into various generators if the downscaling process is known, we can give the consistency to existing methods with the input images without degrading other SR performance.

View full abstract

Download PDF (6904K)
A Deep Q-Network Based Intelligent Decision-Making Approach for Cognitive Radar

Yong TIAN, Peng WANG, Xinyue HOU, Junpeng YU, Xiaoyan PENG, Hongshu LI ...

Article type: PAPER
Subject area: Neural Networks and Bioengineering
2022Volume E105.AIssue 4 Pages 719-726
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 15, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAP1072

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

The electromagnetic environment is increasingly complex and changeable, and radar needs to meet the execution requirements of various tasks. Modern radars should improve their intelligence level and have the ability to learn independently in dynamic countermeasures. It can make the radar countermeasure strategy change from the traditional fixed anti-interference strategy to dynamically and independently implementing an efficient anti-interference strategy. Aiming at the performance optimization of target tracking in the scene where multiple signals coexist, we propose a countermeasure method of cognitive radar based on a deep Q-learning network. In this paper, we analyze the tracking performance of this method and the Markov Decision Process under the triangular frequency sweeping interference, respectively. The simulation results show that reinforcement learning has substantial autonomy and adaptability for solving such problems.

View full abstract

Download PDF (2900K)
Near-Field Beamforming in Time Modulated Arrays

Yue MA, Chen MIAO, Yuehua LI, Wen WU

Article type: LETTER
Subject area: Digital Signal Processing
2022Volume E105.AIssue 4 Pages 727-729
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 11, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAL2073

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

Near-field beamforming has played an important role in many scenarios such as radar imaging and acoustic detection. In this paper, the near-field beamforming is implemented in the time modulated array with the harmonic. The beam pattern with a low sidelobe level in precise position is achieved by controlling the switching sequence in time modulated cross array. Numerical results verify the correctness of the proposed method.

View full abstract

Download PDF (420K)
Sea Clutter Image Segmentation Method of High Frequency Surface Wave Radar Based on the Improved Deeplab Network

Haotian CHEN, Sukhoon LEE, Di YAO, Dongwon JEONG

Article type: LETTER
Subject area: Digital Signal Processing
2022Volume E105.AIssue 4 Pages 730-733
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 12, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAL2057

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

High Frequency Surface Wave Radar (HFSWR) can achieve over-the-horizon detection, which can effectively detect and track the ships and ultra-low altitude aircrafts, as well as the acquisition of sea state information such as icebergs and ocean currents and so on. However, HFSWR is seriously affected by the clutters, especially sea clutter and ionospheric clutter. In this paper, we propose a deep learning image semantic segmentation method based on optimized Deeplabv3+ network to achieve the automatic detection of sea clutter and ionospheric clutter using the measured R-D spectrum images of HFSWR during the typhoon as experimental data, which avoids the disadvantage of traditional detection methods that require a large amount of a priori knowledge and provides a basis for subsequent the clutter suppression or the clutter characteristics research.

View full abstract

Download PDF (1623K)
Interleaved Sequences with Anti-Doppler Properties

Xi CAO, Yang YANG, Rong LUO

Article type: LETTER
Subject area: Coding Theory
2022Volume E105.AIssue 4 Pages 734-738
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 05, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAL2065

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

In this letter, we discuss the ambiguity function of interleaved sequences. Furthermore, using the Guassian sum and choosing binary m-sequences as column sequences, we investigate the property of a binary sequence set given by Zhou, Tang, Gong (IEEE Trans. Inf. Theory, 54(9), 2008), which has low ambiguity property in a large region. Those sequences could be used in radar systems.

View full abstract

Download PDF (352K)
Scaling Law of Energy Efficiency in Intelligent Reflecting Surface Enabled Internet of Things Networks

Juan ZHAO, Wei-Ping ZHU

Article type: LETTER
Subject area: Communication Theory and Signals
2022Volume E105.AIssue 4 Pages 739-742
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: September 29, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAL2061

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

The energy efficiency of intelligent reflecting surface (IRS) enabled internet of things (IoT) networks is studied in this letter. The energy efficiency is mathematically expressed, respectively, as the number of reflecting elements and the spectral efficiency of the network and is shown to scale in the logarithm of the reflecting elements number in the high regime of transmit power from source node. Furthermore, it is revealed that the energy efficiency scales linearly over the spectral efficiency in the high regime of transmit power, in contrast to conventional studies on energy and spectral efficiency trade-offs in the non-IRS wireless IoT networks. Numerical simulations are carried out to verify the derived results for the IRS enabled IoT networks.

View full abstract

Download PDF (265K)
Image Quality Improvement for Capsule Endoscopy Based on Compressed Sensing with K-SVD Dictionary Learning

Yuuki HARADA, Daisuke KANEMOTO, Takahiro INOUE, Osamu MAIDA, Tetsuya H ...

Article type: LETTER
Subject area: Image
2022Volume E105.AIssue 4 Pages 743-747
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 01, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAL2033

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

Reducing the power consumption of capsule endoscopy is essential for its further development. We introduce K-SVD dictionary learning to design a dictionary for sparse coding, and improve reconstruction accuracy of capsule endoscopic images captured using compressed sensing. At a compression ratio of 20%, the proposed method improves image quality by approximately 4.4 dB for the peak signal-to-noise ratio.

View full abstract

Download PDF (3032K)
Face Super-Resolution via Triple-Attention Feature Fusion Network

Kanghui ZHAO, Tao LU, Yanduo ZHANG, Yu WANG, Yuanzhi WANG

Article type: LETTER
Subject area: Image
2022Volume E105.AIssue 4 Pages 748-752
Published: April 01, 2022
Released on J-STAGE: April 01, 2022
Advance online publication: October 13, 2021

DOIhttps://doi.org/10.1587/transfun.2021EAL2056

JOURNAL RESTRICTED ACCESS

Show abstractHide abstract

In recent years, compared with the traditional face super-resolution (SR) algorithm, the face SR based on deep neural network has shown strong performance. Among these methods, attention mechanism has been widely used in face SR because of its strong feature expression ability. However, the existing attention-based face SR methods can not fully mine the missing pixel information of low-resolution (LR) face images (structural prior). And they only consider a single attention mechanism to take advantage of the structure of the face. The use of multi-attention could help to enhance feature representation. In order to solve this problem, we first propose a new pixel attention mechanism, which can recover the structural details of lost pixels. Then, we design an attention fusion module to better integrate the different characteristics of triple attention. Experimental results on FFHQ data sets show that this method is superior to the existing face SR methods based on deep neural network.

View full abstract

Download PDF (1268K)

Register with J-STAGE for free!