Nonlinear Theory and Its Applications, IEICE
Online ISSN : 2185-4106
ISSN-L : 2185-4106
Special Section on Nonlinear Science Workshop on the Journal
addHessian: Combining quasi-Newton method with first-order method for neural network training
Sota YasudaS. IndrapriyadarsiniHiroshi NinomiyaTakeshi KamioHideki Asai
Author information
JOURNAL FREE ACCESS

2022 Volume 13 Issue 2 Pages 361-366

Details
Abstract

First-order methods such as SGD and Adam are popularly used in training Neural networks. On the other hand, second-order methods have shown to have better performance and faster convergence despite their high computational cost by incorporating the curvature information. While second-order methods determine the step size by line search approaches, first-order methods achieve efficient learning by devising a way to adjust the step size. In this paper, we propose a new learning algorithm for training neural networks by combining first-order and second-order methods. We investigate the effectiveness of our proposed method when combined with popular first-order methods - SGD, Adagrad, and Adam, through experiments using image classification problems.

Content from these authors
© 2022 The Institute of Electronics, Information and Communication Engineers
Previous article Next article
feedback
Top