Statistical Mechanics of On-line Node-perturbation Learning

Kazuyuki Hara; Kentaro Katahira; Kazuo Okanoya; Masato Okada

doi:10.11185/imt.6.352

抄録

Node-perturbation learning (NP-learning) is a kind of statistical gradient descent algorithm that estimates the gradient of an objective function through application of a small perturbation to the outputs of the network. It can be applied to problems where the objective function is not explicitly formulated, including reinforcement learning. In this paper, we show that node-perturbation learning can be formulated as on-line learning in a linear perceptron with noise, and we can derive the differential equations of order parameters and the generalization error in the same way as for the analysis of learning in a linear perceptron through statistical mechanical methods. From analytical results, we show that cross-talk noise, which originates in the error of the other outputs, increases the generalization error as the output number increases.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

ウナギの病原菌はどの器官から分離されやすいか
制振材料に関する一般知識と材料データ
Nonlinear output voltage control of boost converters based on a model error compensator
Improvement of Shark Type I Collagen with Microbial Transglutaminase in Urea
Acquired Fanconi Syndrome with Proximal Tubular Cytoplasmic Fibrillary Inclusions of λ Light Chain Restriction

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）