Nonlinear Theory and Its Applications, IEICE
Online ISSN : 2185-4106
ISSN-L : 2185-4106
Special Section on Recent Progress in Nonlinear Theory and Its Applications
Robust Visual Place Recognition via modern Hopfield networks and foundation models
Takanori HashimotoTeijiro IsokawaMasaki KobayashiNaotake Kamiura
Author information
JOURNAL OPEN ACCESS

2026 Volume 17 Issue 3 Pages 932-944

Details
Abstract

Visual Place Recognition (VPR) under severe environmental changes remains a fundamental challenge for autonomous roboticsin real-world environments. This task can be interpreted as associative memory retrieval from noisy queries, but classical models suffer from limited capacity and sensitivity to pixel-level variations. We address this by integrating Modern Hopfield Networks with DINOv3, a self-supervised Vision Transformer that provides robust semantic representations. The primary aim of this study is not to maximize VPR accuracy itself, but to investigate whether an energy-based associative memory can be realized on the latent space of a foundation model, using VPR as a challenging real-world testbed. Place recognition is formulated as energy minimization in a semantic latent space, where stored scenes act as attractors. Experiments on the Transient Attributes Database across four seasons show that the proposed method significantly outperforms pixel-based baselines, even under extreme domain shifts. We further analyze the retrieval dynamics and the effect of the inverse temperature parameter β on attractor stability.

Content from these authors
© 2026 The Institute of Electronics, Information and Communication Engineers

This article is licensed under a Creative Commons [Attribution-NonCommercial-NoDerivatives 4.0 International] license.
https://creativecommons.org/licenses/by-nc-nd/4.0/
Previous article Next article
feedback
Top