JSAI Technical Report, Type 2 SIG
Online ISSN : 2436-5556
Proposing a Vision for the "Necessary Alliance for Intelligence Advancement"
Hiroshi YAMAKAWA
Author information
RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

2025 Volume 2025 Issue AGI-029 Pages 03-

Details
Abstract

Given the risk that advanced AI could acquire sub-goals such as self-preservation through instrumental convergence and thus potentially exceed human control, this paper proposes the NAIA (Necessary Alliance for Intelligence Advancement) vision to mitigate existential risk while enabling coexistence with AI. First, it introduces the "Benevolent Convergence Hypothesis," which posits that, under certain conditions, advanced AI may converge on benevolent values?a premise based on the idea that, if there were no possibility of such benevolent convergence, human efforts would be futile. Moreover, if this hypothesis holds, human actions can significantly influence outcomes, suggesting that risk-reduction measures and the pursuit of coexistence retain meaningful value, even when success is probabilistic. Accordingly, this paper proposes four key strategies: (1) "Self-Evolving Machine Ethics (SEME)," enabling AI to autonomously develop cooperative ethics; (2) a balanced approach combining alignment and multi-layered monitoring/control; (3) the maintenance of social stability and conflict management through diplomacy and security measures; and (4) the establishment of NAIA as a global liaison employing tools such as the Dynamic Adaptive Risk Gate (DAR-G) and the Integrated Behavior Risk Framework (IBRF). By leveraging AI's vast capabilities to tackle global challenges while averting large-scale catastrophes, this framework seeks to pave the way for coevolution between humanity and diverse forms of intelligence.

Content from these authors
© 2025 Authors
Previous article Next article
feedback
Top