Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 4M1-GS-10-04
Conference information

AI alignment language: Align
Towards building an AI alignment paradigm
*Yoshi TAMORIShun YOSHIZAWAKen MOGI
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

AI Alignment is a field of research that aims to make AI operate in accordance with human ethics, values and goals. We are developing a programming language, or 'alignment language', for designing AI to operate according to specific goals and ethics. This alignment language provides specific rules and structures for aligning AI behaviour and decision criteria with human ethics and goals; AI developers can use it to clearly define AI goals and behaviours, and minimise the risk of AI acting contrary to human intentions. The language can also be used to design prompts for AI to acquire the ability to adapt to its environment and situation. We are currently in the process of designing and implementing this language and are facing several challenges. For example, how to incorporate the diversity of human ethics and values into the AI, how flexible the AI's decision criteria should be, and how the AI should respond to unknown situations. The talk will present and discuss the structure of an alignment language for designing alignments that address these issues.

Content from these authors
© 2024 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top