オンライン環境において公平な資源配分を実現するアルゴリズムに関する研究

山田 博瑛; 小宮山 純平; 阿部 拳之; 岩﨑 敦

doi:10.11517/pjsai.JSAI2023.0_2T4GS503

Abstract

This work addresses learning online fair division, wherein the values of items that arrive sequentially are not directly observable, but instead the noisy, estimated values are observable when we assign the items. We consider the problem as computing market equilibria in linear Fisher markets where agents have additive utilities. In such markets, the static allocation simultaneously achieves envy-freeness and Pareto optimality by maximizing Nash social welfare, assuming that items are divisible or can be allocated randomly. However, this fact is no longer valid in online settings. To this end, we have developed online algorithms that combine dual averaging with multi-armed bandit indices. Through dual averaging, our algorithms gradually learn the values of arriving items via bandit feedback. As a result, the algorithms asymptotically achieve the optimal Nash social welfare. We also empirically demonstrate the superior performance of the proposed algorithms in synthetic and empirical datasets.

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!