人工知能学会第二種研究会資料

Embedding Cognitive Map in Neural Episodic Control

松森匠哉, 妹尾卓磨, 菊池俊基, 滝本佑介, 大澤正彦, 今井倫太

原稿種別: 研究会資料
2017 年 2017 巻 AGI-007 号 p. 01-
発行日: 2017/11/21
公開日: 2021/09/16

DOIhttps://doi.org/10.11517/jsaisigtwo.2017.AGI-007_01

研究報告書・技術報告書フリー

抄録を表示する抄録を非表示にする

In reinforcement learning, environments with a sparse reward signal are significantly difficult to model. Especially, learning actions in 3D environment from the first person view is regarded as POMDP which potentially extends state space. Large environments with a sparse reward need efficient learning process in large state space. In this paper, we propose a deep reinforcement learning method with the memory module proposed in Neural Episodic Control, adding cognitive information to the memory module to improve performance.

抄録全体を表示

PDF形式でダウンロード (1575K)
方策最適化による強化学習を用いた人型ロボットの動作学習の実験

疋田聡

原稿種別: 研究会資料
2017 年 2017 巻 AGI-007 号 p. 02-
発行日: 2017/11/23
公開日: 2021/09/16

DOIhttps://doi.org/10.11517/jsaisigtwo.2017.AGI-007_02

研究報告書・技術報告書フリー

抄録を表示する抄録を非表示にする

Experiments on reinforcement learning were conducted on games on OpenAI Gym and robot simulators using "Proximal Policy Optimization Algorithms", which is considered to be suitable for motion learning of humanoid robots. As a result, it was confirmed that reinforcement learning is possible by the program of the algorithm published from OpenAI. Moreover, we confirmed that the operation on the robot simulator can be operated with real robot by the experimental experiment with real robot.

抄録全体を表示

PDF形式でダウンロード (373K)
General AI Challenge を参考にしたIoT エコシステムのための Agile 開発

礼王懐成

原稿種別: 研究会資料
2017 年 2017 巻 AGI-007 号 p. 03-
発行日: 2017/11/23
公開日: 2021/09/16

DOIhttps://doi.org/10.11517/jsaisigtwo.2017.AGI-007_03

研究報告書・技術報告書フリー

抄録を表示する抄録を非表示にする

I propose the new frame-work of the cooperation between AGI research and IoT/AI developer for IoT echo-system combining the state-of-art technology such as SemanticWeb of Things, Machine Learning platform and cognition model referencing Good AI AGI roadmap. And also propose the monetization of the IoT echo-system.

抄録全体を表示

PDF形式でダウンロード (1774K)
AGI への道程：2017 年版

中川裕志

原稿種別: 研究会資料
2017 年 2017 巻 AGI-007 号 p. 04-
発行日: 2017/11/23
公開日: 2021/09/16

DOIhttps://doi.org/10.11517/jsaisigtwo.2017.AGI-007_04

研究報告書・技術報告書フリー

抄録を表示する抄録を非表示にする

汎用人工知能(AGI)が実現する時期については諸説があり、例えばカーツワイルは2045年と想定している。その真偽はさておき、現在までに明らかになっている情報からAGIにまつわる技術状況を検討することは、AGIへの道程を占うために役立つであろう。この発表では、このような目的に従って、以下の議論を展開する。(1)目的特化型の人工知能を組み合わせてAGIを作ることの困難さはすでにボストロムに指摘されているが、ここでは多種類の目的特化型人工知能の集合から解決すべき目標に適した目的特化型人工知能を探索することの困難さについて述べる。(2)Brooksのサブサンプション・アーキテクチャを基礎として、AGIに関係の深い自意識を実現する可能性の候補として、最近、知られるようになった長距離ニューロンを組み合わせる方法について考察する。(3)単一のAGIが超知能として他のAGIおよび人類を支配ないし駆逐するような人工知能脅威論は的外れであることを論ずる。以上のようにAGIないし超知能が実現しそうもないにもかかわらず、現在の弱い人工知能ですら人間社会においては実質的な脅威になるケースが散見されるという現状の俯瞰をもって、まとめとする。

抄録全体を表示

PDF形式でダウンロード (155K)
ベイジアンネットワークによる複数深層学習器からのデータ適合型学習器選択法

小林秀輔, 白山晋

原稿種別: 研究会資料
2017 年 2017 巻 AGI-007 号 p. 05-
発行日: 2017/11/23
公開日: 2021/09/16

DOIhttps://doi.org/10.11517/jsaisigtwo.2017.AGI-007_05

研究報告書・技術報告書フリー

抄録を表示する抄録を非表示にする

This paper proposes a new method of time series prediction, using mulitiple deep learners and a Baysian network. We firstly suggests two approaches. The former is a method in which explanatory variables of inputs data are nodes of a Bayesian network and are associated with learners. On the other hand, the latter method is a method in which the outputs of all the learners are made to nodes of the Bayesian network and the outputs are integrated. In this paper, the former method will be proposed in detail. Training data is divided into some clusters with K-means clustering and the multiple deep learners are trained, depending on each clusters. A Bayesian network is used to determine which the deep learner is in charge of predicting a time series. Our proposed method is applied to financial time series data, and the predicted results for the return of Nikkei 225 is demonstrated.

抄録全体を表示

PDF形式でダウンロード (405K)
符号分割多重法により勾配推定を行う機械学習アルゴリズムの提案

佐藤功人, 圷弘明, 近藤雄樹

原稿種別: 研究会資料
2017 年 2017 巻 AGI-007 号 p. 06-
発行日: 2017/11/23
公開日: 2021/09/16

DOIhttps://doi.org/10.11517/jsaisigtwo.2017.AGI-007_06

研究報告書・技術報告書フリー

抄録を表示する抄録を非表示にする

Back propagation is widely used for deep learning, however, it requires white box cost functions that is formulated and differentiable. It is difficult for non-experts to build the model for the problem for which the effective cost function is not known. In this report, we propose the gradient estimation method with code-division multiplexing that can calculate gradients of weights in the neural network by using multiple forward propagations. The proposed method enables machine learning for the problem with black box cost functions that cannot be formulated but can calculate cost value. In this report, the proposed method is evaluated on the MNIST problem. Evaluation results shows the proposed method can build the model to recognize MNIST digits and the appropriate lengths of spreading code are small in starting phase and large in finishing phase in learning term.

抄録全体を表示

PDF形式でダウンロード (1238K)
時系列データに内在する文脈構造を動作探索に用いる学習機械

柳川誠介

原稿種別: 研究会資料
2017 年 2017 巻 AGI-007 号 p. 07-
発行日: 2017/11/23
公開日: 2021/09/16

DOIhttps://doi.org/10.11517/jsaisigtwo.2017.AGI-007_07

研究報告書・技術報告書フリー

抄録を表示する抄録を非表示にする

A system that transits from the initial state to the target state is assumed. The process of state transition is represented by time series data. The time series data is not given to the system unlike a program of a computer, but acquired by trial and error. To combine and search time series data, the context structure inherent in time series data is used. For example, even if the details of the time series data leading to the target state at the time of searching can not be determined, the time series data immediately before reaching the target state and the time series data indicating the movement from the initial state are linked at the upper level of the context In other words, if there is an overlap in the tree structure, it becomes a search candidate. It has been announced that the hierarchical structure is inherent in the time series data and that the basic sequence making up the time series data can naturally correspond to the activation area in the neural network.

抄録全体を表示

PDF形式でダウンロード (655K)
人工知能理論について

石原豪人

原稿種別: 研究会資料
2017 年 2017 巻 AGI-007 号 p. 08-
発行日: 2017/11/23
公開日: 2021/09/16

DOIhttps://doi.org/10.11517/jsaisigtwo.2017.AGI-007_08

研究報告書・技術報告書フリー

抄録を表示する抄録を非表示にする

Artificial intelligence is expected as the next form of computer. In this paper theory of artificial intelligence is discussed. It is based on the foundation of mathematics and thus on the necessary and sufficient conditions of intelligence, ethics and safety.

抄録全体を表示

PDF形式でダウンロード (205K)

J-STAGEへの登録はこちら（無料）