人工知能学会第二種研究会資料
Online ISSN : 2436-5556
2017 巻, AGI-007 号
第7回汎用人工知能研究会
選択された号の論文の8件中1~8を表示しています
  • 松森 匠哉, 妹尾 卓磨, 菊池 俊基, 滝本 佑介, 大澤 正彦, 今井 倫太
    原稿種別: 研究会資料
    2017 年 2017 巻 AGI-007 号 p. 01-
    発行日: 2017/11/21
    公開日: 2021/09/16
    研究報告書・技術報告書 フリー

    In reinforcement learning, environments with a sparse reward signal are significantly difficult to model. Especially, learning actions in 3D environment from the first person view is regarded as POMDP which potentially extends state space. Large environments with a sparse reward need efficient learning process in large state space. In this paper, we propose a deep reinforcement learning method with the memory module proposed in Neural Episodic Control, adding cognitive information to the memory module to improve performance.

  • 疋田 聡
    原稿種別: 研究会資料
    2017 年 2017 巻 AGI-007 号 p. 02-
    発行日: 2017/11/23
    公開日: 2021/09/16
    研究報告書・技術報告書 フリー

    Experiments on reinforcement learning were conducted on games on OpenAI Gym and robot simulators using "Proximal Policy Optimization Algorithms", which is considered to be suitable for motion learning of humanoid robots. As a result, it was confirmed that reinforcement learning is possible by the program of the algorithm published from OpenAI. Moreover, we confirmed that the operation on the robot simulator can be operated with real robot by the experimental experiment with real robot.

  • 礼王 懐成
    原稿種別: 研究会資料
    2017 年 2017 巻 AGI-007 号 p. 03-
    発行日: 2017/11/23
    公開日: 2021/09/16
    研究報告書・技術報告書 フリー

    I propose the new frame-work of the cooperation between AGI research and IoT/AI developer for IoT echo-system combining the state-of-art technology such as SemanticWeb of Things, Machine Learning platform and cognition model referencing Good AI AGI roadmap. And also propose the monetization of the IoT echo-system.

  • 中川 裕志
    原稿種別: 研究会資料
    2017 年 2017 巻 AGI-007 号 p. 04-
    発行日: 2017/11/23
    公開日: 2021/09/16
    研究報告書・技術報告書 フリー

    汎用人工知能(AGI)が実現する時期については諸説があり、例えばカーツワイルは2045年と想定している。その真偽はさておき、現在までに明らかになっている情報からAGIにまつわる技術状況を検討することは、AGIへの道程を占うために役立つであろう。この発表では、このような目的に従って、以下の議論を展開する。(1)目的特化型の人工知能を組み合わせてAGIを作ることの困難さはすでにボストロムに指摘されているが、ここでは多種類の目的特化型人工知能の集合から解決すべき目標に適した目的特化型人工知能を探索することの困難さについて述べる。(2)Brooksのサブサンプション・アーキテクチャを基礎として、AGIに関係の深い自意識を実現する可能性の候補として、最近、知られるようになった長距離ニューロンを組み合わせる方法について考察する。(3)単一のAGIが超知能として他のAGIおよび人類を支配ないし駆逐するような人工知能 脅威論は的外れであることを論ずる。以上のようにAGIないし超知能が実現しそうもないにもかかわらず、現在の弱い人工知能ですら人間社会においては実質的な脅威になるケースが散見されるという現状の俯瞰をもって、まとめとする。

  • 小林 秀輔, 白山 晋
    原稿種別: 研究会資料
    2017 年 2017 巻 AGI-007 号 p. 05-
    発行日: 2017/11/23
    公開日: 2021/09/16
    研究報告書・技術報告書 フリー

    This paper proposes a new method of time series prediction, using mulitiple deep learners and a Baysian network. We firstly suggests two approaches. The former is a method in which explanatory variables of inputs data are nodes of a Bayesian network and are associated with learners. On the other hand, the latter method is a method in which the outputs of all the learners are made to nodes of the Bayesian network and the outputs are integrated. In this paper, the former method will be proposed in detail. Training data is divided into some clusters with K-means clustering and the multiple deep learners are trained, depending on each clusters. A Bayesian network is used to determine which the deep learner is in charge of predicting a time series. Our proposed method is applied to financial time series data, and the predicted results for the return of Nikkei 225 is demonstrated.

  • 佐藤 功人, 圷 弘明, 近藤 雄樹
    原稿種別: 研究会資料
    2017 年 2017 巻 AGI-007 号 p. 06-
    発行日: 2017/11/23
    公開日: 2021/09/16
    研究報告書・技術報告書 フリー

    Back propagation is widely used for deep learning, however, it requires white box cost functions that is formulated and differentiable. It is difficult for non-experts to build the model for the problem for which the effective cost function is not known. In this report, we propose the gradient estimation method with code-division multiplexing that can calculate gradients of weights in the neural network by using multiple forward propagations. The proposed method enables machine learning for the problem with black box cost functions that cannot be formulated but can calculate cost value. In this report, the proposed method is evaluated on the MNIST problem. Evaluation results shows the proposed method can build the model to recognize MNIST digits and the appropriate lengths of spreading code are small in starting phase and large in finishing phase in learning term.

  • 柳川 誠介
    原稿種別: 研究会資料
    2017 年 2017 巻 AGI-007 号 p. 07-
    発行日: 2017/11/23
    公開日: 2021/09/16
    研究報告書・技術報告書 フリー

    A system that transits from the initial state to the target state is assumed. The process of state transition is represented by time series data. The time series data is not given to the system unlike a program of a computer, but acquired by trial and error. To combine and search time series data, the context structure inherent in time series data is used. For example, even if the details of the time series data leading to the target state at the time of searching can not be determined, the time series data immediately before reaching the target state and the time series data indicating the movement from the initial state are linked at the upper level of the context In other words, if there is an overlap in the tree structure, it becomes a search candidate. It has been announced that the hierarchical structure is inherent in the time series data and that the basic sequence making up the time series data can naturally correspond to the activation area in the neural network.

  • 石原 豪人
    原稿種別: 研究会資料
    2017 年 2017 巻 AGI-007 号 p. 08-
    発行日: 2017/11/23
    公開日: 2021/09/16
    研究報告書・技術報告書 フリー

    Artificial intelligence is expected as the next form of computer. In this paper theory of artificial intelligence is discussed. It is based on the foundation of mathematics and thus on the necessary and sufficient conditions of intelligence, ethics and safety.

feedback
Top