JSAI Technical Report, Type 2 SIG

Efficient Exploration through Active Learning for Value Function Approximation in Reinforcement Learning

Takayuki AKIYAMA, Hirotaka HACHIYA, Masashi SUGIYAMA

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 01-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_01

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares policy iteration (LSPI) framework allows us to employ statistical active learning methods for linear regression. Then we propose a design method of good sampling policies for efficient exploration, which is particularly useful when the sampling cost of immediate rewards is high. The effectiveness of the proposed method, which we call active policy iteration (API), is demonstrated through simulations with a batting robot.

View full abstract

Download PDF (255K)
Visual Inspection of Precision Instruments by Least-Squares Outlier Detection

Masafumi TAKIMOTO, Masakazu MATSUGU, Masashi SUGIYAMA

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 02-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_02

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

Visual inspection is one of the most important processes in precision instrument factories for screening out products of poor quality. Usually this is carried out by human experts who went through long-term training sessions, so there is a strong demand for automating this process in order to reduce production costs. In this paper, we apply a recently-developed outlier detection method called least-squares outlier detection (LSOD) to this task and demonstrate that inferior products can be successfully detected. LSOD can utilize knowledge of inliers for enhancing outlier detection performance, so it suits well to visual inspection in industries. Furthermore, LSOD is equipped with automatic model selection mechanism and, hence, users do not have to grapple with parameter tuning.

View full abstract

Download PDF (321K)
Extracting phases of ￣nancial markets

Teruko TAKADA

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 03-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_03

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

While the existence of different phases in financial markets such as bull/bear or bubble/non-bubble has been widely recognized among investors, few stylized facts about the different phases has been revealed so far due to diffculty of analyzing tail behavior. In particular, the sudden phase shift from bull to bear markets, or financial bubble burst, damages economies very severely, and it is urgent social requirement to provide early warning system for helping policy makers to alleviate the damage. The aim of this study is to classify financial markets into several phases, and warn the change of the phase at appropriate timing. Phase extraction and the statistical analysis is based on moving densities estimated by adaptive kernel density estimator, that exhibits a good performance for fitting fat-tailed and multi-modal densities which are typical to phenomena involving phase transitions. Based on the analysis of New York Stock Exchange index daily returns from 1966 to 2009, the shape of the estimated return density is classified into four phases by using the detected cyclical pattern of risk-return relationship. Moreover, most of the bubble burst or sudden change in the price trend could be warned several months prior to the price peak point.

View full abstract

Download PDF (597K)
Dimensionality Reduction for Density Ratio Estimation in High-dimensional Spaces

Masashi SUGIYAMA, Motoaki KAWANABE, Pui LINGCHUI

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 04-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_04

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

The ratio of two probability density functions is becoming a quantity of interest these days in the machine learning and data mining communities since it can be used for various data processing tasks such as non-stationarity adaptation, outlier detection, and feature selection. Recently, several methods have been developed for directly estimating the density ratio without going through density estimation and were shown to work well in various practical problems. However, these methods still perform rather poorly when the dimensionality of the data domain is high. In this paper, we propose to incorporate a dimensionality reduction scheme into a density-ratio estimation procedure and experimentally show that the estimation accuracy in high-dimensional cases can be improved.

View full abstract

Download PDF (1202K)
Sufficient Dimension Reduction via Squared-loss Mutual Information Estimation

Taiji SUZUKI, Masashi SUGIYAMA

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 05-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_05

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

The goal of sufficient dimension reduction in supervised learning is to find the low-dimensional subspace of input features that is 'sufficient' for predicting output values. In this paper, we propose a novel sufficient dimension reduction method using a squared-loss variant of mutual information as a dependency measure. We derive an analytic approximator of squared-loss mutual information based on density ratio estimation, which is shown to possess suitable convergence properties. We then develop a natural gradient algorithm for sufficient subspace search. Numerical experiments show that the proposed method compares favorably with existing dimension reduction approaches.

View full abstract

Download PDF (252K)
Identification of an exogenous variable in a linear non-Gaussian structural equation model

Shohei SHIMIZU, Aapo HYVARINEN, Yoshinobu KAWAHARA, Takashi WASHIO

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 06-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_06

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

Structural equation models and Bayesian networks have been widely used to analyze causal relations between continuous variables. In such frameworks, linear acyclic models are typically used to model the datagenerating process of variables. Recently, it was shown that use of non-Gaussianity identifies a causal ordering of variables in a linear acyclic model without using any prior knowledge on the network structure, which is not the case with conventional methods. However, existing estimation methods are based on iterative search algorithms and may not converge to a correct solution in a finite number of steps. In this paper, we propose a new direct method to estimate a causal ordering based on non-Gaussianity. In contrast to the previous methods, our algorithm requires no algorithmic parameters and is guaranteed to converge to the right solution within a small fixed number of steps if the data strictly follows the model.

View full abstract

Download PDF (204K)
Maximum Likelihood Estimation for Failed-Link Detection in Communication Networks

Shohei HIDO, Yutaka TAKAHASHI

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 07-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_07

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

Early detection of network troubles such as link down is important in the management of communication networks. However, it is difficult to correctly locate the failed network device since the networks are time-evolving, redundant, and not directly-observable. In this paper we focus on a detection algorithm that allows for detecting the change in a given network structure (topology) and for locating the failed link. After sending a sufficient number of test packets (probes) among multiple pairs of end nodes, we evaluate the likelihood of failure on each of the existing links. Then we use maximum likelihood estimation to detect failure and to infer the most suspicious link. Simulation experiments demonstrate that our method can identify the location of a failed link with good accuracy.

View full abstract

Download PDF (226K)
GAUSSIAN MIXTURE MODELS AND VC DIMENSIONS

YOHJI AKAMA

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 08-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_08

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

We will establish the VC dimension of the class of translated ellipsoids in d-dimensional Euclidean space is between (d/2+1)2 and (6.53486 ・・・ )(d2+3d). By this, (d/2+1)2 turns out to be a lower bound of the VC dimension of the concept class that N-component Gaussian mixture models induce through maximum likelihood estimate.

View full abstract

Download PDF (143K)
Monolithic and Partial Compilation Methods for Probabilistic Inference of Bayesian Networks using ZBDDs.

Daisuke TOKORO, Kiyoharu HAMAGUCHI, Toshinobu KASHIWABARA, Shin-ichi M ...

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 09-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_09

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

This paper addresses an algorithm for probabilistic inference for Bayesian Networks (BNs). Recently, we proposed the algorithm using Zero-suppressed BDDs for compiling BNs using "separation variables" which provides compact ZBDDs. In this paper, we provide the method for constructing ZBDDs only for related parts of networks. We show some experimental results to compare our new method with the previous one. The experimental results suggest that the new method is superior to the previous method when BNs have not many ancestors for each node, and when the number of instantiations are small.

View full abstract

Download PDF (403K)
Pattern Discovery from a Single Graph with Quantitative Itemsets

Yuuki MIYOSHI, Tomonobu OZAKI, Takenao OHKAWA

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 10-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_10

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

In this paper, we focus on a single graph whose vertices contain a set of quantitative attributes. Several networks can be naturally represented in this complex graph. An example is a social network whose vertex corresponds to a person with some quantitative items such as age, salary and so on. Although it can easily be expected that this kind of data will increase rapidly, most of current graph mining algorithms do not handle these complex graphs directly. Motivated by the above background, by effectively combining techniques of graph mining and quantitative itemset mining, we developed an algorithm named FAG-gSpan for nding frequent patterns from a graph with quantitative itemsets.

View full abstract

Download PDF (760K)
Mining Frequent Patterns from Linear Graphs

Yasuo TABEI, Daisuke OKANOHARA, Koji TSUDA

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages 11-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_11

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Show abstractHide abstract

Text classification tasks such as sentiment and subjectivity prediction require a dependency structure beyond a bag of words. The BACT method by Kudo and Matsumoto (2004) extracts frequent subtrees from the dependency trees by a tree mining method, and uses them as features of AdaBoost. It cannot deal with predicate-argument structures (PASs) produced by recent deep parsers, because PAS is represented not as a tree, but as an attributed graph with totally ordered vertices (linear graphs). We propose an efficient frequent mining method for linear graphs based on the reverse search paradigm. In comparison to dependency trees, our graph features were better in accuracy and interpretability,while the efficiency stays competitive.

View full abstract

Download PDF (190K)
[title in Japanese]

[in Japanese]

Article type: SIG paper
2009 Volume 2009 Issue DMSM-A901 Pages c01-
Published: July 07, 2009
Released on J-STAGE: August 28, 2021

DOIhttps://doi.org/10.11517/jsaisigtwo.2009.DMSM-A901_c01

RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

Download PDF (102K)

Register with J-STAGE for free!