The Journal of the Geological Society of Japan
Online ISSN : 1349-9963
Print ISSN : 0016-7630
ISSN-L : 0016-7630
Review
Problems in compositional data analysis and their solutions
Tohru OhtaHiroyoshi Arai
Author information
JOURNAL FREE ACCESS

2006 Volume 112 Issue 3 Pages 173-187

Details
Abstract

Compositional data, represented as percent or parts per million, are subject to the constant-sum constraint that precludes compositional data from much of statistical analysis. Despite this constraint, a theory for statistically rigorous treatment of compositional data is currently under intense development. This paper reviews the utility of two main procedures for compositional data analysis, which will be termed "logratio analysis" and "simplicial analysis". Logratio analysis is a way to map compositional data from a simplex space to a Euclidean real space by transforming compositional data into logarithms of component ratios. This bijectional mapping allows the transformed data to be analyzed by many traditional statistical methods available in real space. On the other hand, simplicial analysis introduces proper classes of parametric distribution, translation operation, scalar multiplication operation, identity unit and metric function within the simplex space. These definitions permit the simplex space to be reviewed as a metric space and compositional data as an Abelian group. Moreover, simplicial analysis provides statistical methodologies for compositional data, which are analogous to those for data sets associated with real space. A brief overview of the constant-sum constraint is followed by mathematical descriptions of logratio and simplicial analyses. Practical analyses of real data sets based on logratio and simplicial analyses are provided to illustrate their potential and to encourage their use.

Content from these authors
© 2006 by The Geological Society of Japan
Next article
feedback
Top