首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Advances in Principal Balances for Compositional Data
Authors:J A Martín-Fernández  V Pawlowsky-Glahn  J J Egozcue  R Tolosona-Delgado
Institution:1.Dept. Informàtica, Matemàtica Aplicada, i Estadística,Universitat de Girona,Girona,Spain;2.Dept. d’Enginyeria Civil i Ambiental,U. Politècnica de Catalunya,Barcelona,Spain;3.Dept. Modelling and Evaluation,Helmholtz-Zentrum Dresden-Rossendorf, Helmholtz-Institut Freiberg for Resource Technology,Freiberg,Germany
Abstract:Compositional data analysis requires selecting an orthonormal basis with which to work on coordinates. In most cases this selection is based on a data driven criterion. Principal component analysis provides bases that are, in general, functions of all the original parts, each with a different weight hindering their interpretation. For interpretative purposes, it would be better to have each basis component as a ratio or balance of the geometric means of two groups of parts, leaving irrelevant parts with a zero weight. This is the role of principal balances, defined as a sequence of orthonormal balances which successively maximize the explained variance in a data set. The new algorithm to compute principal balances requires an exhaustive search along all the possible sets of orthonormal balances. To reduce computational time, the sets of possible partitions for up to 15 parts are stored. Two other suboptimal, but feasible, algorithms are also introduced: (i) a new search for balances following a constrained principal component approach and (ii) the hierarchical cluster analysis of variables. The latter is a new approach based on the relation between the variation matrix and the Aitchison distance. The properties and performance of these three algorithms are illustrated using a typical data set of geochemical compositions and a simulation exercise.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号