Ítem


Interpretation of Orthonormal Coordinates in Case of Three-part Compositions Applied to Orthogonal Regression for Compositional Data

Orthonormal coordinates are very important tool for compositional data processing using standard statistical methods. Namely, in order to express a D-part composition in the Euclidean real space we use isometric log-ratio (ilr) transformation, which is an isometric mapping from the sample space of compositions, the simplex S D with the Aitchison geometry, to the (D −1)-dimensional Euclidean real space RD−1 . The ilr transformation results in coordinates of an orthonormal basis on the simplex. Advantages coming from this transformation, like the mentioned isometry between S D and RD−1 , are closely related with the problem of interpreting orthonormal coordinates, constructed by sequential binary partition. Their interpretation can be approached as balances between groups of parts of a composition as well as by expressing their covariance structure by log-ratios of parts of the analyzed composition, i.e. in terms of ratios. Note that if we want to achieve interpretation of results of statistical analysis directly on the simplex (in terms of the original compositional parts), the backtransformation is required. The aim of the contribution is to analyze the interpretation of two coordinates (balances) obtained by the ilr transformation of three-part compositions. Attention is focused on interpreting coordinates coming from the description of their covariance structure. General conclusions will be used for analysing results from orthogonal regression for compositions. Its main idea is to fit a line explaining the set of n compositional data points in coordinates in such a way that the sum of squared distances from data points to the estimated line is minimal. By using the theory of linear regression models with type II constraints, it is possible to construct confidence bounds or testing hypotheses on regression parameters. However, especially the mentioned parameters cannot be easily interpreted back on the simplex, the interpretation is only possible in sense of the orthonormal coordinates. The theoretical considerations will be illustrated on a real-world example

Universitat de Girona. Departament d’Informàtica i Matemàtica Aplicada

Altres contribucions: Universitat de Girona. Departament d’Informàtica i Matemàtica Aplicada
Autor: Donevska, S.
Fišerová, E.
Hron, Karel
Data: 5 juny 2018
Resum: Orthonormal coordinates are very important tool for compositional data processing using standard statistical methods. Namely, in order to express a D-part composition in the Euclidean real space we use isometric log-ratio (ilr) transformation, which is an isometric mapping from the sample space of compositions, the simplex S D with the Aitchison geometry, to the (D −1)-dimensional Euclidean real space RD−1 . The ilr transformation results in coordinates of an orthonormal basis on the simplex. Advantages coming from this transformation, like the mentioned isometry between S D and RD−1 , are closely related with the problem of interpreting orthonormal coordinates, constructed by sequential binary partition. Their interpretation can be approached as balances between groups of parts of a composition as well as by expressing their covariance structure by log-ratios of parts of the analyzed composition, i.e. in terms of ratios. Note that if we want to achieve interpretation of results of statistical analysis directly on the simplex (in terms of the original compositional parts), the backtransformation is required. The aim of the contribution is to analyze the interpretation of two coordinates (balances) obtained by the ilr transformation of three-part compositions. Attention is focused on interpreting coordinates coming from the description of their covariance structure. General conclusions will be used for analysing results from orthogonal regression for compositions. Its main idea is to fit a line explaining the set of n compositional data points in coordinates in such a way that the sum of squared distances from data points to the estimated line is minimal. By using the theory of linear regression models with type II constraints, it is possible to construct confidence bounds or testing hypotheses on regression parameters. However, especially the mentioned parameters cannot be easily interpreted back on the simplex, the interpretation is only possible in sense of the orthonormal coordinates. The theoretical considerations will be illustrated on a real-world example
Accés al document: http://hdl.handle.net/2072/319414
Llenguatge: eng
Editor: Universitat de Girona. Departament d’Informàtica i Matemàtica Aplicada
Drets: Tots els drets reservats
Matèria: Estadística matemàtica -- Congressos
Mathematical statistics -- Congresses
Anàlisi multivariable -- Congressos
Multivariate analysis -- Congresses
Títol: Interpretation of Orthonormal Coordinates in Case of Three-part Compositions Applied to Orthogonal Regression for Compositional Data
Tipus: info:eu-repo/semantics/conferenceObject
Repositori: Recercat

Matèries

Autors