Item


Graphing and Communicating Compositional Data in High Dimensions

Visualization of data becomes more challenging as the dimensionality of the data increases, impacting not only the display of the data itself but also the modeling results. This paper discusses common visualization techniques for compositional data. None of them seem to be well suited for changes in compositions that depend on either a metric covariate or a factor. The clr-deviation chart as a chart with a factor or covariate as abscissa and all centered log ratio-transformed component values superimposed on the ordinate axis is then introduced jointly with the clr-component chart. The clr-deviation chart takes advantage of the sum-equals-zero property of clr-transformed compositional data. It has some theoretical and practical advantages over alternatives and one major disadvantage – an arbitrarily scaled ordinate axis; its properties are discussed. The usefulness of the methods are illustrated using an example analyzing the changes of proportions of the different diseases treated by hospitalization over a period of 13 years in Germany

Universitat de Girona. Departament d’Informàtica i Matemàtica Aplicada

Other contributions: Universitat de Girona. Departament d’Informàtica i Matemàtica Aplicada
Author: Ulbrich, H.F.
Date: 2018 June 5
Abstract: Visualization of data becomes more challenging as the dimensionality of the data increases, impacting not only the display of the data itself but also the modeling results. This paper discusses common visualization techniques for compositional data. None of them seem to be well suited for changes in compositions that depend on either a metric covariate or a factor. The clr-deviation chart as a chart with a factor or covariate as abscissa and all centered log ratio-transformed component values superimposed on the ordinate axis is then introduced jointly with the clr-component chart. The clr-deviation chart takes advantage of the sum-equals-zero property of clr-transformed compositional data. It has some theoretical and practical advantages over alternatives and one major disadvantage – an arbitrarily scaled ordinate axis; its properties are discussed. The usefulness of the methods are illustrated using an example analyzing the changes of proportions of the different diseases treated by hospitalization over a period of 13 years in Germany
Document access: http://hdl.handle.net/2072/319391
Language: eng
Publisher: Universitat de Girona. Departament d’Informàtica i Matemàtica Aplicada
Rights: Tots els drets reservats
Subject: Anàlisi multivariable -- Congressos
Multivariate analysis -- Congresses
Estadística matemàtica -- Congressos
Mathematical statistics -- Congresses
Title: Graphing and Communicating Compositional Data in High Dimensions
Type: info:eu-repo/semantics/conferenceObject
Repository: Recercat

Subjects

Authors