Ítem


Rate-Distortion Theory for Clustering in the Perceptual Space

How to extract relevant information from large data sets has become a main challengein data visualization. Clustering techniques that classify data into groups according to similaritymetrics are a suitable strategy to tackle this problem. Generally, these techniques are applied in thedata space as an independent step previous to visualization. In this paper, we propose clusteringon the perceptual space by maximizing the mutual information between the original data and thefinal visualization. With this purpose, we present a new information-theoretic framework based onthe rate-distortion theory that allows us to achieve a maximally compressed data with a minimalsignal distortion. Using this framework, we propose a methodology to design a visualization processthat minimizes the information loss during the clustering process. Three application examples of theproposed methodology in different visualization techniques such as scatterplot, parallel coordinates,and summary trees are presented

This work has been funded in part by grants from the Spanish Government (Nr. TIN2016-75866-C3-3-R) and from the Catalan Government (Nr. 2014-SGR-1232)

MDPI (Multidisciplinary Digital Publishing Institute)

Autor: Bardera i Reig, Antoni
Bramon Feixas, Roger
Ruiz Altisent, Marc
Boada, Imma
Resum: How to extract relevant information from large data sets has become a main challengein data visualization. Clustering techniques that classify data into groups according to similaritymetrics are a suitable strategy to tackle this problem. Generally, these techniques are applied in thedata space as an independent step previous to visualization. In this paper, we propose clusteringon the perceptual space by maximizing the mutual information between the original data and thefinal visualization. With this purpose, we present a new information-theoretic framework based onthe rate-distortion theory that allows us to achieve a maximally compressed data with a minimalsignal distortion. Using this framework, we propose a methodology to design a visualization processthat minimizes the information loss during the clustering process. Three application examples of theproposed methodology in different visualization techniques such as scatterplot, parallel coordinates,and summary trees are presented
This work has been funded in part by grants from the Spanish Government (Nr. TIN2016-75866-C3-3-R) and from the Catalan Government (Nr. 2014-SGR-1232)
Accés al document: http://hdl.handle.net/2072/292125
Llenguatge: eng
Editor: MDPI (Multidisciplinary Digital Publishing Institute)
Drets: Attribution 4.0 Spain
URI Drets: http://creativecommons.org/licenses/by/4.0/es/
Matèria: Visualització de la informació
Information visualization
Informació, Teoria de la
Information theory
Títol: Rate-Distortion Theory for Clustering in the Perceptual Space
Tipus: info:eu-repo/semantics/article
Repositori: Recercat

Matèries

Autors