university of groningen exploring multidimensional

2
University of Groningen Exploring Multidimensional Projections Through Explanatory Maps Rodrigues Oliveira da Silva, Renato; Rauber, Paulo Eduardo; Messias Martins, Rafael; Minghim, Rosane; Telea, Alexandru IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document version below. Document Version Publisher's PDF, also known as Version of record Publication date: 2016 Link to publication in University of Groningen/UMCG research database Citation for published version (APA): Rodrigues Oliveira da Silva, R., Rauber, P. E., Messias Martins, R., Minghim, R., & Telea, A. (2016). Exploring Multidimensional Projections Through Explanatory Maps. Poster session presented at ICT.Open 2016, Amersfoort, Netherlands. Copyright Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons). Take-down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the number of authors shown on this cover page is limited to 10 maximum. Download date: 12-09-2017

Upload: others

Post on 02-Dec-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: University of Groningen Exploring Multidimensional

University of Groningen

Exploring Multidimensional Projections Through Explanatory Maps

Rodrigues Oliveira da Silva, Renato; Rauber, Paulo Eduardo; Messias Martins, Rafael;Minghim, Rosane; Telea, Alexandru

IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish tocite from it. Please check the document version below.

Document VersionPublisher's PDF, also known as Version of record

Publication date:2016

Link to publication in University of Groningen/UMCG research database

Citation for published version (APA):Rodrigues Oliveira da Silva, R., Rauber, P. E., Messias Martins, R., Minghim, R., & Telea, A. (2016).Exploring Multidimensional Projections Through Explanatory Maps. Poster session presented at ICT.Open2016, Amersfoort, Netherlands.

CopyrightOther than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of theauthor(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons).

Take-down policyIf you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediatelyand investigate your claim.

Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons thenumber of authors shown on this cover page is limited to 10 maximum.

Download date: 12-09-2017

Page 2: University of Groningen Exploring Multidimensional

Exploring Multidimensional Projections Through Explanatory Maps

Multidimensional Projections (MPs) are popular tools used to support the analysis of multidimensional data. MPs can project data to a low dimensional space, and are typically visualized as 2D scatterplots where similar elements are conveniently positioned in close neighborhoods. However such visualizations tell us which points are similar, but not why. Our aim is, thus, to enrich 2D MP scatterplots with explanatory information telling users which key dimensions make closely-projected points similar, and their values.

To explain a projection we identify which dimensions are more important to define similarity among close points. We first assign a rank to each of the dimensions for each projected point, based on increasing order of data variance in that dimension over that neighborhood. We also define the ranking confidence for each point based on how mixed are the top-ranked dimensions on their neighborhood points. We display the top ranks and their confidences over the projection using a dense map technique based on nearest-neighbor (Voronoi) interpolation, with top-ranks encoded by a categorical colormap and confidences by brightness.

Figure 1 - Explanation of points sampled from three cube faces. Points on the same cube-face have the same value for one dimension d, so they have assigned the same color. Color brightness indicate the confidence ranking, from high (bright) to low (dark).

(a) Projection scatterplot (b) Explanatory map

Using one top-ranked dimension per point, the explanatory map defines clusters, grouping elements whose similarities are best explained by the same dimension. Dimensions names are positioned on the clusters outlines. We use a colorbar to inform the distribution values for those dimensions on demand.

Figure 2 - Top Ranked Explanatory map of projected points from a Software quality metrics dataset.

Future work targets better visual encodings to explain domain specific data, such as texts and time series. We also want to develop new strategies to analyze time-dependent projections.

We can use more than one dimension per point to have a more detailed explanation. By thesholding the ranking of every point, we create sets of top ranked dimensions. Next, we can apply the same strategy as before, defining and explaining clusters of sets containing the same dimensions.

Figure 3 - Dimensions Sets Explanatory map of the Software quality metrics projection.

To facilitate the interpretation of our maps, we automatically detect same colored regions outlines and fill these with dimensions labels.The outlines detection step involves computing the convex hull of these regions, sample the hulls, and iteratively shrink them moving the sampled points with a small step along the contour’s inward normal.Then fill outlines with dimension names labels, by creating a grid over each outline. Labels are iteratively moved to grid cells inside the outlines boundaries, until a valid position is found. We position labels on angles according to the outlines shapes, or horizontally. The size of each label reflects its weight importance on the dimension ranking, i.e., big labels indicate dimensions which contribute more to neighborhoods similarities.

Renato Rodrigues Oliveira da SilvaJohann Bernoulli Institute for Mathematics and Computer ScienceUniversity of GroningenNijenborgh 9, 9747 AG Groningene-mail: [email protected]

This work is supported by FAPESPGrant 2011/18838-5