Abstract
Compositional data are characterized by the fact that their elemental information is contained in simple pairwise logratios of the parts that constitute the composition. While pairwise logratios are typically easy to interpret, the number of possible pairs quickly becomes too large even for medium-sized compositions, which can hinder interpretability in further multivariate analyses. Sparse methods can therefore help identify a few important pairwise logratios, and the parts contained in them, from the total candidate set. To this end, we propose a procedure that constructs all possible pairwise logratios and applies sparse principal component analysis to identify the important ones. The performance of the procedure is demonstrated with both simulated and real-world data. In our empirical analyses, we propose three visual tools showing (i) the balance between sparsity and explained variability, (ii) the stability of the pairwise logratios, and (iii) the importance of the original compositional parts, to aid practitioners with model interpretation.
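The two steps named in the abstract, building every pairwise logratio and then looking for a sparse principal direction among them, can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the function names, the toy data, and the penalty value are hypothetical, and the soft-thresholded power iteration is a simplified stand-in for whichever sparse PCA estimator the paper actually uses.

```python
from itertools import combinations

import numpy as np


def pairwise_logratios(X, names):
    """Return the n x D(D-1)/2 matrix of ln(x_i / x_j), i < j, with column labels."""
    X = np.asarray(X, dtype=float)
    if np.any(X <= 0):
        raise ValueError("all parts must be strictly positive")
    log_x = np.log(X)
    pairs = list(combinations(range(X.shape[1]), 2))
    Z = np.column_stack([log_x[:, i] - log_x[:, j] for i, j in pairs])
    labels = [f"ln({names[i]}/{names[j]})" for i, j in pairs]
    return Z, labels


def sparse_first_loading(Z, penalty, n_iter=200):
    """One sparse loading vector via soft-thresholded power iteration.

    Assumption: a simple stand-in for sparse PCA, not the paper's estimator.
    """
    Zc = Z - Z.mean(axis=0)
    # Start from the ordinary (dense) first principal direction.
    v = np.linalg.svd(Zc, full_matrices=False)[2][0]
    for _ in range(n_iter):
        u = Zc @ v
        u_norm = np.linalg.norm(u)
        if u_norm == 0:
            break
        u /= u_norm
        w = Zc.T @ u
        # Soft-thresholding shrinks small loadings to exactly zero.
        w = np.sign(w) * np.maximum(np.abs(w) - penalty, 0.0)
        w_norm = np.linalg.norm(w)
        if w_norm == 0:
            return w  # penalty too large: every loading was shrunk away
        v = w / w_norm
    return v


# Hypothetical toy composition: 50 samples, 5 strictly positive parts.
rng = np.random.default_rng(0)
parts = ["A", "B", "C", "D", "E"]
raw = rng.lognormal(size=(50, 5))
raw[:, 0] *= np.exp(rng.normal(scale=2.0, size=50))  # let part A vary strongly
X = raw / raw.sum(axis=1, keepdims=True)  # close each row to sum 1

Z, labels = pairwise_logratios(X, parts)
loading = sparse_first_loading(Z, penalty=5.0)
selected = [lab for lab, w in zip(labels, loading) if w != 0]
print(Z.shape, selected)
```

Because logratios are invariant to rescaling of a sample, calling `pairwise_logratios` on the unclosed `raw` values gives the same matrix as on `X`. Raising `penalty` leaves fewer nonzero loadings at the cost of explained variability, which is the trade-off the first visual tool in the abstract refers to.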
| Original language | English |
|---|---|
| Publisher | Cornell University - arXiv |
| Number of pages | 23 |
| DOIs | |
| Publication status | Published - 2023 |
Publication series
| Series | arXiv.org |
|---|---|
| Number | 2311.13911 |
| ISSN | 2331-8422 |
Research output
Identifying Important Pairwise Logratios in Compositional Data with Sparse Principal Component Analysis
Nesrstová, V., Wilms, I., Hron, K. & Filzmoser, P., Feb 2025, In: Mathematical Geosciences. 57, 2, p. 333-358 (26 p.)
Research output: Contribution to journal › Article › Academic › peer-review
Open Access