Comparison of five dimensionality reduction methods on the multidimensional “Forest Cover Type” dataset
Abstract
A multidimensional and multivariate structure, with mixed data type gives to researchers the opportunity to use many statistical methods of dimensionality reduction, which aimed at a reduced representation of the original data set that will be smaller in “volume” but will still contain critical and useful information. In this study, the statistical dimensionality reduction methods that were compared with each other, through appropriate data set, were Principal Components Analysis, Factor Analysis, Correspondence Analysis, Categorical Principal Components Analysis and Factor Analysis for Mixed Data. For the comparisons of these methods, various strategies were applied. The aims of this study were to compare the results of five dimensionality reduction methods, to check the application of these methods to multidimensional mixed data type and to compare the results’ extraction times from different statistical softwares, with purpose to highlighting significant computational and interpretive disadvantages. The statistical softwares used were Python and SPSS. Important disadvantages of these methods were the “curse of dimensionality”, in the sense of determining the number of important dimensions, the increased computing power that was required, the lack of softwares’ code for certain methods, the differentiation in terms of calculations between the softwares and by extension the extraction of different results, the disability of some softwares in terms of the management of many pseudo-variables and the difficulty of highlighting the most appropriate method for reducing the mathematical dimensions.
Article Details
- How to Cite
-
Kyrana, Z., Pratsinakis, E., Papafilippou, N., Markos, A., & Menexes, G. (2026). Comparison of five dimensionality reduction methods on the multidimensional “Forest Cover Type” dataset . Data Analysis Bulletin, 21(1). Retrieved from https://ejournals.epublishing.ekt.gr/index.php/dab/article/view/33661
- Section
- Empirical studies

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who publish their work in the journal DATA ANALYSIS BULLETIN agree to the following terms:
1. Authors will not be charged any submission, processing or publication fees for their work. These costs are covered by the Greek Society of Data Analysis.
2. The copyright of papers published in the journal DATA ANALYSIS BULLETIN is protected by the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International license. The Authors retain the Copyright and grant the journal the right of first publication. This license allows third party licensees to use the work in any form for non-commercial purposes only. If third parties modify or adapt the content, they must license the modified material for noncommercial purposes only. If others modify or adapt the material, they must license the modified material under identical terms.
3. Provided that the terms of the licence concerning the reference to the original author and the original publication in the journal DATA ANALYSIS BULLETIN are maintained.
4. Authors may enter into separate and additional contracts and agreements for the non-exclusive distribution of the work as published in the DATA ANALYSIS BULLETIN journal (e.g., deposit in academic repositories), provided that the acknowledgement and citation of the first publication in the DATA ANALYSIS BULLETIN journal is acknowledged.
5. The DATA ANALYSIS BULLETIN journal allows and encourages authors to deposit their work in institutional (e.g. the repository of the National Documentation Centre) or thematic repositories, after publication in DATA ANALYSIS BULLETIN and under Open Access conditions, as determined by their research funders and/or the institutions with which they collaborate, as appropriate. When submitting their work, authors should provide information on the publication of the work in the journal and the sources of funding for their research. Lists of institutional and thematic repositories by country are available at http://opendoar.org/countrylist.php. Authors can deposit their work free of charge in the repository www.zenodo.org, which is supported by OpenAIRE (www.openaire.eu ), as part of the European Commission's policies to support Open Academic Research.