UB Paderborn / Katalog / Suche / Details

Zur Ergebnisliste

Ergebnis 3 von 557

Clustering and variable selection evaluation of 13 unsupervised methods for multi-omics data integration

Briefings in bioinformatics, 2020-12, Vol.21 (6), p.2011-2030

2020

Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte

Titel

Clustering and variable selection evaluation of 13 unsupervised methods for multi-omics data integration

Ist Teil von

Briefings in bioinformatics, 2020-12, Vol.21 (6), p.2011-2030

Ort / Verlag

England: Oxford University Press

Erscheinungsjahr

2020

Quelle

EBSCOhost Business Source Ultimate

Beschreibungen/Notizen

Abstract Recent advances in NGS sequencing, microarrays and mass spectrometry for omics data production have enabled the generation and collection of different modalities of high-dimensional molecular data. The integration of multiple omics datasets is a statistical challenge, due to the limited number of individuals, the high number of variables and the heterogeneity of the datasets to integrate. Recently, a lot of tools have been developed to solve the problem of integrating omics data including canonical correlation analysis, matrix factorization and SM. These commonly used techniques aim to analyze simultaneously two or more types of omics. In this article, we compare a panel of 13 unsupervised methods based on these different approaches to integrate various types of multi-omics datasets: iClusterPlus, regularized generalized canonical correlation analysis, sparse generalized canonical correlation analysis, multiple co-inertia analysis (MCIA), integrative-NMF (intNMF), SNF, MoCluster, mixKernel, CIMLR, LRAcluster, ConsensusClustering, PINSPlus and multi-omics factor analysis (MOFA). We evaluate the ability of the methods to recover the subgroups and the variables that drive the clustering on eight benchmarks of simulation. MOFA does not provide any results on these benchmarks. For clustering, SNF, MoCluster, CIMLR, LRAcluster, ConsensusClustering and intNMF provide the best results. For variable selection, MoCluster outperforms the others. However, the performance of the methods seems to depend on the heterogeneity of the datasets (especially for MCIA, intNMF and iClusterPlus). Finally, we apply the methods on three real studies with heterogeneous data and various phenotypes. We conclude that MoCluster is the best method to analyze these omics data. Availability: An R package named CrIMMix is available on GitHub at https://github.com/CNRGH/crimmix to reproduce all the results of this article.

Sprache: Englisch
Identifikatoren: ISSN: 1477-4054, 1467-5463
eISSN: 1477-4054
DOI: 10.1093/bib/bbz138
Titel-ID: cdi_hal_primary_oai_HAL_cea_02393847v1

Format: –
Schlagworte: Applications, Benchmarks, Clustering, Correlation analysis, Data analysis, Data integration, Datasets, Factor analysis, Feature selection, Heterogeneity, Integration, Mass spectrometry, Mass spectroscopy, Mathematics, Phenotypes, Statistics, Subgroups

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX

Menü

Weitere Dienste

Einstellungen

Clustering and variable selection evaluation of 13 unsupervised methods for multi-omics data integration

Details

Weiterführende Literatur