Cluster computing, 2021-09, Vol.24 (3), p.2249-2268

2021

Author(s) / Contributors

Title

ScalaParBiBit: scaling the binary biclustering in distributed-memory systems

Is part of

Place / Publisher

New York: Springer US

Year of publication

2021

Source

Alma/SFX Local Collection

Descriptions/Notes

Biclustering is a data mining technique that finds groups of rows and columns that are highly correlated in a 2D dataset. Although several software applications perform biclustering, most of them suffer from a high computational complexity, which prevents their use on large datasets. In this work we present ScalaParBiBit, a parallel tool to find biclusters in binary data, which are common in many research fields such as text mining, marketing and bioinformatics. ScalaParBiBit takes advantage of the special characteristics of these binary datasets, as well as of an efficient parallel algorithm and implementation, to accelerate the biclustering procedure in distributed-memory systems. The experimental evaluation shows that our tool is significantly faster and more scalable than the state-of-the-art tool ParBiBit in a cluster with 32 nodes and 768 cores. The tool and its reference manual are freely available at https://github.com/fraguela/ScalaParBiBit.
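
To illustrate what finding biclusters in binary data can look like, the following is a minimal serial sketch in Python. It is not taken from ScalaParBiBit or ParBiBit: it assumes a BiBit-style approach in which each row is packed into a bit word, the bitwise AND of a row pair serves as a candidate column pattern, and every row containing that pattern joins the bicluster. The function names (`encode_row`, `find_biclusters`, `columns_of`) and the thresholds `min_rows` and `min_cols` are hypothetical and chosen only for this example.

```python
from itertools import combinations


def encode_row(bits):
    """Pack a sequence of 0/1 values into one int (bit i stands for column i)."""
    word = 0
    for col, bit in enumerate(bits):
        if bit:
            word |= 1 << col
    return word


def find_biclusters(matrix, min_rows=2, min_cols=2):
    """Map each column pattern (as an int) to the rows that contain it.

    Illustrative assumption: patterns are seeded from pairs of rows,
    and a bicluster is a submatrix made up entirely of ones.
    """
    words = [encode_row(row) for row in matrix]
    found = {}
    for i, j in combinations(range(len(words)), 2):
        pattern = words[i] & words[j]  # columns set to 1 in both rows
        if bin(pattern).count("1") < min_cols or pattern in found:
            continue
        # A row belongs to the bicluster if it has a 1 in every pattern column.
        rows = [r for r, w in enumerate(words) if (w & pattern) == pattern]
        if len(rows) >= min_rows:
            found[pattern] = rows
    return found


def columns_of(pattern):
    """List the column indices whose bit is set in a pattern."""
    return [c for c in range(pattern.bit_length()) if (pattern >> c) & 1]


if __name__ == "__main__":
    data = [
        [1, 1, 0, 1],
        [1, 1, 1, 0],
        [1, 1, 0, 0],
        [0, 0, 1, 1],
    ]
    for pattern, rows in find_biclusters(data).items():
        print("rows", rows, "columns", columns_of(pattern))
```

Because every row pair can be processed independently, the outer loop is the natural unit of work to split across processes in a sketch like this one; how ScalaParBiBit actually distributes its work is described in the paper and its reference manual, not here.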

Language: English
Identifiers: ISSN: 1386-7857
eISSN: 1573-7543
DOI: 10.1007/s10586-021-03261-z
Title ID: cdi_proquest_journals_2918248785

Format: –
Subjects: Algorithms, Applications programs, Binary data, Bioinformatics, Clustering, Computer Communication Networks, Computer Science, Data compression, Data mining, Datasets, Distributed memory, Gene expression, Operating Systems, Processor Architectures
