UB Paderborn / Katalog / Details

International journal of intelligent information technologies, 2014-07, Vol.10 (3), p.19-35

2014

Autor(en) / Beteiligte

Titel

Extracting Functional Dependencies in Large Datasets Using MapReduce Model

Ist Teil von

International journal of intelligent information technologies, 2014-07, Vol.10 (3), p.19-35

Ort / Verlag

Hershey: IGI Global

Erscheinungsjahr

2014

Link zum Volltext

Quelle

Alma/SFX Local Collection

Beschreibungen/Notizen

Over the last few years, data are generated in large volume at a faster rate and there has been a remarkable growth in the need for large scale data processing systems. As data grows larger in size, data quality is compromised. Functional dependencies representing semantic constraints in data are important for data quality assessment. Executing functional dependency discovery algorithms on a single computer is hard and laborious with large data sets. MapReduce provides an enabling technology for large scale data processing. The open-source Hadoop implementation of MapReduce has provided researchers a powerful tool for tackling large-data problems in a distributed manner. The objective of this study is to extract functional dependencies between attributes from large datasets using MapReduce programming model. Attribute entropy is used to measure the inter attribute correlations, and exploited to discover functional dependencies hidden in the data.

Sprache: Englisch; Ndonga
Identifikatoren: ISSN: 1548-3657
eISSN: 1548-3665
DOI: 10.4018/ijiit.2014070102
Titel-ID: cdi_gale_infotracacademiconefile_A391720160

Format: –
Schlagworte: Algorithms, Analysis, Computer simulation, Data processing, Datasets, Electronic data processing, Entropy, Freeware, Open source software, Parallel processing, Parallel programming (Computer science), Programming, Quality assessment, Semantics, Technology application

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX