UB Paderborn / Katalog / Suche / Details

Ergebnis 10 von 104

Journal of synchrotron radiation, 2024-05, Vol.31 (Pt 3), p.635-645

2024

Autor(en) / Beteiligte

Titel

A distributed data processing scheme based on Hadoop for synchrotron radiation experiments

Ist Teil von

Ort / Verlag

United States: John Wiley & Sons, Inc

Erscheinungsjahr

2024

Link zum Volltext

Quelle

Wiley Online Library - AutoHoldings Journals

Beschreibungen/Notizen

With the development of synchrotron radiation sources and high-frame-rate detectors, the amount of experimental data collected at synchrotron radiation beamlines has increased exponentially. As a result, data processing for synchrotron radiation experiments has entered the era of big data. It is becoming increasingly important for beamlines to have the capability to process large-scale data in parallel to keep up with the rapid growth of data. Currently, there is no set of data processing solutions based on the big data technology framework for beamlines. Apache Hadoop is a widely used distributed system architecture for solving the problem of massive data storage and computation. This paper presents a set of distributed data processing schemes for beamlines with experimental data using Hadoop. The Hadoop Distributed File System is utilized as the distributed file storage system, and Hadoop YARN serves as the resource scheduler for the distributed computing cluster. A distributed data processing pipeline that can carry out massively parallel computation is designed and developed using Hadoop Spark. The entire data processing platform adopts a distributed microservice architecture, which makes the system easy to expand, reduces module coupling and improves reliability.

Sprache: Englisch
Identifikatoren: ISSN: 1600-5775, 0909-0495
eISSN: 1600-5775
DOI: 10.1107/S1600577524002637
Titel-ID: cdi_doaj_primary_oai_doaj_org_article_4c312595b2ee441d8ac5e8139555e7d7

Format: –
Schlagworte: apache hadoop, Big Data, Computer networks, Computer Programs, Data processing, Data storage, distributed data processing, Distributed processing, microservice architecture, Parallel processing, Pipelining (computers), Radiation, Radiation sources, Resource scheduling, Synchrotron radiation

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX