This contribution describes the evolution of the main CERN storage system, CASTOR, as it manages the bulk data stream of the LHC and other CERN experiments, reaching over 90 PB of stored data by the end of LHC Run 1. This evolution was marked by the introduction of policies to optimize the throughput of the tape sub-system, moving towards a cold storage system where data placement is managed by the experiments' production managers. More efficient tape migrations and recalls have been implemented and deployed, in which bulk meta-data operations greatly reduce the overhead caused by small files. A repack facility is now integrated in the system and has been enhanced to automate the repacking of several tens of petabytes, required in 2014 to prepare for the next LHC run. Finally, the scheduling system has been evolved to integrate the internal monitoring. Efficient management of the service requires a solid monitoring infrastructure able to analyze the logs produced by the different components (about 1 kHz of log messages, i.e. roughly 86 million messages per day). A new system has been developed and deployed that uses a transport messaging layer provided by the CERN-IT Agile Infrastructure and exploits technologies including Hadoop and HBase. This enables efficient data mining through MapReduce techniques, as well as real-time data aggregation and visualization. The paper concludes with an outlook: directions and possible evolution are discussed in view of the restart of data-taking activities.
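To make the log-analysis pipeline concrete, the following is a minimal sketch of a Hadoop Streaming job in Python that aggregates log messages into per-component, per-minute counts, the kind of MapReduce aggregation described above. The assumed log-line layout ("<timestamp> <component> <message>"), the script name, and all identifiers are illustrative assumptions and do not reflect the actual CASTOR log schema or production code.

```python
#!/usr/bin/env python3
# Hypothetical Hadoop Streaming job: count log messages per
# (component, minute) bucket. All names and the assumed log layout
# "<ISO-8601 timestamp> <component> <message...>" are illustrative,
# not the actual CASTOR log schema.
import sys

def mapper():
    # Emit one "<component>|<minute> TAB 1" record per log line.
    for line in sys.stdin:
        parts = line.split(None, 2)
        if len(parts) < 2:
            continue  # skip malformed lines
        timestamp, component = parts[0], parts[1]
        minute = timestamp[:16]  # e.g. "2014-02-01T12:34"
        print(f"{component}|{minute}\t1")

def reducer():
    # Hadoop Streaming delivers mapper output sorted by key, so equal
    # keys arrive contiguously and can be summed in one pass.
    current_key, count = None, 0
    for line in sys.stdin:
        key, n = line.rstrip("\n").split("\t")
        if key != current_key:
            if current_key is not None:
                print(f"{current_key}\t{count}")
            current_key, count = key, 0
        count += int(n)
    if current_key is not None:
        print(f"{current_key}\t{count}")

if __name__ == "__main__":
    # Run as "lograte.py map" in the map phase and
    # "lograte.py reduce" in the reduce phase.
    mapper() if sys.argv[1:] == ["map"] else reducer()
```

Such a job would be launched with the standard streaming jar, along the lines of hadoop jar hadoop-streaming.jar -files lograte.py -mapper "lograte.py map" -reducer "lograte.py reduce" -input /logs -output /rates (paths and jar name are again placeholders); the resulting per-minute counts could then be stored in HBase to back real-time aggregation and dashboards.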