Recognition of bacteria named entity using conditional random fields in Spark

BMC systems biology, 2018-11, Vol.12 (Suppl 6), p.106-106, Article 106

2018

Details

Author(s) / Contributors

Title

Recognition of bacteria named entity using conditional random fields in Spark

Is part of

BMC systems biology, 2018-11, Vol.12 (Suppl 6), p.106-106, Article 106

Place / Publisher

England: BioMed Central Ltd

Year of publication

2018

Source

EZB Electronic Journals Library

Descriptions / Notes

Microbes play a crucial role in the functional mechanisms of ecosystems. Identifying the interactions among microbes is an important step towards understanding the structure and function of microbial communities, as well as the impact of microbes on human health and disease. Despite this importance, there is currently no gold-standard dataset of microbial interactions. Traditional approaches such as growth and co-culture analysis must be performed in the laboratory, which is time-consuming and costly. By providing predicted candidate interactions for experimental verification, computational methods can alleviate this problem. Mining microbial interactions from large volumes of medical text is one such computational method. Identifying named entities of bacteria and related entities in text is the basis for microbial relation extraction. In previous work, a system for bacteria named entity recognition based on a dictionary and conditional random fields was proposed. However, it is inefficient when dealing with large-scale text. We implemented bacteria named entity recognition on the Spark platform and designed comparison experiments to verify the correctness and validity of the proposed system. The experimental results show that it achieves a higher F-measure in the correctness comparison. Moreover, prediction is much faster than in the previous version on large-scale biomedical datasets, with computational efficiency improved by about 3.1 to 6.7 times. The system for bacteria named entity recognition resolves the inefficiency of the previously proposed system on large-scale datasets. The proposed system performs well in both accuracy and scalability.

Language: English
Identifiers: ISSN: 1752-0509
eISSN: 1752-0509
DOI: 10.1186/s12918-018-0625-3
Title ID: cdi_proquest_miscellaneous_2138633590