Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 7 von 49
The Journal of chemical physics, 2022-07, Vol.157 (3), p.034102-034102
2022
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Classifying the toxicity of pesticides to honey bees via support vector machines with random walk graph kernels
Ist Teil von
  • The Journal of chemical physics, 2022-07, Vol.157 (3), p.034102-034102
Ort / Verlag
Melville: American Institute of Physics
Erscheinungsjahr
2022
Quelle
American Institute of Physics
Beschreibungen/Notizen
  • Pesticides benefit agriculture by increasing crop yield, quality, and security. However, pesticides may inadvertently harm bees, which are valuable as pollinators. Thus, candidate pesticides in development pipelines must be assessed for toxicity to bees. Leveraging a dataset of 382 molecules with toxicity labels from honey bee exposure experiments, we train a support vector machine (SVM) to predict the toxicity of pesticides to honey bees. We compare two representations of the pesticide molecules: (i) a random walk feature vector listing counts of length-L walks on the molecular graph with each vertex- and edge-label sequence and (ii) the Molecular ACCess System (MACCS) structural key fingerprint (FP), a bit vector indicating the presence/absence of a list of pre-defined subgraph patterns in the molecular graph. We explicitly construct the MACCS FPs but rely on the fixed-length-L random walk graph kernel (RWGK) in place of the dot product for the random walk representation. The L-RWGK-SVM achieves an accuracy, precision, recall, and F1 score (mean over 2000 runs) of 0.81, 0.68, 0.71, and 0.69, respectively, on the test data set—with L = 4 being the mode optimal walk length. The MACCS-FP-SVM performs on par/marginally better than the L-RWGK-SVM, lends more interpretability, but varies more in performance. We interpret the MACCS-FP-SVM by illuminating which subgraph patterns in the molecules tend to strongly push them toward the toxic/non-toxic side of the separating hyperplane.
Sprache
Englisch
Identifikatoren
ISSN: 0021-9606
eISSN: 1089-7690
DOI: 10.1063/5.0090573
Titel-ID: cdi_crossref_primary_10_1063_5_0090573

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX