Publication Details

Category Text Publication
Reference Category Journals
DOI 10.1016/j.fluid.2021.113349
Document author version
Title (Primary) Can deep learning algorithms enhance the prediction of solute descriptors for linear solvation energy relationship approaches?
Author Ulrich, N.; Ebert, A.
Source Title Fluid Phase Equilibria
Year 2022
Department OEC; AUC
Volume 555
Page From art. 113349
Language English
Topic T9 Healthy Planet
Keywords Physicochemical property prediction; Partition coefficients; Data augmentation; LSERD; Quantitative structure-property relationship (QSPR); Absolv
Abstract Experimental solute descriptors for about 8,000 chemicals are currently available for physicochemical property predictions based on linear solvation energy relationship (LSER) models. The solute descriptors can be predicted by fragment-based quantitative structure-property relationship (QSPR) models. However, the predictions are problematic for larger chemical structures that contain multiple functional groups. We developed deep neural networks (DNNs) as alternative prediction models based on graph representations of the chemicals. The root mean square errors (RMSEs) range between 0.11 and 0.46 for the different solute descriptors. The predictions of the solute descriptors were compared to predictions from the QSPR of LSERD (an online database) and ACD/Absolv (commercial software). We further investigated the predictive power of all tools based on three different datasets of experimentally determined partition coefficients, namely the octanol-water partition coefficient (Kow), the octanol-air partition coefficient (Koa), and the water-air partition coefficient (Kwa). Additionally, we used two different sets of retention data for GC and LC to evaluate the results of all prediction tools. All prediction tools perform comparably well, with RMSEs of ∼1.0 log unit for the Kow dataset (12,010 chemicals) and ∼1.3 log units for the Kwa dataset (696 chemicals), for example. Nevertheless, larger chemical structures are predicted poorly by each approach. We recommend using the novel DNN model as a complementary prediction tool.
Persistent UFZ Identifier https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=25496
Ulrich, N., Ebert, A. (2022):
Can deep learning algorithms enhance the prediction of solute descriptors for linear solvation energy relationship approaches?
Fluid Phase Equilib. 555, art. 113349 10.1016/j.fluid.2021.113349