Abstract
Non-ionic deep eutectic solvents (DESs) are non-ionic designer solvents with various applications in catalysis, extraction, carbon capture, and pharmaceuticals. However, discovering new DES candidates is challenging due to a lack of efficient tools that accurately predict DES formation. The search for DES relies heavily on intuition or trial-and-error processes, leading to low success rates or missed opportunities. Recognizing that hydrogen bonds (HBs) play a central role in DES formation, we aim to identify HB features that distinguish DES from non-DES systems and use them to develop machine learning (ML) models to discover new DES systems. We first analyze the HB properties of 38 known DES and 111 known non-DES systems using their molecular dynamics (MD) simulation trajectories. The analysis reveals that DES systems have two unique features compared to non-DES systems: The DESs have ① more imbalance between the numbers of the two intra-component HBs and ② more and stronger inter-component HBs. Based on these results, we develop 30 ML models using ten algorithms and three types of HB-based descriptors. The model performance is first benchmarked using the average and minimal receiver operating characteristic (ROC)-area under the curve (AUC) values. We also analyze the importance of individual features in the models, and the results are consistent with the simulation-based statistical analysis. Finally, we validate the models using the experimental data of 34 systems. The extra trees forest model outperforms the other models in the validation, with an ROC-AUC of 0.88. Our work illustrates the importance of HBs in DES formation and shows the potential of ML in discovering new DESs.
| Original language | English |
|---|---|
| Pages (from-to) | 74-83 |
| Number of pages | 10 |
| Journal | Engineering |
| Volume | 39 |
| DOIs | |
| State | Published - Aug 2024 |
Bibliographical note
Publisher Copyright:© 2024 THE AUTHORS
Funding
This work was supported by Ignite Research Collaborations (IRC), Startup funds, and the UK Artificial Intelligence (AI) in Medicine Research Alliance Pilot (NCATS UL1TR001998 and NCI P30 CA177558), University of Kentucky Center for Computational Sciences and Information Technology Services Research Computing for the use of the Lipscomb Compute Cluster of the University of Kentucky. Usman L. Abbas, Yuxuan Zhang, Joseph Tapia, Selim Md, Jin Chen, Jian Shi, and Qing Shao declare that they have no conflict of interest or financial conflicts to disclose. This work was supported by Ignite Research Collaborations (IRC), Startup funds, and the UK Artificial Intelligence (AI) in Medicine Research Alliance Pilot (NCATS UL1TR001998 and NCI P30 CA177558), University of Kentucky Center for Computational Sciences and Information Technology Services Research Computing for the use of the Lipscomb Compute Cluster of the University of Kentucky.
| Funders | Funder number |
|---|---|
| Joseph Tapia | |
| UK Artificial Intelligence | |
| University of Kentucky | |
| Kentucky Transportation Center, University of Kentucky | |
| NCATS CCTS | UL1TR001998 |
| Columbus NCI | P30 CA177558 |
Keywords
- Deep eutectic solvents
- Hydrogen bond
- Machine learning
- Molecular design
- Molecular dynamics simulations
ASJC Scopus subject areas
- Environmental Engineering
- General Computer Science
- Materials Science (miscellaneous)
- General Chemical Engineering
- Energy Engineering and Power Technology
- General Engineering