TY - JOUR
T1 - Machine learning-driven prediction of deep eutectic solvents’ heat capacity for sustainable process design
AU - Halder, Amit Kumar
AU - Haghbakhsh, Reza
AU - Ferreira, Elisabete S. C.
AU - Duarte, Ana Rita C.
AU - Cordeiro, M. Natália D. S.
N1 - info:eu-repo/grantAgreement/FCT/Concurso de avaliação no âmbito do Programa Plurianual de Financiamento de Unidades de I&D (2017%2F2018) - Financiamento Base/UIDB%2F50006%2F2020/PT#
info:eu-repo/grantAgreement/FCT/Concurso para Atribuição do Estatuto e Financiamento de Laboratórios Associados (LA)/LA%2FP%2F0008%2F2020/PT#
info:eu-repo/grantAgreement/FCT/Concurso de avaliação no âmbito do Programa Plurianual de Financiamento de Unidades de I&D (2017%2F2018) - Financiamento Programático/UIDP%2F50006%2F2020/PT#
info:eu-repo/grantAgreement/FCT/Concurso de avaliação no âmbito do Programa Plurianual de Financiamento de Unidades de I&D (2017%2F2018) - Financiamento Base/UIDB%2F50006%2F2020/PT#
info:eu-repo/grantAgreement/FCT/CEEC IND 3ed/2020.01423.CEECIND%2FCP1596%2FCT0003/PT#
info:eu-repo/grantAgreement/FCT/CEEC IND5ed/2022.05803.CEECIND%2FCP1725%2FCT0003/PT#
Funding Information:
This work received financial support from FCT/MCTES (UIDB/50006/2020 DOI 10.54499/UIDB/50006/2020) through national funds. This work received financial support from FCT/MCTES (LA/P/0008/2020 DOI 10.54499/LA/P/0008/2020, UIDP/50006/2020 DOI 10.54499/UIDP/50006/2020, and UIDB/50006/2020 DOI 10.54499/UIDB/50006/2020), through national funds. The authors also acknowledge FCT by funding CEEC projects 2020.01423.CEECIND/CP1596/CT0003 and 10.54499/2022.05803.CEECIND/CP1725/CT0003.
Publisher Copyright:
© 2024 The Author(s)
PY - 2025/1/15
Y1 - 2025/1/15
N2 - Heat capacity, a crucial physical property for chemical processes, is often understudied in Deep Eutectic Solvents (DESs), which in turn are promising green alternatives to environmentally hazardous conventional solvents. This work addresses this gap by developing a machine learning model to predict DES heat capacity and identify key structural features influencing it. We employed a dataset of 530 DESs with corresponding experimental heat capacity values. Quantum-chemical COSMO-RS-based descriptors, capturing detailed information about DES structures, were calculated for each data point. Various machine learning algorithms, namely k-Nearest Neighbours (kNN), Random Forests (RF), Neural Network Multilayer Perceptron (MLP), and Support Vector Machines (SVM) were explored alongside a linear model (Multiple Linear Regression, MLR). Hyperparameter optimisation ensured all models were fine-tuned for optimal performance. The most successful model, based on the MLP technique, achieved remarkably low Average Absolute Relative Deviation (AARD) values of 0.500 % and 3.999 % for the training and test sets, respectively. This signifies a significant improvement in prediction accuracy compared to traditional methods. Furthermore, by applying a SHapley Additive exPlanations (SHAP) analysis, we identified the most crucial structural factors within DES components that govern their heat capacity. This comprehensive investigation offers valuable insights that can pave the way for an efficient design of novel DESs in the future.
AB - Heat capacity, a crucial physical property for chemical processes, is often understudied in Deep Eutectic Solvents (DESs), which in turn are promising green alternatives to environmentally hazardous conventional solvents. This work addresses this gap by developing a machine learning model to predict DES heat capacity and identify key structural features influencing it. We employed a dataset of 530 DESs with corresponding experimental heat capacity values. Quantum-chemical COSMO-RS-based descriptors, capturing detailed information about DES structures, were calculated for each data point. Various machine learning algorithms, namely k-Nearest Neighbours (kNN), Random Forests (RF), Neural Network Multilayer Perceptron (MLP), and Support Vector Machines (SVM) were explored alongside a linear model (Multiple Linear Regression, MLR). Hyperparameter optimisation ensured all models were fine-tuned for optimal performance. The most successful model, based on the MLP technique, achieved remarkably low Average Absolute Relative Deviation (AARD) values of 0.500 % and 3.999 % for the training and test sets, respectively. This signifies a significant improvement in prediction accuracy compared to traditional methods. Furthermore, by applying a SHapley Additive exPlanations (SHAP) analysis, we identified the most crucial structural factors within DES components that govern their heat capacity. This comprehensive investigation offers valuable insights that can pave the way for an efficient design of novel DESs in the future.
KW - COSMO-RS
KW - Deep eutectic solvents
KW - Heat capacity prediction
KW - Machine learning
KW - Thermodynamic modelling
UR - https://www.scopus.com/pages/publications/85211970998
U2 - 10.1016/j.molliq.2024.126707
DO - 10.1016/j.molliq.2024.126707
M3 - Article
AN - SCOPUS:85211970998
SN - 0167-7322
VL - 418
SP - 1
EP - 10
JO - Journal of Molecular Liquids
JF - Journal of Molecular Liquids
M1 - 126707
ER -