TY - JOUR
T1 - QSPR Modeling of Liquid-liquid Equilibria in Two-phase Systems of Water and Ionic Liquid
AU - Klimenko, Kyrylo Oleksandrovych
AU - Inês, João Miguel
AU - Esperança, José Manuel Silva Simões
AU - Rebelo, Luís Paulo Nieto
AU - Aires-de-Sousa, João
AU - Carrera, Gonçalo Valente Silva Mariño
N1 - UIDB/50006/2020
PY - 2020/9
Y1 - 2020/9
N2 - The increasing application of new ionic liquids (IL) creates the need of liquid-liquid equilibria data for both miscible and quasi-immiscible systems. In this study, equilibrium concentrations at different temperatures for ionic liquid+water two-phase systems were modeled using a Quantitative-Structure-Property Relationship (QSPR) method. Data on equilibrium concentrations were taken from the ILThermo Ionic Liquids database, curated and used to make models that predict the weight fraction of water in ionic liquid rich phase and ionic liquid in the aqueous phase as two separate properties. The major modeling challenge stems from the fact that each single IL is characterized by several data points, since equilibrium concentrations are temperature dependent. Thus, new approaches for the detection of potential data point outliers, testing set selection, and quality prediction have been developed. Training set comprised equilibrium concentration data for 67 and 68 ILs in case of water in IL and IL in water modeling, respectively. SiRMS, MOLMAPS, Rcdk and Chemaxon descriptors were used to build Random Forest models for both properties. Models were subjected to the Y-scrambling test for robustness assessment. The best models have also been validated using an external test set that is not part of the ILThermo database. A two-phase equilibrium diagram for one of the external test set IL is presented for better visualization of the results and potential derivation of tie lines.
AB - The increasing application of new ionic liquids (IL) creates the need of liquid-liquid equilibria data for both miscible and quasi-immiscible systems. In this study, equilibrium concentrations at different temperatures for ionic liquid+water two-phase systems were modeled using a Quantitative-Structure-Property Relationship (QSPR) method. Data on equilibrium concentrations were taken from the ILThermo Ionic Liquids database, curated and used to make models that predict the weight fraction of water in ionic liquid rich phase and ionic liquid in the aqueous phase as two separate properties. The major modeling challenge stems from the fact that each single IL is characterized by several data points, since equilibrium concentrations are temperature dependent. Thus, new approaches for the detection of potential data point outliers, testing set selection, and quality prediction have been developed. Training set comprised equilibrium concentration data for 67 and 68 ILs in case of water in IL and IL in water modeling, respectively. SiRMS, MOLMAPS, Rcdk and Chemaxon descriptors were used to build Random Forest models for both properties. Models were subjected to the Y-scrambling test for robustness assessment. The best models have also been validated using an external test set that is not part of the ILThermo database. A two-phase equilibrium diagram for one of the external test set IL is presented for better visualization of the results and potential derivation of tie lines.
KW - ILThermo
KW - Ionic liquids
KW - outlier detection
KW - phase diagrams
UR - http://www.scopus.com/inward/record.url?scp=85085555789&partnerID=8YFLogxK
U2 - 10.1002/minf.202000001
DO - 10.1002/minf.202000001
M3 - Article
C2 - 32469147
AN - SCOPUS:85085555789
SN - 1868-1743
VL - 39
JO - Molecular Informatics
JF - Molecular Informatics
IS - 9
M1 - 2000001
ER -