(64e) Prediction of Organic Compound Aqueous Solubility Using Interpretable Machine Learning-a Comparison Study of Descriptor-Based and Topological Models
AIChE Annual Meeting
2023
2023 AIChE Annual Meeting
Materials Engineering and Sciences Division
Poster Session: Materials Engineering & Sciences (08F - Composite Materials)
Tuesday, November 7, 2023 - 3:30pm to 5:00pm
Our results were compared to a blind, low molecular database consists of 32 low organic molecules with the number of C atoms ranging from 1 to 12 with specified aqueous solubility experiments, revealing that using a fingerprint method has a lower average absolute calculation error (â¼0.25 log units), which is comparable to other group contribution methods currently available. The average uncertainty in measured aqueous solubility for organic molecules represented â¼0.6 log units or higher, when the solubility values were gathered from various published sources. We also gained insight into how important features impact an ML's output using SHAP analysis and calculated Gibbs energies for these features to investigate their thermodynamic favorability. The fingerprint method can predict the aqueous solubility with low error, and its interpretability ability will distinguish it from the other currently available methods.