(449g) A Systematic Procedure for Designing Training Data for Molecular Property Prediction

Conference

AIChE Annual Meeting

Year

2018

Proceeding

2018 AIChE Annual Meeting

Group

Engineering Sciences and Fundamentals

Session

Data-Driven Screening of Chemical and Materials Space

Time

Wednesday, October 31, 2018 - 9:30am to 9:45am

Authors

Li, B. - Presenter, Lehigh University

Rangarajan, S., Lehigh University - Dept of Chem & Biomolecular

Organic material design requires thoroughly exploring chemical compound space to obtain the desired property information. The astronomical size of molecular space, however, makes it impossible to evaluate every molecule using experiments or quantum chemistry; consequently, data-driven semi-empirical models are required to calculate molecular properties rapidly. To this end, the focus of this talk is two-fold. First, we show that the group contribution approach can be generalized using a combination of cheminformatics-based path fingerprints and sparse modeling techniques to derive an ab initio data-driven model that accurately estimates heats of atomization of small organic molecules, reaching mean absolute errors of 1.59 kcal/mol and 2.61 kcal/mol on the QM7 and QM9 datasets, respectively, excluding molecules with fused rings. Further, we show that modern experimental design tools and cheminformatics-based subset selection techniques can be combined to systematically minimize the amount of data needed to train a model. We specifically show that, given a set of molecules for which data-driven models are sought, a carefully chosen subset of as little as 2-10% of the original dataset is sufficient to train a model that is as accurate as one trained on >80% of the data.
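The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which sparse modeling technique or experimental design tool was used, so the sketch assumes a coordinate-descent LASSO for the sparse model and a greedy D-optimal design for the subset selection. The feature matrix is synthetic (random counts standing in for path-fingerprint counts), and the contribution values, the sizes `n`, `p`, `k`, and the function names `lasso_cd` and `greedy_d_optimal` are all hypothetical.

```python
import numpy as np

def lasso_cd(X, y, alpha, n_iter=200):
    """Coordinate-descent LASSO: minimize (1/2n)||y - Xw||^2 + alpha*||w||_1."""
    n, p = X.shape
    w = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    resid = y - X @ w
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of feature j with the residual that excludes feature j.
            rho = X[:, j] @ resid / n + col_sq[j] * w[j]
            # Soft-thresholding drives small contributions exactly to zero.
            w_new = np.sign(rho) * max(abs(rho) - alpha, 0.0) / col_sq[j]
            resid += X[:, j] * (w[j] - w_new)
            w[j] = w_new
    return w

def greedy_d_optimal(X, k, ridge=1e-3):
    """Greedily pick k rows that maximize det(X_S^T X_S + ridge*I)."""
    n, p = X.shape
    A_inv = np.eye(p) / ridge          # inverse of the regularized information matrix
    available = np.ones(n, dtype=bool)
    chosen = []
    for _ in range(k):
        # Adding row x multiplies the determinant by (1 + x^T A^-1 x).
        gain = np.einsum("ij,jk,ik->i", X, A_inv, X)
        gain[~available] = -np.inf
        i = int(np.argmax(gain))
        chosen.append(i)
        available[i] = False
        # Sherman-Morrison rank-one update of the inverse.
        v = A_inv @ X[i]
        A_inv -= np.outer(v, v) / (1.0 + X[i] @ v)
    return chosen

# Synthetic stand-in data (assumption): rows are molecules, columns are
# counts of hypothetical path fragments; only a few fragments contribute.
rng = np.random.default_rng(0)
n, p, k = 600, 20, 60                  # k is 10% of the candidate molecules
X = rng.poisson(1.0, size=(n, p)).astype(float)
w_true = np.zeros(p)
w_true[:5] = [-70.0, -100.0, -40.0, 25.0, 60.0]   # made-up contributions
y = X @ w_true + rng.normal(0.0, 1.0, size=n)

# Choose the training molecules from features alone, then "label" only those.
chosen = greedy_d_optimal(X, k)
rest = np.setdiff1d(np.arange(n), chosen)
w_hat = lasso_cd(X[chosen], y[chosen], alpha=0.5)

# Compare against a trivial baseline that predicts the training mean.
mae_design = np.mean(np.abs(X[rest] @ w_hat - y[rest]))
mae_baseline = np.mean(np.abs(y[chosen].mean() - y[rest]))
print(f"held-out MAE, designed 10% subset: {mae_design:.2f}")
print(f"held-out MAE, mean predictor:      {mae_baseline:.2f}")
print("nonzero contributions:", np.count_nonzero(w_hat), "of", p)
```

The selection step uses only the feature matrix, never the property values, which is what allows the training set to be designed before any expensive calculation is run. With real molecules, the synthetic `X` would be replaced by fingerprint counts from a cheminformatics toolkit.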

Topics

Computational Molecular Engineering
