(611a) Open Chemistry and Jupyter: Platform for Data Mining and Machine Learning
AIChE Annual Meeting
2018
Computational Molecular Science and Engineering Forum
Data Mining and Machine Learning in Molecular Sciences II
Thursday, November 1, 2018 - 8:00am to 8:15am
Extending Python software kernels and the web interface with chemistry-specific capabilities yields a powerful platform for working with chemical data. The open source platform will be described, along with its links to a number of community codes from computational chemistry, machine learning, and the emerging field that combines the two. The project is being developed in the open, with robust interfaces that facilitate the addition of new techniques. The core of the platform is a chemically aware data server, coupled with job submission and data analytics capabilities.
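As a rough illustration of what a chemically aware data server coupled with job submission might look like, the sketch below uses a small in-memory stand-in. It is not the platform's actual API: the names `DataServer`, `Molecule`, `Job`, `parse_formula`, `search_by_element`, `submit_job`, and `run_pending` are all hypothetical, and the "calculation" is a trivial atom count standing in for a real batch job.

```python
import itertools
import re
from dataclasses import dataclass, field
from typing import Dict, List


def parse_formula(formula: str) -> Dict[str, int]:
    """Split a simple formula such as 'C2H6O' into per-element atom counts."""
    counts: Dict[str, int] = {}
    for symbol, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[symbol] = counts.get(symbol, 0) + (int(number) if number else 1)
    return counts


@dataclass
class Molecule:
    name: str
    formula: str
    properties: Dict[str, float] = field(default_factory=dict)


@dataclass
class Job:
    job_id: int
    molecule_name: str
    status: str = "queued"


class DataServer:
    """Hypothetical in-memory stand-in for a chemically aware data server."""

    def __init__(self) -> None:
        self._molecules: Dict[str, Molecule] = {}
        self._jobs: List[Job] = []
        self._next_id = itertools.count(1)

    def add(self, molecule: Molecule) -> None:
        self._molecules[molecule.name] = molecule

    def get(self, name: str) -> Molecule:
        return self._molecules[name]

    def search_by_element(self, symbol: str) -> List[str]:
        # Match on parsed element symbols, so "C" does not match "NaCl".
        return sorted(
            name
            for name, mol in self._molecules.items()
            if symbol in parse_formula(mol.formula)
        )

    def submit_job(self, molecule_name: str) -> Job:
        if molecule_name not in self._molecules:
            raise KeyError(molecule_name)
        job = Job(next(self._next_id), molecule_name)
        self._jobs.append(job)
        return job

    def run_pending(self) -> int:
        """Run queued jobs; the atom count is a placeholder for a real calculation."""
        completed = 0
        for job in self._jobs:
            if job.status != "queued":
                continue
            mol = self._molecules[job.molecule_name]
            mol.properties["atom_count"] = sum(parse_formula(mol.formula).values())
            job.status = "done"
            completed += 1
        return completed


server = DataServer()
server.add(Molecule("water", "H2O"))
server.add(Molecule("ethanol", "C2H6O"))
server.add(Molecule("sodium chloride", "NaCl"))

carbon_hits = server.search_by_element("C")
job = server.submit_job("ethanol")
completed = server.run_pending()
print(carbon_hits, job.status, server.get("ethanol").properties)
```

The point of the sketch is the division of labor: search operates on parsed chemical content instead of raw strings, and job results are written back to the same record that analytics later reads.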
The platform's use of industry-standard web programming interfaces, data formats, and modern HTML5 web components will be described, as will the next-generation JupyterLab interface, its extension, and the integration of batch scheduling, search, and analytics. All major components, including 3D visualization, analysis, and code execution, will work in any modern web browser. Python kernels offer an ideal environment for analysis because they reuse the tools already available in that ecosystem, including its significant investments in data mining and machine learning. A companion HTML5 application offers a simpler view of results that can be shared more widely, with full access to the data and links to the notebooks that produced them.
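To give a flavor of the kind of analysis a Python kernel makes convenient, the following minimal sketch loads a small JSON payload and fits a straight line by ordinary least squares. The record layout (`name`, `carbons`, `molar_mass`) is a hypothetical example and not a format defined by the platform; the molar masses are standard values in g/mol, rounded to two decimals. Only the standard library is used here, although in practice a notebook would more likely reach for the established data mining and machine learning packages.

```python
import json
from statistics import mean

# Hypothetical payload, shaped like something a data server might return.
payload = """
[
  {"name": "methane", "carbons": 1, "molar_mass": 16.04},
  {"name": "ethane",  "carbons": 2, "molar_mass": 30.07},
  {"name": "propane", "carbons": 3, "molar_mass": 44.10},
  {"name": "butane",  "carbons": 4, "molar_mass": 58.12}
]
"""

records = json.loads(payload)
x = [r["carbons"] for r in records]
y = [r["molar_mass"] for r in records]

# Ordinary least squares for a single predictor: y = slope * x + intercept.
x_mean, y_mean = mean(x), mean(y)
slope = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y)) / sum(
    (xi - x_mean) ** 2 for xi in x
)
intercept = y_mean - slope * x_mean

print(f"slope = {slope:.3f} g/mol per carbon, intercept = {intercept:.3f} g/mol")
```

The fitted slope comes out close to 14 g/mol, the mass of one CH2 unit, which is the sort of sanity check that is easy to run interactively next to the data.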