(606c) Rational Reduction of Computationally Predicted Crystal Energy Landscapes Using Molecular Dynamics and Enhanced Sampling Techniques.
AIChE Annual Meeting
2022
2022 Annual Meeting
Pharmaceutical Discovery, Development and Manufacturing Forum
Computational Solid State Pharmaceutics I
Thursday, November 17, 2022 - 8:50am to 9:15am
While successful predictions of the most stable polymorphs of organic crystals based on accurate lattice (free) energy estimates are becoming increasingly common, the number of predicted putative polymorphs remains a standing issue.
Typically, the overall number of potential structures computationally identified as putative polymorphs usually grossly exceeds the number of polymorphs observed experimentally [1]. This problem in the field is typically referred to as "overprediction".
A significant reason for CSP methods over-predicting possible polymorphs is that temperature increases molecular motions so that some structures that appear as distinct lattice energy minima merge into the same free energy minimum at finite temperature [2].
This work discusses applying a physics-driven computational procedure [3,4] to reduce the number of relevant local minima of crystal energy landscapes based on a systematic and scalable application of molecular dynamics and enhanced sampling simulations.
In order to identify persistent crystal structures, we perform classical molecular dynamics simulations at finite temperature on CSP-generated crystal structures. Unstable configurations are then automatically removed by comparing the distribution of internal orientations against the random arrangement typical of a disordered melted state.
To identify crystal structures that convert to a common finite-temperature state, we carry out a clustering analysis based on probabilistic fingerprints that capture information on the relative position, orientation, and conformation of molecules within a dynamic crystal supercell. Differences in the probabilistic fingerprints concur to define a dissimilarity metric between finite-temperature structures, which is then used to perform unsupervised clustering of finite temperature structures.
In this contribution, we discuss the application of this method to systems of increasing size and complexity (urea, succinic acid, ibuprofen, olanzapine and others), spanning from a few dozens to thousands of structures. We note that, in all cases, we substantially reduce the crystal energy landscapes while consistently retaining the experimentally observed crystal structures.
Developing a Python library [5] that manages the setup of MD simulations and automatically analyses the resulting trajectories has been instrumental in achieving scalability over a large set of crystal structures, enabling us to apply MD workflows to sets of crystal structures approaching the size and complexity of real-world CSP applications.
References
[1] DA Bardwell, CS Adjiman, YA Arnautova, E Bartashevich, SXM Boerrigter, et al., 2011, Acta Cryst. B 67:535-55
[2] Price SL. 2013, Acta Cryst. B. 69:313-28.
[3] Francia, N.F., Price, L.S., Nyman, J., Price, S.L. and Salvalaglio, M., 2020. Crystal Growth & Design, 20(10), pp.6847-6862.
[4] Francia, N.F., Price, L.S. and Salvalaglio, M., 2021, CrystEngComm, 23(33), pp.5575-5584.
[5] https://github.com/mme-ucl/pypol
Figure: Reduction of the crystal energy landscape of Urea. In the outer circle the lattice energy minima, while in the inner circle the 12 structures that survived the reduction process. In light grey the structures that melt while in different colours those that coalesce to the same crystal structure.