Download citation
Download citation
link to html
Machine learning was employed on the experimental crystal structures of the Cambridge Structural Database (CSD) to derive an intermolecular force field for all available types of atoms (general force field). The obtained pairwise interatomic potentials of the general force field allow for the fast and accurate calculation of intermolecular Gibbs energy. The approach is based on three postulates regarding Gibbs energy: the lattice energy must be below zero, the crystal structure must be a local minimum, and, if available, the experimental and the calculated lattice energy must coincide. The parametrized general force field was then validated regarding these three conditions. First, the experimental lattice energy was compared with the calculated energies. The observed errors were found to be in the order of experimental errors. Second, Gibbs lattice energy was calculated for all structures available in the CSD. Their energy values were found to be below zero in 99.86% of the cases. Finally, 500 random structures were minimized, and the change in density and energy was examined. The mean error in the case of density was below 4.06%, and for energy it was below 5.7%. The obtained general force field calculated Gibbs lattice energies of 259 041 known crystal structures within a few hours. Since Gibbs energy defines the reaction energy, the calculated energy can be used to predict chemical–physical properties of crystals, for instance, the formation of co-crystals, polymorph stability and solubility.

Supporting information

ods

Open Spreadsheet Text (ODS) file https://doi.org/10.1107/S2053273323000268/lu5022sup1.ods
The minimal observed van der Waals distances.

ods

Open Spreadsheet Text (ODS) file https://doi.org/10.1107/S2053273323000268/lu5022sup2.ods
Critical bond distances. Atom pairs with distances shorter than these values are considered as bonded (see Section 2.2).

ods

Open Spreadsheet Text (ODS) file https://doi.org/10.1107/S2053273323000268/lu5022sup3.ods
Trained potential parameters. The table contains the four parameters for any atom pair interaction (see equation 4). Atom pair combinations not occurring in the CSD are marked in grey, atom pair combinations occurring rarely and for this reason set to a default potential are marked in orange.

tif

Tagged Image Format File (TIF) image https://doi.org/10.1107/S2053273323000268/lu5022sup4.tif
ConQuest filters used to select the training set.


Follow Acta Cryst. A
Sign up for e-alerts
Follow Acta Cryst. on Twitter
Follow us on facebook
Sign up for RSS feeds