A template-based automatic fragmentation algorithm for complex and large systems in the generalized energy-based fragmentation framework.

Xuerong Wang,Junhui Sun,Linke He,Jin Wen,Jianyi Wang,Wei Li,Shuhua Li

Published 2026 in Journal of Chemical Physics

ABSTRACT

A major bottleneck in low-scaling energy-based fragmentation methods is the need for manual intervention in the fragmentation step, which is time-consuming, inconsistent, and hard to generalize across diverse molecular systems. To address this challenge, we develop a template-based automatic fragmentation algorithm that extends the generalized energy-based fragmentation (GEBF) approach to a wide range of large and complex molecules. A hierarchical SMILES-encoded GEBF template library for both cyclic and acyclic functional groups enables chemically meaningful and efficient partitioning via structure conversion, macrocycle detection, substructure matching, and small-fragment merging. Controlling fragment sizes ensures a balance between accuracy and computational cost, while user-defined templates offer enhanced flexibility. Benchmarks on biomacromolecules, macrocycles, porous organic cages, polyamide oligomers, and ionic liquids reproduce conventional quantum-chemistry results within a few kcal · mol-1 (or sub-meV/atom), while reducing the largest subsystem basis size to less than one-third of the full system. The accuracy of the GEBF forces is further validated, enabling reliable geometry optimizations and spectroscopic predictions with near-experimental agreement. Large polyamide oligomers with ≈1500 atoms can be computed within practical timeframes. The method also predicts reaction barriers and reaction energies for enzyme-catalyzed reactions at the level of electron correlation. This work paves the way for fully automated, scalable, low-cost, high-accuracy quantum chemistry, bridging theory and large-scale real-world applications.

PUBLICATION RECORD

CITATION MAP

EXTRACTION MAP

CLAIMS

  • No claims are published for this paper.

CONCEPTS

  • No concepts are published for this paper.

REFERENCES

Showing 1-76 of 76 references · Page 1 of 1

CITED BY

  • No citing papers are available for this paper.

Showing 0-0 of 0 citing papers · Page 1 of 1