Simulating Tandem Mass Spectra for Small Molecules using a General-Purpose Large-Language Model

Tuan Nguyen,D. Barupal

Published 2025 in bioRxiv

ABSTRACT

We show a practical application of the Google Gemini large-language-model for simulating tandem mass spectra for compounds from the Blood Exposome Database. This approach bypasses the need for domain-specific model training, suggesting that the chemical fragmentation knowledge could be latently encoded within the Gemini model. General-purpose LLMs represent a useful and accessible tool for expanding in-silico spectral libraries and may accelerate the compound annotation in mass spectrometry-based metabolomics and exposomics.

PUBLICATION RECORD

CITATION MAP

EXTRACTION MAP

CLAIMS

  • No claims are published for this paper.

CONCEPTS

  • No concepts are published for this paper.

REFERENCES

Showing 1-17 of 17 references · Page 1 of 1

CITED BY

  • No citing papers are available for this paper.

Showing 0-0 of 0 citing papers · Page 1 of 1