We show a practical application of the Google Gemini large-language-model for simulating tandem mass spectra for compounds from the Blood Exposome Database. This approach bypasses the need for domain-specific model training, suggesting that the chemical fragmentation knowledge could be latently encoded within the Gemini model. General-purpose LLMs represent a useful and accessible tool for expanding in-silico spectral libraries and may accelerate the compound annotation in mass spectrometry-based metabolomics and exposomics.
Simulating Tandem Mass Spectra for Small Molecules using a General-Purpose Large-Language Model
Published 2025 in bioRxiv
ABSTRACT
PUBLICATION RECORD
- Publication year
2025
- Venue
bioRxiv
- Publication date
2025-11-11
- Fields of study
Biology, Medicine, Chemistry, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar, PubMed
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-17 of 17 references · Page 1 of 1
CITED BY
- No citing papers are available for this paper.
Showing 0-0 of 0 citing papers · Page 1 of 1