Olfaction—how molecules are perceived as odors to humans—is a relatively less understood sensory system compared to vision or hearing. Recently, the principal odor map (POM) was introduced to digitize the olfactory properties of single compounds. However, smells in real life are not pure single molecules, but complex mixtures of molecules, whose representations remain relatively under-explored due to limited data in olfactory mixtures. We introduce POMMix, a mixture model extension of POM which leverages mono-molecular olfactory data to build meaningful mixture representations of smells. Our model builds upon the symmetries of the problem space in a hierarchical manner: (1) graph neural networks for building mono-molecular embeddings, (2) attention mechanisms for aggregating molecular representations into mixture representations, and (3) cosine prediction heads to encode olfactory perceptual distance in the mixture embedding space. POMMix achieves state-of-the-art performance across multiple datasets. We perform comprehensive ablation studies of the components of POMMix to understand the contribution of each component. We evaluate the generalizability of the model, explore olfactory phenomena with the representations, and analyze the interpretability of the representations. Our work advances the effort to digitize olfaction, highlighting the synergy of domain expertise and deep learning in crafting mixture representations in low-data regimes.
Does this smell the same? Learning representations of olfactory mixtures using inductive biases
Gary Tom,Cher Tian Ser,Ella M Rajaonson,Stanley Lo,Hyun Suk Park,Brian K. Lee,Benjamín Sánchez-Lengeling
Published 2025 in Machine Learning: Science and Technology
ABSTRACT
PUBLICATION RECORD
- Publication year
2025
- Venue
Machine Learning: Science and Technology
- Publication date
2025-08-27
- Fields of study
Physics, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-82 of 82 references · Page 1 of 1