The "bag-of-frames" (BOF) approach, which encodes audio signals as the long-term statistical distribution of short-term spectral features, is commonly regarded as an effective and sufficient way to represent environmental sound recordings (soundscapes). The present paper describes a conceptual replication of a use of the BOF approach in a seminal article using several other soundscape datasets, with results strongly questioning the adequacy of the BOF approach for the task. As demonstrated in this paper, the good accuracy originally reported with BOF likely resulted from a particularly permissive dataset with low within-class variability. Soundscape modeling, therefore, may not be the closed case it was once thought to be.
The bag-of-frames approach: a not so sufficient model for urban soundscapes
M. Lagrange,G. Lafay,Boris Defreville,J. Aucouturier
Published 2014 in Journal of the Acoustical Society of America
ABSTRACT
PUBLICATION RECORD
- Publication year
2014
- Venue
Journal of the Acoustical Society of America
- Publication date
2014-12-11
- Fields of study
Physics, Computer Science, Mathematics, Engineering, Environmental Science, Medicine
- Identifiers
- External record
- Source metadata
Semantic Scholar, PubMed
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-21 of 21 references · Page 1 of 1
CITED BY
Showing 1-26 of 26 citing papers · Page 1 of 1