Cortical Representations of Speech in a Multitalker Auditory Scene

Published 2017 in Journal of Neuroscience

ABSTRACT

The ability to parse a complex auditory scene into perceptual objects is facilitated by a hierarchical auditory system. Successive stages in the hierarchy transform an auditory scene of multiple overlapping sources, from peripheral tonotopically based representations in the auditory nerve, into perceptually distinct auditory-object-based representations in the auditory cortex. Here, using magnetoencephalography recordings from men and women, we investigate how a complex acoustic scene consisting of multiple speech sources is represented in distinct hierarchical stages of the auditory cortex. Using systems-theoretic methods of stimulus reconstruction, we show that the primary-like areas in the auditory cortex contain dominantly spectrotemporal-based representations of the entire auditory scene. Here, both attended and ignored speech streams are represented with almost equal fidelity, and a global representation of the full auditory scene with all its streams is a better candidate neural representation than that of individual streams being represented separately. We also show that higher-order auditory cortical areas, by contrast, represent the attended stream separately and with significantly higher fidelity than unattended streams. Furthermore, the unattended background streams are more faithfully represented as a single unsegregated background object rather than as separated objects. Together, these findings demonstrate the progression of the representations and processing of a complex acoustic scene up through the hierarchy of the human auditory cortex. SIGNIFICANCE STATEMENT Using magnetoencephalography recordings from human listeners in a simulated cocktail party environment, we investigate how a complex acoustic scene consisting of multiple speech sources is represented in separate hierarchical stages of the auditory cortex. We show that the primary-like areas in the auditory cortex use a dominantly spectrotemporal-based representation of the entire auditory scene, with both attended and unattended speech streams represented with almost equal fidelity. We also show that higher-order auditory cortical areas, by contrast, represent an attended speech stream separately from, and with significantly higher fidelity than, unattended speech streams. Furthermore, the unattended background streams are represented as a single undivided background object rather than as distinct background objects.

PUBLICATION RECORD

Publication year
2017
Venue
Journal of Neuroscience
Publication date
2017-04-10
Fields of study
Biology, Medicine, Computer Science, Psychology
Identifiers
DOI 10.1523/JNEUROSCI.0938-17.2017 PMID 28821680 PMCID PMC5607465
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Auditory Scene Analysis: The Perceptual Organization of Sound by Albert Bregman (review)
2016cited by this paper
Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing.
2015cited by this paper
Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG.
2015cited by this paper
The cortical analysis of speech-specific temporal structure revealed by responses to sound quilts
2015cited by this paper
Functional organization of human auditory cortex: Investigation of response latencies through direct recordings
2014cited by this paper
Joint decorrelation, a versatile tool for multichannel data analysis
2014cited by this paper
Differential activation of human core, non-core and auditory-related cortex during speech categorization tasks as revealed by intracranial recordings
2014cited by this paper
Visual Input Enhances Selective Speech Envelope Tracking in Auditory Cortex at a “Cocktail Party”
2013cited by this paper
Mechanisms underlying selective neuronal tracking of attended speech at a "cocktail party".
2013cited by this paper
Adaptive Temporal Encoding Leads to a Background-Insensitive Cortical Representation of Speech
2013cited by this paper
Selective cortical representation of attended speaker in multi-talker speech perception
2012cited by this paper
Reconstructing Speech from Human Auditory Cortex
2012cited by this paper
Temporal context in speech processing and attentional stream selection: a behavioral and neural perspective.
2012cited by this paper
Emergence of neural encoding of auditory objects while listening to competing speakers
2012cited by this paper
At what time is the cocktail party? A late locus of selective attention to natural speech
2012cited by this paper
A Precluding But Not Ensuring Role of Entrained Low-Frequency Oscillations for Auditory Perception
2012cited by this paper
Neural coding of continuous speech in auditory cortex during monaural and dichotic listening.
2012cited by this paper
Attention-driven auditory cortex short-term plasticity helps segregate relevant sounds from noise
2011cited by this paper
Auditory Evoked Potentials and Their Utility in the Assessment of Complex Sound Processing
2011cited by this paper
Sound Processing Hierarchy within Human Auditory Cortex
2011cited by this paper
Temporal coherence and attention in auditory scene analysis.
2011cited by this paper
Attentional Gain Control of Ongoing Cortical Speech Representations in a “Cocktail Party”
2010cited by this paper
Hierarchical Processing for Speech in Human Auditory Cortex and Beyond
2010cited by this paper
Categorical Speech Representation in Human Superior Temporal Gyrus
2010cited by this paper
Hierarchical organization of human auditory cortex: evidence from acoustic invariance in the response to intelligible speech.
2010cited by this paper
Bayesian t tests for accepting and rejecting the null hypothesis
2009cited by this paper
Resolving precise temporal processing properties of the auditory system using continuous stimuli.
2009cited by this paper
Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing
2009cited by this paper
Temporal Envelope of Time-Compressed Speech Represented in the Human Auditory Cortex
2009cited by this paper
Low-frequency neuronal oscillations as instruments of sensory selection.
2009cited by this paper
Influence of context and behavior on stimulus reconstruction from neural activity in primary auditory cortex.
2009cited by this paper
The cocktail party problem.
2009cited by this paper
Speech perception at the interface of neurobiology and linguistics
2008cited by this paper
on the Recognition of Speech, with
2008cited by this paper
Neurons and Objects: The Case of Auditory Cortex
2008cited by this paper
Object-based auditory and visual attention.
2008cited by this paper
Estimating sparse spectro-temporal receptive fields with natural stimuli
2007cited by this paper
Denoising based on time-shift PCA.
2007cited by this paper
The cortical organization of speech processing
2007cited by this paper
Source analysis of auditory evoked potentials in patients with cochlear implants
2007cited by this paper
Genomic and evolutionary analyses of asymmetrically expressed genes in human fetal left and right cerebral cortex.
2006cited by this paper
Separation of Nonlinear Image Mixtures by Denoising Source Separation
2006cited by this paper
Vision as Bayesian inference: analysis by synthesis?
2006cited by this paper
Auditory Evoked Potentials: Basic Principles and Clinical Application
2006cited by this paper
Serial and parallel processing in the human auditory cortex: a magnetoencephalographic study.
2006cited by this paper
Attentional modulation of electrophysiological activity in auditory cortex for unattended sounds within multistream auditory environments
2005cited by this paper
Multiresolution spectrotemporal analysis of complex sounds.
2005cited by this paper
Mapping auditory core, lateral belt, and parabelt cortices in the human superior temporal gyrus
2005cited by this paper
How the brain separates sounds.
2004cited by this paper
What is an auditory object?
2004cited by this paper
Rapid task-related plasticity of spectrotemporal receptive fields in primary auditory cortex
2003cited by this paper
Serial and Parallel
2003cited by this paper
Hierarchical Processing in Spoken Language Comprehension
2003cited by this paper
The planum temporale as a computational hub.
2002cited by this paper
Subdivisions of auditory cortex and processing streams in primates.
2000cited by this paper
Frequency and intensity response properties of single neurons in the auditory cortex of the behaving macaque monkey.
2000cited by this paper
Auditory evoked potentials.
1991influential reference
The Auditory Scene. (Book Reviews: Auditory Scene Analysis. The Perceptual Organization of Sound.)
1990cited by this paper
Strathprints Institutional Repository Rhythmic Auditory Cortex Activity at Multiple Timescales Shapes Stimulus–response Gain and Background Firing
year unknowncited by this paper
Journal of Machine Learning Research Submitted 03/04; Revised Denoising Source Separation
year unknowncited by this paper
Journal of Neuroscience Methods Denoising Based on Spatial Filtering
year unknowncited by this paper

CITED BY

Reduced neural speech tracking in adolescents with listening difficulty.
2026cites this paper
Infant cortical tracking of speech shows emerging spatial release from masking in the first year of life
2026cites this paper
A New Perspective on the Speaker Identity Variability Effect
2025cites this paper
Reduced Neural Speech Tracking in Adolescents with Listening Difficulty
2025cites this paper
Optimized feature gains explain and predict successes and failures of human selective listening
2025cites this paper
Attention, musicality, and familiarity shape cortical speech tracking at the musical cocktail party.
2025cites this paper
EEG responses to onset-edge and steady-state segments of continuous speech under selective auditory attention modulation.
2025cites this paper
Attention Decoding at the Cocktail Party: Preserved in Hearing Aid Users, Reduced in Cochlear Implant Users
2025cites this paper
Original speech and its echo are segregated and separately processed in the human brain
2024cites this paper
Cortical encoding of phonetic onsets of both attended and ignored speech in hearing impaired individuals
2024cites this paper
Phoneme-related potentials recorded from normal hearing listeners and cochlear implant users in a selective attention paradigm to continuous speech.
2024cites this paper
FMRI speech tracking in primary and non-primary auditory cortex while listening to noisy scenes
2024cites this paper
Cortical tracking of speakers’ spectral changes predicts selective listening
2024cites this paper
The role of auditory source and action representations in segmenting experience into events
2024cites this paper
Attention-Driven Modulation of Auditory Cortex Activity during Selective Listening in a Multispeaker Setting
2024cites this paper
‘Are you even listening?’ - EEG-based decoding of absolute auditory attention to natural speech
2024influential citation
Target enhancement but not distractor suppression in auditory neural tracking during continuous speech
2023influential citation
Investigating Self-Supervised Deep Representations for EEG-Based Auditory Attention Decoding
2023cites this paper
Attention, Musicality, and Familiarity Shape Cortical Speech Tracking at the Musical Cocktail Party
2023cites this paper
Individual differences in speech-on-speech masking are correlated with cognitive and visual task performance
2023cites this paper
The effect of gaze on EEG measures of multisensory integration in a cocktail party scenario
2023cites this paper
Functional Hearing Difficulties in Blast-Exposed Service Members With Normal to Near-Normal Hearing Thresholds
2023cites this paper
Two effects of perceived speaker similarity in resolving the cocktail party situation - ERPs and functional connectivity.
2023cites this paper
Cortical over-representation of phonetic onsets of ignored speech in hearing impaired individuals
2023cites this paper
Distinct neural encoding of glimpsed and masked speech in multitalker situations
2023cites this paper
Antithetical contribution of primary and non-primary auditory cortex while listening to speech in noisy scenes
2023cites this paper
Automatic Auditory Streaming Restores Missing Temporal Modulations in Echoic Speech
2023cites this paper
The Effects of Speech Masking on Neural Tracking of Acoustic and Semantic Features of Natural Speech
2023cites this paper
Age-related deficits in dip-listening evident for isolated sentences but not for spoken stories
2022cites this paper
Deep neural networks effectively model neural adaptation to changing background noise and suggest nonlinear noise filtering methods in auditory cortex
2022cites this paper
Separate neural subsystems support goal-directed speech listening
2022cites this paper
Neural dynamics differentially encode phrases and sentences during spoken language comprehension
2022cites this paper
Auditory neural tracking reflects target enhancement but not distractor suppression in a psychophysically augmented continuous-speech paradigm
2022cites this paper
Speech prosody supports speaker selection and auditory stream segregation in a multi-talker situation
2022cites this paper
Do we parse the background into separate streams in the cocktail party?
2022cites this paper
Neural encoding of phrases and sentences in spoken language comprehension
2021cites this paper
Cortical Tracking of the Speech Envelope in Logopenic Variant Primary Progressive Aphasia
2021cites this paper
The integration of continuous audio and visual speech in a cocktail-party environment depends on attention
2021cites this paper
Effects of Hearing Aid Noise Reduction on Early and Late Cortical Representations of Competing Talkers in Noise
2021cites this paper
Errors on a Speech-in-Babble Sentence Recognition Test Reveal Individual Differences in Acoustic Phonetic Perception and Babble Misallocations
2021cites this paper
Linguistic processing of task-irrelevant speech at a cocktail party
2021cites this paper
Estimated Prevalence of Functional Hearing Difficulties in Blast-Exposed Service Members With Normal to Near–Normal-Hearing Thresholds
2021cites this paper
EEG alpha and pupil diameter reflect endogenous auditory attention switching and listening effort
2021cites this paper
Hearing, listening and deep neural networks in hearing aids
2021cites this paper
Decoding Object-Based Auditory Attention from Source-Reconstructed MEG Alpha Oscillations
2021influential citation
Modulating Cortical Instrument Representations During Auditory Stream Segregation and Integration With Polyphonic Music
2021cites this paper
Selective auditory attention within naturalistic scenes modulates reactivity to speech sounds
2021influential citation
Listening to speech in noisy scenes: Antithetical contribution of primary and non-primary auditory cortex
2021cites this paper
Cortical tracking of voice pitch in the presence of multiple speakers depends on selective attention
2021cites this paper
Tracking selective attention in a musical cocktail
2021cites this paper
Pitch, timbre and intensity interdependently modulate neural responses to salient sounds.
2020cites this paper
Decoding of Envelope vs. Fundamental Frequency During Complex Auditory Stream Segregation
2020cites this paper
Poor early cortical differentiation of speech predicts perceptual difficulties of severely hearing-impaired listeners in multi-talker environments
2020cites this paper
Cortical processing of distracting speech in noisy auditory scenes depends on perceptual demand
2020cites this paper
Dynamic Processing of Background Speech at the Cocktail Party: Evidence for Early Active Cortical Stream Segregation
2020influential citation
Attentional Modulation of Hierarchical Speech Representations in a Multi-Talker Environment
2020influential citation
Continuous speech processing.
2020cites this paper
Neural Representation Enhanced for Speech and Reduced for Background Noise With a Hearing Aid Noise Reduction Scheme During a Selective Attention Task
2020cites this paper
Continuous speech processing R1 changes accepted
2020cites this paper
Paying attention to speech: The role of working memory capacity and professional experience
2020cites this paper
Auditory stimulus-response modeling with a match-mismatch task
2020cites this paper
The audiology of Oticon More TM
2020cites this paper
The effects of speech processing units on auditory stream segregation and selective attention in a multi-talker (cocktail party) situation.
2020cites this paper
Oticon More ™ clinical evidence
2020cites this paper
Temporal Coherence Principle in Scene Analysis
2020cites this paper
Linguistic processing of task-irrelevant speech at a Cocktail Party
2020cites this paper
Attention Differentially Affects Acoustic and Phonetic Feature Encoding in a Multispeaker Environment
2020cites this paper
A model of listening engagement (MoLE).
2019cites this paper
The effects of distractor set‐size on neural tracking of attended speech
2019cites this paper
Cortical Tracking of Speech-in-Noise Develops from Childhood to Adulthood
2019cites this paper
Effect of Task and Attention on Neural Tracking of Speech
2019cites this paper
Neural speech restoration at the cocktail party: Auditory cortex recovers masked speech of both attended and ignored speakers
2019influential citation
Auditory Cortex Tracks Masked Acoustic Onsets in Background Speech: Evidence for Early Cortical Stream Segregation
2019cites this paper
Hierarchical Encoding of Attended Auditory Objects in Multi-talker Speech Perception.
2019cites this paper
Cognitive resources are distributed among the entire auditory landscape in auditory scene analysis.
2019cites this paper
Invariance to background noise as a signature of non-primary auditory cortex
2019cites this paper
Paying Attention to Speech: The Role of Cognitive Capacity and Acquired Experience
2019cites this paper
Look at me when I'm talking to you: Selective attention at a multisensory cocktail party can be decoded using stimulus reconstruction and alpha power modulations
2019cites this paper
A Comparison of Regularization Methods in Forward and Backward Models for Auditory Attention Decoding
2018cites this paper
Brainstem‐cortical functional connectivity for speech is differentially challenged by noise and reverberation
2018cites this paper
Neural Signatures of the Processing of Temporal Patterns in Sound
2018cites this paper
Characterizing neural mechanisms of attention-driven speech processing
2018cites this paper
Transformation from auditory to linguistic representations across auditory cortex is rapid and attention dependent for continuous speech
2018cites this paper
Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech.
2018cites this paper
Rapid transformation from auditory to lin-1 guistic representations of continuous 2 speech 3
2018cites this paper
Recent advances in understanding the auditory cortex
2018cites this paper
Musicians at the Cocktail Party: Neural Substrates of Musical Training During Selective Listening in Multispeaker Situations.
2018cites this paper
Causal cortical dynamics of a predictive enhancement of speech intelligibility
2018cites this paper
Cortical tracking of multiple streams outside the focus of attention in naturalistic auditory scenes
2018influential citation
Neural Decoding of Bistable Sounds Reveals an Effect of Intention on Perceptual Organization
2017cites this paper
Tracking Temporal Hazard in the Human Electroencephalogram Using a Forward Encoding Model
2017cites this paper
Auditory Figure-Ground Segregation Is Impaired by High Visual Load
2017cites this paper
Speech processing using adaptive auditory receptive fields
2017cites this paper
2 0 2 0
year unknowncites this paper
2 0 2 4
year unknowncites this paper