TY - JOUR
T1 - The perceptual flow of phonetic feature processing
AU - Greenberg, Steven
AU - Christiansen, Thomas Ulrich
N1 - Copyright (2008) Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America.
PY - 2008
Y1 - 2008
N2 - How does the brain process spoken language? It is our thesis that word intelligibility and consonant identification are insufficient by themselves to model how the speech signal is decoded - a finer-grained approach is required. In this study, listeners identified 11 different Danish consonants spoken in a Consonant + Vowel + [l] environment. Each syllable was processed so that only a portion of the original audio spectrum was present. Three-quarter-octave bands of speech, centered at 750, 1500, and 3000 Hz, were presented individually and in combination with each other. The conditional, posterior probabilities associated with phonetic-feature decoding were computed from confusion matrices in order to deduce the temporal flow of phonetic processing. Decoding the feature, Manner-of-Articulation, depends on accurate decoding of the feature Voicing (but not vice-versa), and decoding Place-of-Articulation requires precise decoding of Manner (but not the converse). From these data, we conclude that Voicing is processed prior to Manner-of-Articulation, and that Manner is decoded prior to Place-of-Articulation. Voicing and Manner cues are often correctly decoded in conditions where Place is not. This asymmetric pattern of feature decoding may provide extra-segmental information of utility for speech processing, particularly in adverse listening conditions.
AB - How does the brain process spoken language? It is our thesis that word intelligibility and consonant identification are insufficient by themselves to model how the speech signal is decoded - a finer-grained approach is required. In this study, listeners identified 11 different Danish consonants spoken in a Consonant + Vowel + [l] environment. Each syllable was processed so that only a portion of the original audio spectrum was present. Three-quarter-octave bands of speech, centered at 750, 1500, and 3000 Hz, were presented individually and in combination with each other. The conditional, posterior probabilities associated with phonetic-feature decoding were computed from confusion matrices in order to deduce the temporal flow of phonetic processing. Decoding the feature, Manner-of-Articulation, depends on accurate decoding of the feature Voicing (but not vice-versa), and decoding Place-of-Articulation requires precise decoding of Manner (but not the converse). From these data, we conclude that Voicing is processed prior to Manner-of-Articulation, and that Manner is decoded prior to Place-of-Articulation. Voicing and Manner cues are often correctly decoded in conditions where Place is not. This asymmetric pattern of feature decoding may provide extra-segmental information of utility for speech processing, particularly in adverse listening conditions.
U2 - 10.1121/1.2935993
DO - 10.1121/1.2935993
M3 - Journal article
VL - 123
SP - 3932
EP - 3932
JO - Acoustical Society of America. Journal
JF - Acoustical Society of America. Journal
SN - 0001-4966
IS - 5
ER -