Audiovisual speech processing /
Audiovisual speech processing /
edited by G. Bailly, P. Perrier, and E. Vatikiotis-Bateson.
- Cambridge : Cambridge University Press, 2012.
- 1 online resource (xxxvi, 470 pages) : digital, PDF file(s).
Title from publisher's bibliographic system (viewed on 05 Oct 2015).
Three puzzles of multimodal speech perception / Visual speech perception / Dynamic information for face perception / Investigating auditory-visual speech perception development / Brain bases for seeing speech: fMRI studies of speechreading / Temporal organization of cued speech production / Bimodal perception within the natural time-course of speech production / Visual and audiovisual synthesis and recognition of speech by computers / Audiovisual automatic speech recognition / Image-based facial synthesis / A trainable videorealistic speech animation system / Empirical perceptual-motor linkage of multimodal speech / Sensorimotor characteristics of speech production / R.E. Remez -- L.E. Bernstein -- K. Lander and V. Bruce -- D. Burnham and K. Sekiyama -- R. Campbell and M. MacSweeney -- D. Beautemps, M.-A. Cathiard, V. Attina, and C. Savariaux -- M.-A. Cathiard, A. Vilain, R. Laboissière, H. Loevenbruck, C. Savariaux, and J.-L. Schwartz -- N.M. Brooke and S.D. Scott -- G. Potamianos, C. Neti, J. Luettin, and I. Matthews -- M. Slaney and C. Bregler -- T. Ezzat, G. Geiger, and T. Poggio -- D.W. Massaro, M.M. Cohen, M. Tabain, J. Beskow, and R. Clark -- E. Vatikiotis-Bateson and K.G. Munhall -- G. Bailly, P. Badin, L. Revéret, and A. Ben Youssef. 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. Animated speech: research progress and applications / 13. 14.
When we speak, we configure the vocal tract which shapes the visible motions of the face and the patterning of the audible speech acoustics. Similarly, we use these visible and audible behaviors to perceive speech. This book showcases a broad range of research investigating how these two types of signals are used in spoken communication, how they interact, and how they can be used to enhance the realistic synthesis and recognition of audible and visible speech. The volume begins by addressing two important questions about human audiovisual performance: how auditory and visual signals combine to access the mental lexicon and where in the brain this and related processes take place. It then turns to the production and perception of multimodal speech and how structures are coordinated within and across the two modalities. Finally, the book presents overviews and recent developments in machine-based speech recognition and synthesis of AV speech.
9780511843891 (ebook)
Speech perception.
Speech processing systems.
BF463.S64 / A89 2012
616.85/5
Title from publisher's bibliographic system (viewed on 05 Oct 2015).
Three puzzles of multimodal speech perception / Visual speech perception / Dynamic information for face perception / Investigating auditory-visual speech perception development / Brain bases for seeing speech: fMRI studies of speechreading / Temporal organization of cued speech production / Bimodal perception within the natural time-course of speech production / Visual and audiovisual synthesis and recognition of speech by computers / Audiovisual automatic speech recognition / Image-based facial synthesis / A trainable videorealistic speech animation system / Empirical perceptual-motor linkage of multimodal speech / Sensorimotor characteristics of speech production / R.E. Remez -- L.E. Bernstein -- K. Lander and V. Bruce -- D. Burnham and K. Sekiyama -- R. Campbell and M. MacSweeney -- D. Beautemps, M.-A. Cathiard, V. Attina, and C. Savariaux -- M.-A. Cathiard, A. Vilain, R. Laboissière, H. Loevenbruck, C. Savariaux, and J.-L. Schwartz -- N.M. Brooke and S.D. Scott -- G. Potamianos, C. Neti, J. Luettin, and I. Matthews -- M. Slaney and C. Bregler -- T. Ezzat, G. Geiger, and T. Poggio -- D.W. Massaro, M.M. Cohen, M. Tabain, J. Beskow, and R. Clark -- E. Vatikiotis-Bateson and K.G. Munhall -- G. Bailly, P. Badin, L. Revéret, and A. Ben Youssef. 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. Animated speech: research progress and applications / 13. 14.
When we speak, we configure the vocal tract which shapes the visible motions of the face and the patterning of the audible speech acoustics. Similarly, we use these visible and audible behaviors to perceive speech. This book showcases a broad range of research investigating how these two types of signals are used in spoken communication, how they interact, and how they can be used to enhance the realistic synthesis and recognition of audible and visible speech. The volume begins by addressing two important questions about human audiovisual performance: how auditory and visual signals combine to access the mental lexicon and where in the brain this and related processes take place. It then turns to the production and perception of multimodal speech and how structures are coordinated within and across the two modalities. Finally, the book presents overviews and recent developments in machine-based speech recognition and synthesis of AV speech.
9780511843891 (ebook)
Speech perception.
Speech processing systems.
BF463.S64 / A89 2012
616.85/5