Robust Emotion Recognition using Spectral and Prosodic Features

In this brief, the authors discuss recently explored spectral (sub-segmental and pitch synchronous) and prosodic (global and local features at word and syllable levels in different parts of the utterance) features for discerning emotions in a robust manner. The authors also delve into the complement...

Full description

Bibliographic Details
Main Authors: Rao, K. Sreenivasa, Koolagudi, Shashidhar G. (Author)
Format: eBook
Language:English
Published: New York, NY Springer New York 2013, 2013
Edition:1st ed. 2013
Series:SpringerBriefs in Speech Technology, Studies in Speech Signal Processing, Natural Language Understanding, and Machine Learning
Subjects:
Online Access:
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
LEADER 02648nmm a2200373 u 4500
001 EB000365128
003 EBX01000000000000000218180
005 00000000000000.0
007 cr|||||||||||||||||||||
008 130626 ||| eng
020 |a 9781461463603 
100 1 |a Rao, K. Sreenivasa 
245 0 0 |a Robust Emotion Recognition using Spectral and Prosodic Features  |h Elektronische Ressource  |c by K. Sreenivasa Rao, Shashidhar G. Koolagudi 
250 |a 1st ed. 2013 
260 |a New York, NY  |b Springer New York  |c 2013, 2013 
300 |a XII, 118 p. 37 illus., 15 illus. in color  |b online resource 
505 0 |a Introduction -- Robust Emotion Recognition using Pitch Synchronous and Sub-syllabic Spectral Features -- Robust Emotion Recognition using Word and Syllable Level Prosodic Features -- Robust Emotion Recognition using Combination of Excitation Source, Spectral and Prosodic Features -- Robust Emotion Recognition using Speaking Rate Features -- Emotion Recognition on Real Life Emotions -- Summary and Conclusions -- MFCC Features -- Gaussian Mixture Model (GMM) 
653 |a User interfaces (Computer systems) 
653 |a Computational Linguistics 
653 |a Computational linguistics 
653 |a Signal, Speech and Image Processing 
653 |a Natural Language Processing (NLP) 
653 |a Signal processing 
653 |a User Interfaces and Human Computer Interaction 
653 |a Human-computer interaction 
653 |a Natural language processing (Computer science) 
700 1 |a Koolagudi, Shashidhar G.  |e [author] 
041 0 7 |a eng  |2 ISO 639-2 
989 |b Springer  |a Springer eBooks 2005- 
490 0 |a SpringerBriefs in Speech Technology, Studies in Speech Signal Processing, Natural Language Understanding, and Machine Learning 
028 5 0 |a 10.1007/978-1-4614-6360-3 
856 4 0 |u https://doi.org/10.1007/978-1-4614-6360-3?nosfx=y  |x Verlag  |3 Volltext 
082 0 |a 621.382 
520 |a In this brief, the authors discuss recently explored spectral (sub-segmental and pitch synchronous) and prosodic (global and local features at word and syllable levels in different parts of the utterance) features for discerning emotions in a robust manner. The authors also delve into the complementary evidences obtained from excitation source, vocal tract system and prosodic features for the purpose of enhancing emotion recognition performance. Features based on speaking rate characteristics are explored with the help of multi-stage and hybrid models for further improving emotion recognition performance. Proposed spectral and prosodic features are evaluated on real life emotional speech corpus