MULTI-FEATURE ANALYSIS AND ENSEMBLE LEARNING FOR IMPROVED EMOTION RECOGNITION IN WHISPERED SPEECH

Authors

  • D. Sunitha, Dr. P. Narahari Sastry Author

Abstract

Abstract: This paper presents a novel method for emotion recognition from whispered speech, integrating advanced techniques in feature extraction, feature selection, and classification to enhance accuracy and robustness. The approach begins with extracting three types of features: wavelet features for multi-resolution analysis, prosodic features for pitch and intensity, and spectral features such as formants, Mel-Frequency Cepstral Coefficients (MFCCs), and Long-Term Average Spectrum (LTAS) to capture comprehensive emotional information. A two-step feature selection process, involving partial correlation analysis and Linear Discriminant Analysis (LDA), is employed to identify and retain the most informative features while reducing dimensionality. Classification is performed using an ensemble learning strategy that combines Support Vector Machine (SVM) and Decision Tree classifiers, with SVM distinguishing between neutral and emotional states and the Decision Tree further categorizing emotions. Simulation results using the GeWEC dataset demonstrate the effectiveness of the proposed method, achieving significant improvements in Unweighted Average Recall (UAR) across various configurations. This underscores the method’s capability to accurately recognize emotional states from whispered speech, offering valuable insights for practical applications in emotion recognition systems.

Downloads

Published

2024-07-31

Issue

Section

Articles

How to Cite

MULTI-FEATURE ANALYSIS AND ENSEMBLE LEARNING FOR IMPROVED EMOTION RECOGNITION IN WHISPERED SPEECH. (2024). JOURNAL OF BASIC SCIENCE AND ENGINEERING, 21(1), 1943-1958. https://yigkx.org.cn/index.php/jbse/article/view/277