Improvement of Speech Emotion Recognition by Deep Convolutional Neural Network and Speech Features
- Title
- Improvement of Speech Emotion Recognition by Deep Convolutional Neural Network and Speech Features
- Creator
- Mohanty A.; Cherukuri R.C.; Prusty A.R.
- Description
- Speech emotion recognition (SER) is a dynamic area of research which includes features extraction, classification and adaptation of speech emotion dataset. There are many applications where human emotions play a vital role for giving smart solutions. Some of these applications are vehicle communications, classification of satisfied and unsatisfied customers in call centers, in-car board system based on information on drivers mental state, human-computer interaction system and others. In this contribution, an improved emotion recognition technique has been proposed with Deep Convolutional Neural Network (DCNN) by using both speech spectral and prosodic features to classify seven human emotionsanger, disgust, fear, happiness, neutral, sadness and surprise. The proposed idea is implemented on different datasets such as RAVDESS, SAVEE, TESS and CREMA-D with accuracy of 96.54%, 92.38%, 99.42% and 87.90%, respectively, and compared with other pre-defined machine learning and deep learning methods. To test the real-time accuracy of the model, it has been implemented on the combined datasets with accuracy of 90.27%. This research can be useful for development of smart applications in mobile devices, household robots and online learning management system. 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
- Source
- Lecture Notes in Networks and Systems, Vol-608, pp. 117-129.
- Date
- 2023-01-01
- Publisher
- Springer Science and Business Media Deutschland GmbH
- Subject
- Data augmentation; Deep convolutional neural network; Emotion recognition; Speech dataset; Speech features
- Coverage
- Mohanty A., CHRIST (Deemed to be University), Karnataka, Bangalore, India; Cherukuri R.C., CHRIST (Deemed to be University), Karnataka, Bangalore, India; Prusty A.R., DGT, RDSDE, NSTI(W), West Bengal, Kolkata, India
- Rights
- Restricted Access
- Relation
- ISSN: 23673370; ISBN: 978-981199224-7
- Format
- Online
- Language
- English
- Type
- Conference paper
Collection
Citation
Mohanty A.; Cherukuri R.C.; Prusty A.R., “Improvement of Speech Emotion Recognition by Deep Convolutional Neural Network and Speech Features,” CHRIST (Deemed To Be University) Institutional Repository, accessed February 27, 2025, https://archives.christuniversity.in/items/show/19997.