Dataset:

KELONMYOSA/dusha_emotion_audio

Language:

ru

Size:

100K<n<1M

This dataset was taken from the creators' GitHub repository and converted for my own study purposes.

Dusha dataset

Dusha is a bi-modal corpus suitable for speech emotion recognition (SER) tasks. The dataset consists of about 300,000 audio recordings of Russian speech, together with their transcripts and emotion labels, for approximately 350 hours of data in total. Four basic emotions that commonly appear in dialogs with a virtual assistant were selected: Happiness (Positive), Sadness, Anger, and Neutral.
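
For orientation, here is a minimal sketch of how this converted copy might be loaded with the Hugging Face `datasets` library. Only the repository id comes from this card. The split name `"train"`, the label strings in `EMOTIONS`, and the helper names `load_dusha` and `class_counts` are assumptions made for illustration and may not match what the dataset actually stores.

```python
from collections import Counter

# The four emotion classes named in the description above.
# The exact label strings used inside the dataset are an assumption.
EMOTIONS = ("neutral", "positive", "sad", "angry")
LABEL2ID = {name: idx for idx, name in enumerate(EMOTIONS)}


def load_dusha(split="train"):
    """Stream one split from the Hugging Face Hub.

    Requires `pip install datasets`. The split name is an assumption;
    streaming avoids downloading the whole corpus up front.
    """
    from datasets import load_dataset  # imported lazily, only when needed

    return load_dataset(
        "KELONMYOSA/dusha_emotion_audio", split=split, streaming=True
    )


def class_counts(labels):
    """Count how often each label occurs in an iterable of labels."""
    return Counter(labels)
```

Since the column names are not listed here, a reasonable first step is to fetch one example with `next(iter(load_dusha()))` and inspect its keys before relying on any particular field. Decoding the audio field may need additional audio packages installed.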

License

English Version

Russian Version

Authors

  • Artem Sokolov
  • Fedor Minkin
  • Nikita Savushkin
  • Nikolay Karpov
  • Oleg Kutuzov
  • Vladimir Kondratenko