abikaki / awesome-speech-emotion-recognition
😎 Awesome lists about Speech Emotion Recognition
☆83Updated 2 months ago
Alternatives and similar repositories for awesome-speech-emotion-recognition:
Users that are interested in awesome-speech-emotion-recognition are comparing it to the libraries listed below
- ☆64Updated 6 months ago
- EMO-SUPERB submission☆42Updated 6 months ago
- [Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units☆28Updated 4 months ago
- [INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark☆210Updated 8 months ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆315Updated 5 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆120Updated last week
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆82Updated last year
- [Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation☆144Updated last month
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".☆121Updated 2 months ago
- Official implement of SpeechFormer written in Python (PyTorch).☆77Updated last year
- ☆158Updated 8 months ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆36Updated 8 months ago
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"☆131Updated 3 months ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33Updated 2 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆128Updated 2 months ago
- Official repository of NeXt-TDNN for speaker verification☆67Updated 5 months ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆146Updated 3 years ago
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…☆58Updated 8 months ago
- ☆49Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆50Updated 2 years ago
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)☆51Updated 10 months ago
- A collection of dataset consists of a total of 8 English speech datasets for SER☆18Updated 2 months ago
- ☆104Updated 2 years ago
- [WACV 2023] Audio-Visual Efficient Conformer (AVEC) for Robust Speech Recognition☆92Updated 2 years ago
- This is the audio sample repository for speech separation model "MossFormer2".☆120Updated 3 months ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities☆113Updated 3 months ago
- ☆115Updated 2 years ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆193Updated 10 months ago
- VoiceBench: Benchmarking LLM-Based Voice Assistants☆140Updated this week
- The official implementation of EmoSphere-TTS☆111Updated last month