mansourehk / ShEMOLinks
Sharif Emotional Speech Database
☆36Updated 4 years ago
Alternatives and similar repositories for ShEMO
Users that are interested in ShEMO are comparing it to the libraries listed below
Sorting:
- A large-scale validated database for Persian speech emotion detection.☆24Updated 3 years ago
- Persian Grapheme-to-Phoneme (G2P) converter☆41Updated 11 months ago
- Wav2Vec for speech recognition, classification, and audio classification☆265Updated 3 years ago
- fine-tune Wav2vec2. an ASR model released by Facebook☆37Updated 3 years ago
- Urdu Language Speech Emotional Corpus☆46Updated 6 years ago
- Persian Grapheme To Phoneme with Transformer in Pytorch☆11Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- Grapheme To Phoneme☆73Updated 11 months ago
- In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any …☆41Updated 6 months ago
- Script to train a German n-gram Language Model on articles of Wikipedia☆13Updated 6 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- A unified dataset of multilingual emotional human utterances☆26Updated 3 years ago
- Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …☆20Updated 2 years ago
- ☆30Updated 2 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- ☆19Updated last year
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 9 months ago
- Persian Grapheme-to-Phoneme (G2P) converter☆20Updated 4 years ago
- ☆11Updated 2 years ago
- ☆66Updated 10 months ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset☆19Updated 2 months ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆123Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆36Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆54Updated 2 years ago
- ☆51Updated 3 years ago
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- [ICASSP 2025] Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners".☆28Updated 2 weeks ago