HumeAI / competitionsLinks
Hume AI ML Competitions
☆25Updated 2 years ago
Alternatives and similar repositories for competitions
Users that are interested in competitions are comparing it to the libraries listed below
Sorting:
- Transformer-based visually grounded speech models☆19Updated 2 years ago
- Official code for Wav2Seq☆96Updated 2 years ago
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Updated last year
- Deep Articulatory Synthesis and Inversion☆52Updated last year
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated 2 years ago
- ☆51Updated 3 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- asr2k☆50Updated last year
- Evaluation kit for the HEAR Benchmark☆59Updated last month
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- ☆40Updated 3 weeks ago
- The official repository for Audio ALBERT☆65Updated 3 years ago
- A unified dataset of multilingual emotional human utterances☆26Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆57Updated 3 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆44Updated 3 years ago
- ☆36Updated 4 years ago
- A deep learning model for classifying audio frames into [SPEECH, KCHI, CHI, MAL, FEM] classes.☆44Updated last week
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".☆55Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- ☆36Updated 2 years ago
- Feature extractor for DL speech processing.☆65Updated 3 years ago
- ☆52Updated 4 years ago
- ☆17Updated 2 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆32Updated last year
- ☆11Updated 2 years ago
- ☆21Updated last year
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 5 years ago