BirgerMoell / tmh
☆18Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for tmh
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 3 years ago
- ☆59Updated 2 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆47Updated last year
- ☆37Updated 3 years ago
- ☆45Updated 3 years ago
- The official repository for Audio ALBERT☆64Updated 2 years ago
- MSP-Podcast Challenge Baseline Code☆17Updated 5 months ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆89Updated 2 years ago
- 56 language, 1 model Multilingual ASR☆24Updated 3 years ago
- The VoxTube dataset official repository☆61Updated 9 months ago
- ☆74Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Clustering-based methods for overlapping diarization☆70Updated 10 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆72Updated 5 months ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- multilingual speech aligner☆72Updated last year
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).☆42Updated 2 years ago
- Phoneme segmentation using pre-trained speech models☆54Updated 2 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆19Updated 2 years ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆24Updated 2 months ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- asr2k☆48Updated 5 months ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆32Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated last year
- A unified dataset of multilingual emotional human utterances☆23Updated 2 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- ☆98Updated 2 years ago