bhattbhavesh91 / wav2vec2-huggingface-demo
Speech to Text with self-supervised learning based on the wav2vec 2.0 framework, using Hugging Face's Transformers
☆30 Updated 3 years ago
Alternatives and similar repositories for wav2vec2-huggingface-demo:
Users who are interested in wav2vec2-huggingface-demo are comparing it to the repositories listed below
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 Updated 3 years ago
- ☆43 Updated 2 years ago
- Finetune Wav2vec 2.0 For Speech Recognition ☆126 Updated last month
- Repository containing an experimentation platform for training and running inference with wav2vec2 models. ☆86 Updated 2 years ago
- Explore different ways to mix speech models (wav2vec2, hubert) and NLP models (BART, T5, GPT) together ☆47 Updated last year
- Various speech datasets made available to the public ☆115 Updated 3 months ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://… ☆13 Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆81 Updated last year
- a simplified version of wav2vec(1.0, vq, 2.0) in fairseq ☆148 Updated 4 years ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem ☆91 Updated last year
- End-to-End Mispronunciation Detection via wav2vec2.0 ☆43 Updated 3 years ago
- Transformer implementation specialized in speech recognition tasks using Pytorch. ☆64 Updated 3 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning … ☆33 Updated 2 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2 ☆91 Updated 3 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆75 Updated 4 years ago
- Wav2Vec for speech recognition, classification, and audio classification ☆261 Updated 2 years ago
- ☆45 Updated 2 years ago
- transformer for ASR system (via tensorflow2.0) ☆114 Updated 5 years ago
- A lightweight library to compute Diarization Error Rate (DER). ☆59 Updated last year
- This project is about performing Speaker diarization for Hindi Language. ☆49 Updated 4 years ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass… ☆27 Updated last year
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR). ☆38 Updated 5 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 Updated last year
- ☆91 Updated 2 years ago
- The codebase for Data-driven general-purpose voice activity detection. ☆93 Updated last year
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM) ☆213 Updated last year
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques ☆59 Updated 3 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data ☆70 Updated 3 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings' ☆129 Updated 2 months ago