bhattbhavesh91 / wav2vec2-huggingface-demo
Speech to Text with self-supervised learning based on the wav2vec 2.0 framework using Hugging Face Transformers
☆30 Updated 3 years ago
Alternatives and similar repositories for wav2vec2-huggingface-demo:
Users who are interested in wav2vec2-huggingface-demo are comparing it to the repositories listed below.
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆72 Updated 3 years ago
- ☆42 Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0 ☆43 Updated 3 years ago
- Repository containing an experimentation platform for training and running inference with wav2vec2 models. ☆86 Updated 2 years ago
- Transformer implementation specialized in speech recognition tasks using PyTorch. ☆65 Updated 3 years ago
- An implementation of Speech Emotion Recognition, based on the HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning … ☆32 Updated 2 years ago
- This project performs speaker diarization for the Hindi language. ☆47 Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆50 Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques ☆57 Updated 3 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2 ☆91 Updated 3 years ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass… ☆23 Updated 11 months ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model ☆47 Updated 3 years ago
- The codebase for Data-driven general-purpose voice activity detection. ☆93 Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆81 Updated last year
- Machine learning experiment to perform gender classification from raw audio. ☆23 Updated 6 years ago
- ☆179 Updated 2 years ago
- A simplified version of wav2vec (1.0, vq, 2.0) in fairseq ☆138 Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆64 Updated 3 years ago
- Toolbox for easy and qualitative one-shot voice conversion ☆45 Updated 3 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://… ☆12 Updated 2 years ago
- PyTorch implementation for the MultiSpeech: Multi-Speaker Text to Speech with Transformer paper ☆21 Updated 2 years ago
- ☆42 Updated 2 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆72 Updated 4 years ago
- Official Implementation of Mockingjay in PyTorch ☆53 Updated last year
- ☆18 Updated 2 years ago
- Audio classification is a popular topic; here I implement several models using TensorFlow and Keras. ☆24 Updated 4 years ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP. ☆15 Updated 3 years ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR). ☆38 Updated 5 years ago