nessessence / Kaldi_ASR_Tutorial
speech recognition using Kaldi framework
☆12Updated 5 years ago
Alternatives and similar repositories for Kaldi_ASR_Tutorial:
Users that are interested in Kaldi_ASR_Tutorial are comparing it to the libraries listed below
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆37Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆50Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 9 months ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆22Updated 2 years ago
- Official PyTorch implementation of TTS Style Transfer☆23Updated 2 years ago
- ☆42Updated 2 years ago
- ☆28Updated 4 years ago
- ☆27Updated last year
- asr2k☆49Updated 10 months ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆29Updated 9 months ago
- PyTorch based speaker embedding model☆16Updated 11 months ago
- ☆32Updated 3 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆27Updated last year
- ☆24Updated last year
- Deep Speech Distances PyTorch☆27Updated 3 years ago
- A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages☆9Updated 4 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- Urban Sound Classification : striving towards a fair comparison☆17Updated 4 years ago
- Emotional Speech Conversion using Nonparallel Data☆16Updated 5 years ago
- Speech synthesis using LPC☆20Updated 3 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆40Updated 3 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago