nessessence / Kaldi_ASR_TutorialLinks
speech recognition using Kaldi framework
☆12Updated 6 years ago
Alternatives and similar repositories for Kaldi_ASR_Tutorial
Users that are interested in Kaldi_ASR_Tutorial are comparing it to the libraries listed below
Sorting:
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆37Updated 3 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- ☆67Updated 6 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80Updated 2 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- ☆30Updated 3 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated 2 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality☆23Updated 6 years ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 6 years ago
- Deep Speech Distances PyTorch☆29Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆108Updated 2 weeks ago
- Word Error Rate Estimation☆15Updated 5 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆33Updated last year
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 3 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆47Updated 4 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆90Updated last year
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆61Updated 5 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- Grapheme To Phoneme☆73Updated last year
- ☆101Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆27Updated 3 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆52Updated 5 years ago
- ☆27Updated 4 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 4 years ago
- asr2k☆52Updated last year