nessessence / Kaldi_ASR_TutorialLinks
speech recognition using Kaldi framework
☆12Updated 5 years ago
Alternatives and similar repositories for Kaldi_ASR_Tutorial
Users that are interested in Kaldi_ASR_Tutorial are comparing it to the libraries listed below
Sorting:
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- Machine learning speaker characteristics☆35Updated 2 weeks ago
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆40Updated 3 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆37Updated 3 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆52Updated 2 years ago
- Deep Speech Distances PyTorch☆29Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Official PyTorch implementation of TTS Style Transfer☆23Updated 3 years ago
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆38Updated 3 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- asr2k☆50Updated last year
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆27Updated last year
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆23Updated 3 years ago
- ☆43Updated 2 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 6 months ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆29Updated 2 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- Perform the forced decoding with target transcription☆11Updated 6 years ago
- ☆30Updated 2 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Updated 3 weeks ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆42Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆52Updated last month
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated 2 years ago
- ☆56Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆25Updated last year