nessessence / Kaldi_ASR_Tutorial
speech recognition using Kaldi framework
☆12Updated 4 years ago
Related projects: ⓘ
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆17Updated 2 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆22Updated 3 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆38Updated 2 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆28Updated 3 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆20Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆9Updated last year
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆19Updated 7 months ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆31Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆39Updated 2 months ago
- ☆31Updated 2 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆47Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Deep Speech Distances PyTorch☆27Updated 2 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆32Updated 7 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆27Updated 4 months ago
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆36Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 4 years ago
- Prosodic Speech Segmentation with Transformers☆22Updated 6 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated 11 months ago
- asr2k☆48Updated 3 months ago
- Clustering-based methods for overlapping diarization☆68Updated 8 months ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆24Updated 5 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated last year
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆39Updated last year
- ☆23Updated 5 years ago
- Speech synthesis using LPC☆19Updated 3 years ago
- ☆56Updated last year