nessessence / Kaldi_ASR_Tutorial
speech recognition using Kaldi framework
☆12Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Kaldi_ASR_Tutorial
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆38Updated 2 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆9Updated last year
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated last year
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆19Updated 2 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆22Updated 9 months ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 2 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆28Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆30Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆47Updated last year
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆23Updated 4 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- Speech synthesis using LPC☆19Updated 3 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 6 months ago
- PyTorch based speaker embedding model☆15Updated 7 months ago
- ☆32Updated 2 months ago
- Conformer-based Metric GAN for speech enhancement☆26Updated 6 months ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆13Updated 5 years ago
- ☆40Updated 2 years ago
- ☆10Updated last year
- ☆23Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆44Updated 4 months ago
- ☆11Updated last year
- Repository for Accent Recognition (Hackathon @SLT2022)☆23Updated 6 months ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Interspeech2024 | Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models☆12Updated 3 months ago
- ☆11Updated last year