nessessence / Kaldi_ASR_Tutorial
Speech recognition using the Kaldi framework
☆12Updated 5 years ago
Alternatives and similar repositories for Kaldi_ASR_Tutorial
Users who are interested in Kaldi_ASR_Tutorial are comparing it to the repositories listed below
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆37Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆26Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆21Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆32Updated last year
- Deep Speech Distances PyTorch☆29Updated 3 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 3 years ago
- ☆32Updated 3 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆37Updated last year
- PyTorch based speaker embedding model☆16Updated last year
- Digital Speech Processing in PyTorch.☆14Updated 2 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"☆21Updated 3 years ago
- ☆43Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆54Updated 2 years ago
- Repository for reproducing the results of the journal article "Self-supervised learning for Speech Emotion Recognition"☆10Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆43Updated 2 years ago
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion.☆31Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- A library of speech gadgets.☆13Updated 2 years ago
- A fast Python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated 2 weeks ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆29Updated 3 months ago
- Generative Adversarial Networks for different impaired speech conversions☆37Updated 2 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Updated 3 years ago
- Speech synthesis using LPC☆22Updated 4 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated 2 weeks ago
- PodcastMix: a dataset for separating music and speech in podcasts.☆44Updated 10 months ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Updated 2 years ago