msalhab96 / Listen-Attend-and-SpellLinks
PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper
☆12Updated 3 years ago
Alternatives and similar repositories for Listen-Attend-and-Spell
Users that are interested in Listen-Attend-and-Spell are comparing it to the libraries listed below
Sorting:
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆47Updated 4 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆44Updated 4 years ago
- ☆19Updated last year
- Simple PyTorch Denoisers for Waveform Audio☆36Updated 2 months ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 3 years ago
- multilingual speech aligner☆77Updated 2 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- Online streaming speaker change detection model in Pytorch☆43Updated 2 years ago
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Updated 2 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated 2 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆63Updated last year
- ☆25Updated last year
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 3 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- ☆11Updated 2 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 5 years ago
- torch version of LPCNet☆21Updated 5 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Updated 2 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Updated 3 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆51Updated 5 months ago
- ☆28Updated 2 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- ☆30Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- ☆54Updated 2 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Updated 2 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆83Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Updated 6 months ago
- ☆26Updated last year
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆31Updated 2 years ago