msalhab96 / Listen-Attend-and-Spell
PyTorch implementation of the Listen, Attend and Spell (LAS) speech recognition paper
☆12Updated 3 years ago
Alternatives and similar repositories for Listen-Attend-and-Spell
Users who are interested in Listen-Attend-and-Spell are comparing it to the repositories listed below
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"☆21Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 4 years ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo☆19Updated 4 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆13Updated last year
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 5 years ago
- WeNet online decode demo☆30Updated 4 years ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 3 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆43Updated 4 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆16Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆31Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆14Updated 8 months ago
- PyTorch version of LPCNet☆21Updated 4 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆49Updated last week
- C++ implementation of end-to-end TTS that combines Tacotron2 and the LPCNet vocoder.☆32Updated 5 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆20Updated last year
- [Interspeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆19Updated 10 months ago
- The implementation of g2pL with a new open dataset.☆16Updated 2 years ago
- with alignment learning and continuous wavelet transform☆21Updated 2 years ago
- Clustering-based methods for overlapping diarization☆80Updated last year
- CDER (Conversational Diarization Error Rate) Scoring Tool☆21Updated 2 years ago