msalhab96 / Listen-Attend-and-Spell
PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper
☆12 Updated 3 years ago
Alternatives and similar repositories for Listen-Attend-and-Spell
Users who are interested in Listen-Attend-and-Spell are comparing it to the repositories listed below
- ☆19 Updated last year
- ☆11 Updated 2 years ago
- Simple PyTorch Denoisers for Waveform Audio ☆36 Updated last month
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA ☆16 Updated last year
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 Updated 3 years ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer" ☆21 Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆21 Updated 2 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution ☆53 Updated 3 years ago
- A simple command line tool to calculate WER for ASR. ☆14 Updated last year
- Example Python scripts to evaluate various ASR methods ☆12 Updated 3 years ago
- ☆26 Updated last month
- ☆25 Updated last year
- Whisper Speech Quality Assessment (WhiSQA) ☆16 Updated last month
- ☆26 Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-… ☆31 Updated last year
- NSNet2 Deep Noise Suppression (DNS) package ☆38 Updated 3 years ago
- ☆29 Updated 3 years ago
- An unofficial PyTorch implementation of "HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversari… ☆23 Updated 4 years ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings ☆18 Updated 5 months ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition. ☆47 Updated 4 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 Updated 4 years ago
- PyTorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder" ☆29 Updated 5 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. ☆14 Updated 3 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 Updated 3 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆32 Updated 2 years ago
- Speech synthesis using LPC ☆23 Updated 4 years ago
- Deep Speech Distances PyTorch ☆29 Updated 3 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 Updated 6 years ago
- Prosody and Pronunciation Modification Network ☆59 Updated 6 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning ☆21 Updated 2 years ago