msalhab96 / Listen-Attend-and-SpellLinks
PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper
☆12Updated 3 years ago
Alternatives and similar repositories for Listen-Attend-and-Spell
Users that are interested in Listen-Attend-and-Spell are comparing it to the libraries listed below
Sorting:
- Example python scripts to evaluate various ASR methods☆12Updated 3 years ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- ☆11Updated 2 years ago
- ☆18Updated 10 months ago
- ☆23Updated last year
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆49Updated last month
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo☆19Updated 4 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆31Updated 2 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆20Updated 9 months ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆27Updated 10 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated 2 years ago
- This is the project page of our paper "MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion".☆11Updated 4 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago
- Text frontend for ESPnet tts recipes☆34Updated 4 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated 2 months ago
- Crowdsourced and Automatic Speech Prominence Estimation☆21Updated last year
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Updated last year
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆21Updated last year
- ☆45Updated 2 years ago
- ☆12Updated 9 months ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 2 months ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆14Updated 7 months ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆76Updated 2 years ago
- PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer …☆36Updated 4 years ago
- ☆17Updated 10 months ago
- torch version of LPCNet☆21Updated 5 years ago