msalhab96 / Listen-Attend-and-Spell
PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper
☆12Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Listen-Attend-and-Spell
- Example python scripts to evaluate various ASR methods☆12Updated 2 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated 5 months ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆13Updated 2 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆34Updated 11 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆12Updated 2 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year
- A simple command line tool to calculate WER for ASR.☆13Updated last month
- ☆27Updated 2 years ago
- ☆16Updated 2 years ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆37Updated 5 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆50Updated 2 weeks ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆16Updated 4 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆19Updated 2 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆17Updated 3 months ago
- Speech synthesis using LPC☆19Updated 3 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆13Updated 5 months ago
- ☆29Updated 2 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆21Updated 2 months ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆16Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago
- ☆23Updated 5 months ago
- Baseline kaldi script for UA-SPEECH corpus☆29Updated last month
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆31Updated 5 months ago
- ☆27Updated last year
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Updated 4 months ago
- ClearVoice☆13Updated this week
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 4 years ago