sunprinceS / MetaASR-CrossAccent
Meta-Learning for End-to-End ASR
☆10Updated 4 years ago
Alternatives and similar repositories for MetaASR-CrossAccent:
Users that are interested in MetaASR-CrossAccent are comparing it to the libraries listed below
- ☆20Updated 3 years ago
- ☆22Updated 4 years ago
- PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences☆11Updated 5 years ago
- Code:Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Mod…☆25Updated 5 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated 2 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆30Updated 5 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- TensorFlow Implementation of CDVAE-VC.☆54Updated 2 years ago
- A spoken question answering dataset on SQUAD☆47Updated 2 years ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆99Updated 2 weeks ago
- Stellenbosch University ZeroSpeech 2019 System☆10Updated 6 years ago
- Dataset and baseline for the first Audiocaption task☆79Updated 9 months ago
- An evaluation toolkit for voice conversion models.☆42Updated 3 years ago
- Non-Autoregressive Predictive Coding☆51Updated 4 years ago
- An implementation of SkipVQVC with various settings.☆75Updated 4 years ago
- Meta-learning model agnostic (MAML) implementation for cross-accented ASR☆44Updated last year
- Vector Quantized Autoregressive Predictive Coding (VQ-APC)☆36Updated 4 years ago
- Gaussian Mixture VAE Tacotron☆53Updated last year
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 4 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated 2 years ago
- Tacotron2 with Global Style Tokens☆63Updated 6 years ago
- ☆15Updated 3 years ago
- The official repository for Audio ALBERT☆65Updated 3 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆38Updated 4 years ago
- ☆52Updated 4 years ago
- Instructions on downloading and using the LibriAdapt dataset☆46Updated 3 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆141Updated 2 years ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Updated 2 years ago
- ☆36Updated 2 years ago