sunprinceS / MetaASR-CrossAccent
Meta-Learning for End-to-End ASR
☆10Updated 4 years ago
Alternatives and similar repositories for MetaASR-CrossAccent:
Users that are interested in MetaASR-CrossAccent are comparing it to the libraries listed below
- ☆22Updated 4 years ago
- Code:Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Mod…☆25Updated 5 years ago
- An evaluation toolkit for voice conversion models.☆40Updated 3 years ago
- Stellenbosch University ZeroSpeech 2019 System☆10Updated 5 years ago
- Vector Quantized Autoregressive Predictive Coding (VQ-APC)☆35Updated 4 years ago
- ☆20Updated 3 years ago
- A spoken question answering dataset on SQUAD☆45Updated 2 years ago
- Meta-learning model agnostic (MAML) implementation for cross-accented ASR☆44Updated last year
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆29Updated 5 years ago
- Non-Autoregressive Predictive Coding☆50Updated 4 years ago
- ☆53Updated 4 years ago
- The official repository for Audio ALBERT☆64Updated 3 years ago
- ☆36Updated 2 years ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆98Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated 2 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆49Updated last year
- Official Implementation of Mockingjay in Pytorch☆53Updated last year
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 4 years ago
- Tacotron2 with Global Style Tokens☆64Updated 5 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆70Updated 2 years ago
- Dataset and baseline for the first Audiocaption task☆79Updated 6 months ago
- Gaussian Mixture VAE Tacotron☆53Updated last year
- Alignment files of LibriTTS.☆61Updated 4 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- ☆36Updated 3 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- An implementation of SkipVQVC with various settings.☆75Updated 4 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models.☆20Updated 5 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 2 years ago