MiuLab / TaylorGANLinks
☆31Updated 4 years ago
Alternatives and similar repositories for TaylorGAN
Users that are interested in TaylorGAN are comparing it to the libraries listed below
Sorting:
- Code:Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Mod…☆25Updated 5 years ago
- ☆15Updated 3 years ago
- Meta-Learning for End-to-End ASR☆10Updated 4 years ago
- TensorFlow Implementation of CDVAE-VC.☆54Updated 2 years ago
- ☆10Updated 2 years ago
- Unsupervised spoken sentence embeddings☆14Updated 2 years ago
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus☆11Updated 2 years ago
- Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…☆35Updated last year
- ☆20Updated 4 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 6 years ago
- ☆10Updated 6 years ago
- My vim comfiguration☆43Updated 3 weeks ago
- Stellenbosch University ZeroSpeech 2019 System☆10Updated 6 years ago
- PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences☆11Updated 5 years ago
- A spoken question answering dataset on SQUAD☆49Updated last month
- ☆22Updated 5 years ago
- ASR text preprocessing utility☆21Updated 10 months ago
- Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816☆45Updated 4 years ago
- ☆16Updated this week
- PerformanceNet: Score-to-Audio Music Generation with Multi-Band Convolutional Residual Network☆110Updated last year
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 4 years ago
- Taiwanese Speech Synthesis with Tacotron2☆20Updated 2 years ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated 2 years ago
- A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model☆110Updated 5 years ago
- ☆23Updated 7 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Updated 4 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆15Updated 6 months ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 2 months ago
- Non-Autoregressive Predictive Coding☆51Updated 4 years ago