unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12Oct 22, 2020Updated 5 years ago
Alternatives and similar repositories for Unsupervised-ASR
Users that are interested in Unsupervised-ASR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Oct 20, 2022Updated 3 years ago
- ☆10Dec 16, 2018Updated 7 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- ☆15Jun 17, 2019Updated 6 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Aug 15, 2019Updated 6 years ago
- a standalone pitch extractor☆13Oct 19, 2017Updated 8 years ago
- A TensorFlow Implementation of the Transformer for machine translation.☆24Dec 27, 2018Updated 7 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- ☆20Feb 21, 2022Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- An implementation of SkipVQVC with various settings.☆75Jun 22, 2020Updated 5 years ago
- ☆76Mar 18, 2022Updated 4 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆54Sep 14, 2022Updated 3 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Sep 16, 2024Updated last year
- ☆16Jun 13, 2022Updated 3 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆46Jul 3, 2025Updated 8 months ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)☆23Feb 24, 2022Updated 4 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- This is a sample code for AutoSimulTrans Workshop (https://autosimtrans.github.io)☆18Dec 25, 2020Updated 5 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆26Apr 12, 2024Updated last year
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Jul 6, 2023Updated 2 years ago
- Revisiting End-to-End Speech-to-Text Translation From Scratch☆13Feb 21, 2023Updated 3 years ago
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109May 1, 2022Updated 3 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- 2nd Version of jSymbolic☆32Jan 26, 2023Updated 3 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆119Nov 25, 2022Updated 3 years ago
- A collection of high-quality public recordings of Bach's sonatas and partitas for solo violin (BWV 1001–1006)☆39Feb 19, 2022Updated 4 years ago
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- Feature extraction for accented-speech or pathological speech☆18Apr 2, 2019Updated 6 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- A python module that convert chinese written string to read string. 一个python包:将中文书面字符串转换为口语字符串。☆124Oct 8, 2019Updated 6 years ago
- ASR & TTS joint training, asr, tts, machine speech chain☆16Oct 16, 2021Updated 4 years ago