eastonYi/Unsupervised-ASR

Unsupervised ASR (mainly a phone classifier) using EODM and GAN

☆12

Alternatives and similar repositories for Unsupervised-ASR

Users who are interested in Unsupervised-ASR are comparing it to the repositories listed below.

zqs01 / data2vecnoisy
☆11 · Oct 20, 2022 · Updated 3 years ago
jjery2243542 / semi-supervised-ASR
☆10 · Dec 16, 2018 · Updated 7 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12 · Mar 23, 2021 · Updated 5 years ago
isl-mt / fluent-fisher
☆15 · Jun 17, 2019 · Updated 7 years ago
babe269 / performant
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33 · Sep 2, 2022 · Updated 3 years ago
Deepest-Project / AlignTTS
Implementation of AlignTTS
☆77 · Jul 6, 2023 · Updated 3 years ago
meelement / noise_adversarial_tacotron
Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…
☆17 · Aug 15, 2019 · Updated 6 years ago
LvHang / pitch
A standalone pitch extractor
☆13 · Oct 19, 2017 · Updated 8 years ago
chqiwang / transformer
A TensorFlow Implementation of the Transformer for machine translation.
☆24 · Dec 27, 2018 · Updated 7 years ago
YUCHEN005 / RATS-Channel-A-Speech-Data
This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…
☆16 · Oct 22, 2022 · Updated 3 years ago
MarkWuNLP / SemanticMask
The repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"
☆39 · Jun 9, 2020 · Updated 6 years ago
aispeech-lab / w2v-cif-bert
☆37 · Jun 28, 2021 · Updated 5 years ago
ikuinen / regularized_two-branch_proposal_network
☆20 · Feb 21, 2022 · Updated 4 years ago
ttaoREtw / semi-tts
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39 · Jul 16, 2020 · Updated 6 years ago
CODEJIN / Glow_TTS
An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal.
☆55 · Sep 14, 2022 · Updated 3 years ago
athena-team / athena-decoder
☆76 · Mar 18, 2022 · Updated 4 years ago
ericwudayi / SkipVQVC
An implementation of SkipVQVC with various settings.
☆75 · Jun 22, 2020 · Updated 6 years ago
ICLR-DAP / Deep-Audio-Prior
Anonymous ICLR Submission
☆14 · Sep 25, 2019 · Updated 6 years ago
MTG / singing-synthesis-demos
Sound examples for the Neural Parametric Singing Synthesizer (NPSS)
☆23 · Feb 24, 2022 · Updated 4 years ago
getalp / mass-dataset
MaSS - Multilingual corpus of Sentence-aligned Spoken utterances
☆50 · Sep 16, 2024 · Updated last year
zjumml / DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆10 · Mar 8, 2022 · Updated 4 years ago
qiujiali / lattice-rescore
☆16 · Jun 13, 2022 · Updated 4 years ago
voidful / SpeechMix
Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together
☆46 · Jul 3, 2025 · Updated last year
BUTSpeechFIT / ASR-hybrid-decoding
☆17 · Nov 25, 2019 · Updated 6 years ago
autosimtrans / SimulTransBaseline
Sample code for the AutoSimulTrans Workshop (https://autosimtrans.github.io)
☆18 · Dec 25, 2020 · Updated 5 years ago
Deepest-Project / Transformer-TTS
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆64 · Jul 6, 2023 · Updated 3 years ago
bzhangGo / st_from_scratch
Revisiting End-to-End Speech-to-Text Translation From Scratch
☆13 · Feb 21, 2023 · Updated 3 years ago
salu133445 / bach-violin-dataset
A collection of high-quality public recordings of Bach's sonatas and partitas for solo violin (BWV 1001–1006)
☆39 · Feb 19, 2022 · Updated 4 years ago
asappresearch / wav2seq
Official code for Wav2Seq
☆97 · Jul 19, 2022 · Updated 4 years ago
NickRuiz / power-asr
Phonetically-Oriented Word Error Rate
☆36 · May 4, 2019 · Updated 7 years ago
Dianezzy / ParaLip
Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022; Official code
☆109 · May 1, 2022 · Updated 4 years ago
DDMAL / jSymbolic2
2nd Version of jSymbolic
☆35 · Jun 24, 2026 · Updated last month
Alexander-H-Liu / NPC
Non-Autoregressive Predictive Coding
☆51 · Nov 3, 2020 · Updated 5 years ago
atosystem / SpeechCLIP
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
☆120 · Nov 25, 2022 · Updated 3 years ago
tbright17 / accent-feat
Feature extraction for accented or pathological speech
☆18 · Apr 2, 2019 · Updated 7 years ago
hhhaaahhhaa / ASR-TTA
☆16 · Nov 4, 2025 · Updated 8 months ago
open-speech / cn-text-normalizer
A Python module that converts written Chinese strings to their spoken form.
☆124 · Oct 8, 2019 · Updated 6 years ago
voidful / asrp
ASR text preprocessing utility
☆21 · Aug 5, 2024 · Updated last year
ShigekiKarita / espnet-semi-supervised
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…
☆38 · Feb 13, 2020 · Updated 6 years ago