ajaybati / miipher2.0

Reimplementation of Miipher

☆29

Alternatives and similar repositories for miipher2.0

Users who are interested in miipher2.0 are comparing it to the libraries listed below.

pengzhendong / streaming-vocos
Streaming Vocos
☆29 Updated 8 months ago
mcf330 / efts2code
source code of EfficientTTS 2
☆20 Updated last year
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆36 Updated 2 years ago
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆27 Updated 7 months ago
exercise-book-yq / FreeCodec
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆24 Updated last year
thuhcsi / SnakeGAN
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37 Updated 2 years ago
ftshijt / Interspeech2024_DiscreteSpeechChallenge
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
☆32 Updated 2 years ago
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆19 Updated 2 years ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆22 Updated this week
ozspeech / OZSpeech
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45 Updated last year
redmist328 / APNet2
Source code of APNet2, a vocoder
☆58 Updated 2 years ago
yukara-ikemiya / Open-Miipher-2
PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind
☆64 Updated 4 months ago
zengchang233 / CrossSinger
The source code for the paper CrossSinger (asru2023)
☆18 Updated 2 years ago
xinshengwang / robpitch
A pitch detection model trained to be robust against noise and reverberation environments.
☆27 Updated last year
reppy4620 / x-vits
☆14 Updated 6 months ago
zjwang21 / mix-phoneme-bert
An unofficial PyTorch implementation of Mix-Phoneme-Bert
☆40 Updated 2 years ago
the-bird-F / Expressive-Vectors
[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
☆36 Updated last month
lucadellalib / discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
☆24 Updated last year
zengchang233 / xiaoicesing2
The source code for the paper XiaoiceSing2 (interspeech2023)
☆49 Updated 2 years ago
shang0712 / HierTTS
☆46 Updated 2 years ago
ogunlao / glowtts_stdp
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆18 Updated 2 years ago
walker-hyf / FCTalker
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)
☆26 Updated last year
WangHelin1997 / DuTa-VC
Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆37 Updated 2 years ago
Mddct / transformer-vocos
☆36 Updated 5 months ago
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆61 Updated last year
yangdongchao / ALMTokenizer2
The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…
☆42 Updated 5 months ago
SonyResearch / VRVQ
Variable Bitrate Residual Vector Quantization for Audio Coding
☆51 Updated 9 months ago
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18 Updated last year
BiSinger-SVS / BiSinger
Bilingual Singing Voice Synthesis
☆18 Updated last year
exercise-book-yq / Supercodec
☆49 Updated 10 months ago