Wataru-Nakata/ssl-vocoders

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Wataru-Nakata/ssl-vocoders)

Wataru-Nakata / ssl-vocoders

Implementation of vocoders empowered with pytorch lightning

☆18

Alternatives and similar repositories for ssl-vocoders

Users that are interested in ssl-vocoders are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
yukara-ikemiya / wavefit-pytorch
View on GitHub
PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.
☆70Jul 13, 2026Updated 2 weeks ago
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
VOICEVOX / voicevox_blog
View on GitHub
VOICEVOX公式サイトを作るコードです
☆14Jul 18, 2026Updated last week
sarulab-speech / multi-speaker-dgp
View on GitHub
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Mar 23, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
Wataru-Nakata / miipher
View on GitHub
Unofficial implementation of miipher
☆137Apr 19, 2024Updated 2 years ago
X-LANCE / UniCATS-CTX-txt2vec
View on GitHub
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
☆64Nov 18, 2024Updated last year
sholokhovalexey / online-speaker-clustering
View on GitHub
[ICASSP'23] Online speaker clustering
☆18Feb 22, 2026Updated 5 months ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
7Xin / DPI-TTS
View on GitHub
☆13Sep 12, 2024Updated last year
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
prml-lab-speech-team / demo
View on GitHub
☆26Aug 8, 2024Updated last year
Open-Speech-EkStep / data-acquisition-pipeline
View on GitHub
☆18Apr 28, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zjwang21 / mix-phoneme-bert
View on GitHub
An unofficial PyTorch implementation of Mix-Phoneme-Bert
☆40Jul 10, 2023Updated 3 years ago
6gsn / marine
View on GitHub
☆38Sep 20, 2022Updated 3 years ago
thelinhbkhn2014 / Text2PhonemeSequence
View on GitHub
☆53Aug 28, 2024Updated last year
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
hbredin / pyannotebook
View on GitHub
🎹 pyannote + 🗒 notebook = pyannotebook
☆27Jun 12, 2023Updated 3 years ago
Sreyan88 / ReCLAP
View on GitHub
☆33Dec 23, 2025Updated 7 months ago
lab260ru / balalaika
View on GitHub
[INTERSPEECH 2026] Official code for "Balalaika: Data-Centric, Prosody-Aware Annotation Pipeline for Russian Speech"
☆21Jul 19, 2026Updated last week
Wataru-Nakata / latentlm-tts
View on GitHub
☆29Jul 3, 2026Updated 3 weeks ago
lifeiteng / TTS-TextAnalyzer
View on GitHub
TTS Text Analyzer
☆31Jul 20, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
nobutaka-ito / pulse
View on GitHub
Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)
☆43Jul 24, 2023Updated 3 years ago
pengzhendong / streaming-asr
View on GitHub
One command to start a streaming ASR server.
☆12Oct 2, 2024Updated last year
pengzhendong / audio-pipeline
View on GitHub
☆23Oct 17, 2024Updated last year
merlresearch / reverberation-as-supervision
View on GitHub
Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation
☆15Aug 1, 2024Updated last year
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
nttrd-mdlab / wearable-seld-dataset
View on GitHub
☆10Feb 18, 2022Updated 4 years ago
WangHelin1997 / Automatic_Speech_Annotator
View on GitHub
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…
☆33Jun 14, 2024Updated 2 years ago
Hiroshiba / vv_core_inference
View on GitHub
VOICEVOXのコア内で用いられているディープラーニングモデルの推論コード
☆31Dec 3, 2025Updated 7 months ago
yl4579 / PitchExtractor
View on GitHub
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
☆152Aug 22, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
View on GitHub
☆13Mar 23, 2026Updated 4 months ago
projecte-aina / oTranscribe-plus
View on GitHub
A free & open tool for transcribing audio interviews with offline ASR support
☆25Dec 21, 2023Updated 2 years ago
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
qiuqiao / DDSP-HiFiGAN
View on GitHub
基于PC-DDSP和nsf-HiFiGAN的声码器
☆19Jul 17, 2023Updated 3 years ago
BUTSpeechFIT / DVBx
View on GitHub
Discriminative Training of VBx Diarization
☆28Sep 23, 2024Updated last year
imdanboy / jets
View on GitHub
JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
☆111Jun 6, 2022Updated 4 years ago
pengzhendong / streaming-vocos
View on GitHub
Streaming Vocos
☆31Jun 10, 2025Updated last year