Plachtaa/StreamVoiceAnon

[ICASSP'26] Real-time streaming voice anonymization & voice conversion

☆82

Alternatives and similar repositories for StreamVoiceAnon

Users who are interested in StreamVoiceAnon are comparing it to the libraries listed below.

Jerrister / X-VC
X-VC: Zero-shot Streaming Voice Conversion in Codec Space
☆69 · May 6, 2026 · Updated 2 months ago
Plachtaa / ASTRAL-quantization
speaker-disentangled speech linguistic content quantizer
☆26 · Mar 19, 2025 · Updated last year
P1ping / TokAN-Legacy
☆27 · Jun 22, 2026 · Updated last month
yukara-ikemiya / Open-Miipher-2
PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind
☆70 · Sep 22, 2025 · Updated 10 months ago
AmphionTeam / Emilia-NV
Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"
☆92 · Sep 18, 2025 · Updated 10 months ago
Soul-AILab / SAC
[ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.
☆108 · Nov 1, 2025 · Updated 8 months ago
ASLP-lab / MeanVC
A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows
☆298 · Jan 8, 2026 · Updated 6 months ago
ShawnPi233 / HQ-SVC
Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios" (AAAI 2026)
☆108 · Jun 17, 2026 · Updated last month
k2-fsa / Flow2GAN
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
☆146 · Mar 8, 2026 · Updated 4 months ago
zeyuxie29 / SemanticVocoder
☆28 · Apr 6, 2026 · Updated 3 months ago
SWivid / AUV
An All-in-One Speech, Sound, Music Codec with Single Nested Codebook
☆28 · Oct 11, 2025 · Updated 9 months ago
Berkeley-Speech-Group / RT-VC
☆34 · Mar 29, 2025 · Updated last year
Shy-98 / MELLE
Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"
☆41 · Jun 28, 2025 · Updated last year
sunnyxrxrx / X-Voice
X-Voice
☆177 · Jun 5, 2026 · Updated last month
fluxions-ai / stftvae
Inference for the STFT-VAE continuous audio codec (24kHz, 3.125Hz latent)
☆43 · Jul 12, 2026 · Updated 2 weeks ago
cwx-worst-one / WavTTS
WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling
☆210 · Jun 6, 2026 · Updated last month
zhaoyx239 / X-Translator
☆26 · Jul 21, 2026 · Updated last week
smallbraineng / smalltts
superfast text to speech in any voice
☆62 · Feb 16, 2026 · Updated 5 months ago
nonverbalspeech38k / nonverspeech38k
The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…
☆68 · Dec 26, 2025 · Updated 7 months ago
bigai-nlco / UltraVoice
Official Repository of UltraVoice
☆63 · Oct 28, 2025 · Updated 9 months ago
jiaqili3 / DualCodec
[Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec
☆72 · Mar 11, 2026 · Updated 4 months ago
User-tian / Conan
Official Implementation of "Conan: A Chunkwise Online Network for Zero-Shot Adaptive Voice Conversion"
☆28 · Nov 12, 2025 · Updated 8 months ago
Aria-K-Alethia / laughter-synthesis
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77 · Jul 16, 2023 · Updated 3 years ago
XiaomiMiMo / MiMo-Audio-Tokenizer
A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.
☆145 · Sep 19, 2025 · Updated 10 months ago
mmp-effml-team / mmp_effml_fall_2025
☆17 · Dec 15, 2025 · Updated 7 months ago
kamperh / linearvc
Voice conversion with just linear regression.
☆37 · Sep 25, 2025 · Updated 10 months ago
innnky / MagVITS
VITS with phoneme-level prosody modeling based on MaskGIT
☆85 · Aug 31, 2024 · Updated last year
mmp-practicum-team / mmp_dl_spring
"Introduction to Deep Learning" course for third-year bachelor's students of the MMP department, CMC faculty, Moscow State University (spring semester)
☆34 · Jul 3, 2026 · Updated 3 weeks ago
Lab-MSP / NaturalVoices
☆33 · Oct 28, 2025 · Updated 9 months ago
idiap / knn-tts
Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model
☆36 · Apr 29, 2025 · Updated last year
the-bird-F / Expressive-Vectors
[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
☆40 · Dec 24, 2025 · Updated 7 months ago
ASLP-lab / MeanVC2
☆29 · Jun 9, 2026 · Updated last month
omine-me / LaughterSegmentation
2024 Latest laughter detection & segmentation model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…
☆66 · Sep 1, 2024 · Updated last year
M0RJIQUE / tencdm
☆23 · Oct 6, 2025 · Updated 9 months ago
freds0 / free-svc
[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion
☆95 · Jul 23, 2025 · Updated last year
leto19 / WhiSQA
Whisper Speech Quality Assessment (WhiSQA)
☆16 · Apr 14, 2026 · Updated 3 months ago
RayYuki / CodecBench
☆24 · Nov 16, 2025 · Updated 8 months ago
ZhikangNiu / A-DMA
[INTERSPEECH 2025 Oral] Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"
☆67 · Jun 16, 2025 · Updated last year
FrontierLabs / F5R-TTS
Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"
☆169 · Mar 3, 2026 · Updated 4 months ago