High-Quality Voice Cloning TTS for 600+ Languages
☆948Apr 3, 2026Updated last week
Alternatives and similar repositories for OmniVoice
Users that are interested in OmniVoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- OmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning, voice design, and multi-speaker dialogue☆98Updated this week
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆935Dec 2, 2025Updated 4 months ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation☆139Mar 8, 2026Updated last month
- ☆25Jun 19, 2025Updated 9 months ago
- LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Ital…☆97Mar 31, 2026Updated last week
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆125Mar 27, 2025Updated last year
- ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice cloning and audio editing with emotion, style, speed control, and m…☆59Dec 4, 2025Updated 4 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆435Sep 13, 2024Updated last year
- ☆19Feb 2, 2023Updated 3 years ago
- Official Code for ParrotTTS☆58Oct 13, 2024Updated last year
- ☆117Mar 24, 2026Updated 2 weeks ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Self-supervised Generative LM-based Voice Conversion☆56Apr 24, 2025Updated 11 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆125Jun 16, 2022Updated 3 years ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14,312Updated this week
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆76Dec 3, 2025Updated 4 months ago
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model☆296Oct 12, 2025Updated 5 months ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆148Jan 1, 2025Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆148Aug 22, 2022Updated 3 years ago
- [INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for …☆172May 20, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆154Mar 3, 2026Updated last month
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆91Jul 23, 2025Updated 8 months ago
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆47Sep 2, 2025Updated 7 months ago
- ☆118Sep 18, 2025Updated 6 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆76Jan 25, 2026Updated 2 months ago
- SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline☆300Jan 19, 2026Updated 2 months ago
- jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2☆20Aug 15, 2025Updated 7 months ago
- The repository extends sam-3d-body with the main purpose of mapping the outputs of the estimation into fbx files, allowing posing and rem…☆23Mar 24, 2026Updated 2 weeks ago
- ☆54Jul 16, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆371Sep 3, 2024Updated last year
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆57Aug 9, 2025Updated 8 months ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 8 months ago
- The official Implementation of PeriodWave and PeriodWave-Turbo☆219Apr 14, 2025Updated 11 months ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆214Sep 19, 2024Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆197Jan 25, 2026Updated 2 months ago