hrnoh24 / stream-vc
An unofficial PyTorch implementation of StreamVC (Real-Time Low-Latency Voice Conversion)
☆120 Updated 9 months ago
Alternatives and similar repositories for stream-vc
Users who are interested in stream-vc are comparing it to the libraries listed below.
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION". ☆67 Updated last month
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning ☆135 Updated 11 months ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis ☆134 Updated 4 months ago
- Official Implementation of StyleTTS-VC ☆179 Updated 4 months ago
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions ☆92 Updated last year
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions ☆76 Updated 7 months ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers ☆118 Updated last month
- ☆134 Updated 3 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆95 Updated 7 months ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter ☆91 Updated 10 months ago
- An unofficial PyTorch implementation of VALL-E ☆87 Updated last week
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆132 Updated last year
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec" ☆159 Updated 7 months ago
- The open source code for SimpleSpeech series ☆138 Updated 7 months ago
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform ☆161 Updated 4 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆119 Updated 2 years ago
- VoiceLDM: Text-to-Speech with Environmental Context ☆175 Updated 9 months ago
- ☆69 Updated last year
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3 ☆195 Updated last year
- A sequence-to-sequence voice conversion toolkit. ☆97 Updated 10 months ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E ☆138 Updated 6 months ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ☆146 Updated last year
- ☆74 Updated 3 months ago
- High-Fidelity Neural Phonetic Posteriorgrams ☆112 Updated 2 months ago
- Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech" ☆190 Updated last year
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone ☆141 Updated last year
- Train the next generation of TTS systems. ☆165 Updated 8 months ago
- Monotonic Alignment Search ☆91 Updated 2 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations ☆140 Updated last year
- ☆68 Updated 8 months ago