uthree / tinyvc
a lightweight voice conversion
☆78Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for tinyvc
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆70Updated 7 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 5 months ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆60Updated last month
- E2E TTS using Conditional Flow Matching (Experimental*)☆66Updated last year
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆54Updated 7 months ago
- Adaptive Vocoder for Custom Voice☆58Updated 2 years ago
- All generative model in one for better TTS model☆66Updated 2 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- The open source code for SimpleSpeech series☆111Updated last month
- BigVGAN with Neural Source-Filter☆50Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆56Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆76Updated last year
- The official implementation of EmoSphere-TTS☆85Updated 3 months ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆51Updated last year
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform☆136Updated last year
- Implementation of Emo-StarGAN☆46Updated 11 months ago
- ☆104Updated last month
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆28Updated 3 months ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆119Updated 2 years ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆78Updated 4 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆93Updated 2 weeks ago
- The official implementation of EmoSphere++☆27Updated 2 weeks ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆46Updated last year
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 9 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated last year
- ☆65Updated last year
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆80Updated last month
- Huawei Grad-TTS for Chinese☆45Updated last year
- Source code of APNet2, a vocoder☆51Updated 11 months ago
- A sequence-to-sequence voice conversion toolkit.☆86Updated 4 months ago