LWprogramming / audiolm-pytorch-training
audiolm-pytorch training code
☆15 Updated last year
Related projects
Alternatives and complementary repositories for audiolm-pytorch-training
- ☆45 Updated 4 months ago
- ☆14 Updated last year
- An open-source Kazakh Emotional Text-to-Speech Dataset ☆25 Updated 7 months ago
- singing voice conversion without f0 ☆22 Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake… ☆56 Updated last year
- ☆81 Updated 2 months ago
- ☆33 Updated last year
- My vocoder experiments ☆21 Updated last month
- Official Code for ParrotTTS ☆42 Updated last month
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆51 Updated last year
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis ☆28 Updated 3 weeks ago
- ☆32 Updated 2 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024) ☆56 Updated 3 weeks ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems. ☆37 Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆70 Updated 7 months ago
- E2E TTS using Conditional Flow Matching (Experimental*) ☆66 Updated last year
- 4G GPU & 10 Minutes for train ☆12 Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆44 Updated 3 months ago
- GPT for FACodec ☆13 Updated 7 months ago
- ☆39 Updated last year
- Create training data for training a voice cloner for bark text to speech. ☆44 Updated last year
- Zero-Shot Emotion Style Transfer ☆37 Updated 7 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆66 Updated last week
- ☆34 Updated 7 months ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b… ☆16 Updated 10 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment ☆25 Updated 2 weeks ago
- Codebase and project page for EDMSound ☆29 Updated last year
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs ☆14 Updated last year