Bill13579 / beltoutLinks
BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC
☆79Updated 5 months ago
Alternatives and similar repositories for beltout
Users that are interested in beltout are comparing it to the libraries listed below
Sorting:
- Unofficial WIP LoRa Finetuning repository for VibeVoice☆325Updated 3 months ago
- Echo-TTS inference codebase☆71Updated last month
- Fast audio super resolution from 16khz to 48khz.☆177Updated last week
- Fork of ACE-Step for LoRA training with < 10 GB VRAM☆60Updated last month
- ☆296Updated 5 months ago
- YuE with mp3 extend, exllama and GUI☆64Updated 10 months ago
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…☆140Updated last month
- Examples of using the llasa-tts models locally☆182Updated 8 months ago
- JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment☆147Updated 5 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆326Updated 3 weeks ago
- Gradio UI for YuE☆88Updated 9 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆132Updated 5 months ago
- ✨ High‑quality music audio enhancement for ComfyUI: FlashSR Super‑Resolution + Fat Llama spectral enhancement (GPU & CPU).☆43Updated 2 weeks ago
- Awesome music generation model——MG²☆165Updated 9 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57Updated 7 months ago
- The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement☆725Updated last month
- Text-to-Music Generation with Rectified Flow Transformer☆64Updated 7 months ago
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆109Updated last month
- A lightning fast audio upsampler.☆224Updated this week
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆81Updated last year
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆95Updated last month
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆45Updated last year
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project☆34Updated 2 years ago
- SoTA open-source TTS☆131Updated 7 months ago
- ☆135Updated 10 months ago
- Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching☆141Updated 2 months ago
- StyleTTS 2 Optimized Training Fork☆33Updated 11 months ago
- Higgs Audio v2 WebUI + One click installer WIN x64☆19Updated 5 months ago
- ☆187Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆127Updated 5 months ago