Cross-Product-Labs / csm_finetuneLinks
Finetune Sesame's CSM 1B model, for fun and profit
☆16Updated 3 months ago
Alternatives and similar repositories for csm_finetune
Users that are interested in csm_finetune are comparing it to the libraries listed below
Sorting:
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆56Updated last month
- ☆39Updated last year
- ☆14Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 8 months ago
- ☆23Updated 8 months ago
- Examples of using the llasa-tts models locally☆175Updated 2 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆26Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆90Updated last month
- An open source real-time AI inference engine for seamless scaling☆19Updated 2 weeks ago
- Streaming and Fine-tuning for Chatterbox TTS☆128Updated last month
- ☆21Updated 10 months ago
- Win & Liunux Gradio WebUI for CSM-1B model by sesame☆46Updated 4 months ago
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆20Updated 9 months ago
- ☆13Updated 9 months ago
- ☆32Updated last year
- Playing with CSM☆22Updated 4 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆81Updated 9 months ago
- finetune your florence2 model easy☆20Updated 11 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Updated 10 months ago
- SoTA open-source TTS☆63Updated last month
- ☆40Updated last year
- ☆24Updated last year
- ☆16Updated last year
- ☆13Updated last year
- ☆37Updated last year
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆24Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆41Updated 2 weeks ago
- ☆24Updated last year
- Text-to-Music Generation with Rectified Flow Transformer☆64Updated last month
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆31Updated 7 months ago