Cross-Product-Labs / csm_finetune
Finetune Sesame's CSM 1B model, for fun and profit
☆13Updated last month
Alternatives and similar repositories for csm_finetune:
Users that are interested in csm_finetune are comparing it to the libraries listed below
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆45Updated 2 weeks ago
- ☆14Updated 10 months ago
- Open TTS models, built for streaming on the edge☆39Updated last month
- ☆24Updated last year
- ☆39Updated 11 months ago
- ☆22Updated 6 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 6 months ago
- ☆12Updated 9 months ago
- ☆32Updated last year
- Win & Liunux Gradio WebUI for CSM-1B model by sesame☆40Updated last month
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆20Updated 7 months ago
- Gradio UI for training video models using finetrainers☆27Updated last week
- ☆13Updated last year
- ☆22Updated last year
- Playing with CSM☆19Updated last month
- ☆12Updated 6 months ago
- ☆23Updated 11 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 3 weeks ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Updated 7 months ago
- finetune your florence2 model easy☆20Updated 9 months ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- An open source real-time AI inference engine for seamless scaling☆18Updated last week
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆104Updated 2 weeks ago
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆30Updated 6 months ago
- ☆18Updated 7 months ago
- ☆13Updated last year
- Wan2.1, quantized and optimized so it fits on your 3090/4090☆30Updated 2 months ago
- Gradio webapp to train AI Video models using Finetrainers☆32Updated last week
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 9 months ago