Finetune Sesame's CSM 1B model, for fun and profit
☆17Mar 24, 2025Updated 11 months ago
Alternatives and similar repositories for csm_finetune
Users that are interested in csm_finetune are comparing it to the libraries listed below
Sorting:
- Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa…☆18Mar 18, 2025Updated 11 months ago
- An implementation of the Anthropic's paper and essay on "A statistical approach to model evaluations"☆17Oct 6, 2025Updated 5 months ago
- finetune your florence2 model easy☆21Jul 27, 2024Updated last year
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated 11 months ago
- Win & Liunux Gradio WebUI for CSM-1B model by sesame☆52Mar 17, 2025Updated 11 months ago
- ☆21Apr 6, 2025Updated 11 months ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- ☆37Sep 21, 2025Updated 5 months ago
- ComfyUI workflows☆87Dec 19, 2025Updated 2 months ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Jan 12, 2026Updated last month
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project☆34Jul 21, 2023Updated 2 years ago
- A car Heads Up Display built using a RGB LED strip and a Teensy microcontroller☆10Jul 5, 2017Updated 8 years ago
- ☆24Jan 26, 2026Updated last month
- A simple lightweight library for text normalization for Indian Languages☆16Sep 30, 2025Updated 5 months ago
- Used GPT for Realtime AI (Artificial intelligence) tutor to help students, learn by talking screenshots of there work.☆13May 14, 2024Updated last year
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆18Feb 25, 2026Updated last week
- ☆23Jan 25, 2026Updated last month
- Connect CommandFusion iViewer to Crestron processors☆19Aug 19, 2017Updated 8 years ago
- Advanced drum machine for ComfyUI featuring a 64-step sequencer, custom sample support, and retro hardware aesthetics.☆20Jan 19, 2026Updated last month
- Phase Vocoder and Wavelet Transform Implementation for Pitch Shifting a sound signal☆11Jul 27, 2020Updated 5 years ago
- Using large language models to maintain AI_CHANGELOG.md☆14Jul 15, 2024Updated last year
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Mar 24, 2023Updated 2 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- ☆10Feb 23, 2026Updated 2 weeks ago
- Allows Smartthings to control a Particle Photon (spark core) as a 8 relay switch☆10Apr 5, 2016Updated 9 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 5 months ago
- Devcon Systems☆14Sep 8, 2018Updated 7 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated 11 months ago
- ☆13Nov 22, 2022Updated 3 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- An example AWS SAM app showing how to deploy a fastai app using Lambda Container feature☆13Dec 6, 2020Updated 5 years ago
- ComfyUI custom node implementation of VideoMaMa for video matting with mask conditioning.☆40Feb 9, 2026Updated last month
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- ☆10Apr 8, 2024Updated last year
- Use Discord as your interface for ollama☆12Jan 30, 2024Updated 2 years ago
- A chat implementation for FastHTML☆11Sep 14, 2025Updated 5 months ago
- QuadTree Compression for ComputerCraft Videos☆15Jan 4, 2022Updated 4 years ago