devilismyfriend / ozen-toolkitView external linksLinks
Audio datasets, easier.
☆86Aug 19, 2023Updated 2 years ago
Alternatives and similar repositories for ozen-toolkit
Users that are interested in ozen-toolkit are comparing it to the libraries listed below
Sorting:
- TorToiSe fine-tuning with DLAS☆226Aug 1, 2024Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 4 months ago
- Fast TorToiSe inference (5x or your money back!)☆830Jul 10, 2024Updated last year
- ☆10Jan 10, 2024Updated 2 years ago
- Faster Tortoise inference then Tortoise Fast Fork☆127Apr 21, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Dec 24, 2022Updated 3 years ago
- Speech-To-Text Prompter, an extension for stable-diffusion-webui using the Whisper model☆11Mar 14, 2023Updated 2 years ago
- Next-generation, fully open-source refacer. Images. GIFs. TIFFs. Full-length videos. Bulk refacing☆41May 16, 2025Updated 8 months ago
- Automatic1111 to InvokeAI prompt resolver☆17Jun 29, 2024Updated last year
- Demo workflows for changing outfits in AnimateDiff videos☆13Dec 5, 2023Updated 2 years ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's☆14Jun 24, 2023Updated 2 years ago
- ☆14Jun 23, 2024Updated last year
- AI Voice Cloning Desktop Application that runs locally on your computer and doesn't cost anything to run☆45Nov 26, 2025Updated 2 months ago
- Finetune Sesame's CSM 1B model, for fun and profit☆17Mar 24, 2025Updated 10 months ago
- ☆46Updated this week
- Personal GPEN scripts within the GPEN-Windows stand-alone package.☆20Jun 5, 2022Updated 3 years ago
- Much simpler client for Stable Diffusion WebUI☆16Feb 10, 2025Updated last year
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Jan 12, 2025Updated last year
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Mar 24, 2023Updated 2 years ago
- Vid Driven Portrait Animation 🤢😷☆18Jul 7, 2024Updated last year
- Expose your workflows into HTTP endpoints directly from ComfyUI itself.☆26Oct 17, 2025Updated 3 months ago
- A web app that lets you play around with TalkNet models☆124Jul 31, 2023Updated 2 years ago
- 🤯 Lobe Chat - Warning: Quick and dirty fork to enable lobe-chat to send ComfyUI api request + receive image link + local TTS☆21Nov 1, 2024Updated last year
- ☆20Mar 16, 2023Updated 2 years ago
- Finetuning SD in style.☆682Apr 1, 2023Updated 2 years ago
- ☆24Sep 27, 2022Updated 3 years ago
- Easily share your custom workflows for anyone to run☆22Oct 17, 2024Updated last year
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- A SwarmUI extension that adds parameters for ReActor to the the generate tab☆27Jan 4, 2026Updated last month
- AUTOMATIC1111 webUI + Krita Plugin with superb Inpainting☆88Nov 6, 2022Updated 3 years ago
- This are a series of ComfyUI workflows that work together to create and repurpose animation☆39Aug 10, 2025Updated 6 months ago
- Updated fork of wav2lip-hq allowing for the use of current ESRGAN models☆54May 6, 2024Updated last year
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated 10 months ago
- ☆31Jun 15, 2024Updated last year
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆3,342Aug 24, 2025Updated 5 months ago
- A tutorial about cloning gosameday.com☆29Oct 11, 2025Updated 4 months ago
- ☆132Jan 21, 2026Updated 3 weeks ago
- ☆784Jun 9, 2025Updated 8 months ago