Cog wrapper for Coqui / xtts-v2
☆82Nov 25, 2024Updated last year
Alternatives and similar repositories for cog-xtts-v2
Users that are interested in cog-xtts-v2 are comparing it to the libraries listed below.
- Taming Stable Diffusion for Lip Sync!☆16Mar 18, 2025Updated last year
- Convert an audio file to a waveform video☆11Nov 10, 2023Updated 2 years ago
- Rembg is a tool to remove images background.☆31Dec 2, 2022Updated 3 years ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆11Sep 13, 2023Updated 2 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆11Nov 20, 2023Updated 2 years ago
- ☆18Jan 17, 2025Updated last year
- ☆13Oct 14, 2024Updated last year
- A ComfyUI image generation integration for oobabooga's Text Generation WebUI☆15Aug 12, 2025Updated 8 months ago
- DALL.E image generation in MaxMSP☆10Mar 22, 2023Updated 3 years ago
- ImageBind One Embedding Space to Bind Them All☆26May 19, 2023Updated 2 years ago
- An extension to use Kokoro TTS in text generation webui☆22May 5, 2025Updated 11 months ago
- ☆12Jan 5, 2024Updated 2 years ago
- A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.☆47Dec 1, 2023Updated 2 years ago
- A powerful ComfyUI custom node that brings Google's Gemini TTS capabilities directly to your workflow. Generate high-quality speech with …☆21May 23, 2025Updated 10 months ago
- Cog wrapper for FalconsAi / nsfw_image_detection☆18Aug 6, 2025Updated 8 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆25Jan 4, 2024Updated 2 years ago
- Instant voice cloning by MyShell.☆26Apr 28, 2024Updated last year
- Run controlnet with flux☆17Oct 8, 2024Updated last year
- Cog template for Stable Diffusion 3 (ComfyUI implementation)☆17Jul 16, 2024Updated last year
- Continuous descriptor-based control for deep audio synthesis☆23Aug 4, 2023Updated 2 years ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 5 months ago
- Colab for training 1.5 and SDXL Loras based on Derrian Distro's Lora_Easy_Training_scripts_Backend☆18Mar 13, 2026Updated 3 weeks ago
- ☆44Oct 29, 2025Updated 5 months ago
- ☆81Mar 2, 2025Updated last year
- Cog wrapper for AI-toolkit LoRA training☆35Aug 15, 2024Updated last year
- pdb's function and global vars to offset☆10Apr 11, 2023Updated 3 years ago
- ☆84Aug 7, 2024Updated last year
- ☆22Oct 19, 2024Updated last year
- Converter from the civil (modern Russian) script to Church Slavonic☆21Mar 23, 2026Updated 2 weeks ago
- Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild☆18May 15, 2024Updated last year
- driver manual mapper☆13Feb 22, 2020Updated 6 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- The serverside backend created for use with the LoRA Easy Training Scripts Frontend☆24May 29, 2025Updated 10 months ago
- Example Windows Kernel-mode Driver which finds process ID by executable file name.☆18Nov 23, 2019Updated 6 years ago
- ☆83Jun 30, 2024Updated last year
- Add caption to any video☆49Feb 19, 2024Updated 2 years ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆398Dec 6, 2024Updated last year
- Max/MSP external wrapper for the PESTO streaming pitch estimation model☆47Jul 24, 2025Updated 8 months ago
- ☆24Jun 6, 2025Updated 10 months ago