Cog wrapper for Coqui / xtts-v2
☆82Nov 25, 2024Updated last year
Alternatives and similar repositories for cog-xtts-v2
Users that are interested in cog-xtts-v2 are comparing it to the libraries listed below
Sorting:
- Taming Stable Diffusion for Lip Sync!☆16Mar 18, 2025Updated last year
- Convert an audio file to a waveform video☆11Nov 10, 2023Updated 2 years ago
- In this repository I will be running various experiments on finetune different parts for xtts☆15Jun 22, 2024Updated last year
- nvidia/parakeet-rnnt-1.1b running in Replicate Cog container ⚙️☆16Jan 5, 2024Updated 2 years ago
- Rembg is a tool to remove images background.☆30Dec 2, 2022Updated 3 years ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆11Sep 13, 2023Updated 2 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆11Nov 20, 2023Updated 2 years ago
- Flow control nodes for comfyUI, allowing for more diverse workflows☆13Apr 3, 2025Updated 11 months ago
- A ComfyUI image generation integration for oobabooga's Text Generation WebUI☆15Aug 12, 2025Updated 7 months ago
- Cog implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆12Apr 16, 2025Updated 11 months ago
- ImageBind One Embedding Space to Bind Them All☆26May 19, 2023Updated 2 years ago
- An extension to use Kokoro TTS in text generation webui☆22May 5, 2025Updated 10 months ago
- A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.☆47Dec 1, 2023Updated 2 years ago
- An example of using the OpenAI API in python to automate email responses☆11Feb 13, 2024Updated 2 years ago
- ⚡️ TypeScript Execute: Node.js enhanced to run TypeScript & ESM☆10May 13, 2024Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆25Jan 4, 2024Updated 2 years ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Oct 4, 2024Updated last year
- Cog template for Stable Diffusion 3 (ComfyUI implementation)☆17Jul 16, 2024Updated last year
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 4 months ago
- Colab for training 1.5 and SDXL Loras based on Derrian Distro's Lora_Easy_Training_scripts_Backend☆18Mar 13, 2026Updated last week
- ☆81Mar 2, 2025Updated last year
- pdb's function and global vars to offset☆10Apr 11, 2023Updated 2 years ago
- ☆84Aug 7, 2024Updated last year
- ☆22Oct 19, 2024Updated last year
- Переводилка с гражданского шрифта на ЦСЯ☆21Mar 12, 2026Updated last week
- Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild☆18May 15, 2024Updated last year
- ☆40May 14, 2025Updated 10 months ago
- driver manual mapper☆12Feb 22, 2020Updated 6 years ago
- A simple memory dumper☆13Feb 11, 2020Updated 6 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- Example Windows Kernel-mode Driver which finds process ID by executable file name.☆18Nov 23, 2019Updated 6 years ago
- ☆83Jun 30, 2024Updated last year
- LoRA Explorer model to explore Flux.1[Schnell] with LoRAs☆31Sep 7, 2024Updated last year
- Live audio chats with AI using Groq Llama3-70b and Deepgram Voice☆32Apr 24, 2024Updated last year
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆392Dec 6, 2024Updated last year
- finetune your florence2 model easy☆21Jul 27, 2024Updated last year
- This are a series of ComfyUI workflows that work together to create and repurpose animation☆39Aug 10, 2025Updated 7 months ago
- This is an MCP server that interacts with a PocketBase instance. It allows you to fetch, list, create, update, and manage records and fil…☆31Apr 22, 2025Updated 11 months ago