Cog wrapper for Coqui / xtts-v2
☆82Nov 25, 2024Updated last year
Alternatives and similar repositories for cog-xtts-v2
Users that are interested in cog-xtts-v2 are comparing it to the libraries listed below.
- Taming Stable Diffusion for Lip Sync!☆17Mar 18, 2025Updated last year
- Convert an audio file to a waveform video☆11Nov 10, 2023Updated 2 years ago
- In this repository I will be running various experiments on fine-tuning different parts of xtts☆15Jun 22, 2024Updated last year
- nvidia/parakeet-rnnt-1.1b running in Replicate Cog container ⚙️☆16Jan 5, 2024Updated 2 years ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- ☆14Sep 21, 2022Updated 3 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆10Nov 20, 2023Updated 2 years ago
- ☆18Jan 17, 2025Updated last year
- ☆13Oct 14, 2024Updated last year
- A ComfyUI image generation integration for oobabooga's Text Generation WebUI☆15Aug 12, 2025Updated 10 months ago
- Flow control nodes for comfyUI, allowing for more diverse workflows☆13Apr 3, 2025Updated last year
- Interactive Performance, Analysis and Visualization of RAVE Latent Spaces via PCA and OSC Integration☆19Jul 15, 2025Updated 11 months ago
- Cog implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆13Apr 16, 2025Updated last year
- ☆12Mar 19, 2025Updated last year
- An extension to use Kokoro TTS in text generation webui☆22May 5, 2025Updated last year
- ImageBind One Embedding Space to Bind Them All☆26May 19, 2023Updated 3 years ago
- A self contained example demonstrating how to use MediaPipe Object Detection with Max's jweb☆11Jun 26, 2023Updated 2 years ago
- ☆12Jan 5, 2024Updated 2 years ago
- A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.☆47Dec 1, 2023Updated 2 years ago
- An example of using the OpenAI API in python to automate email responses☆11Feb 13, 2024Updated 2 years ago
- A powerful ComfyUI custom node that brings Google's Gemini TTS capabilities directly to your workflow. Generate high-quality speech with …☆22May 23, 2025Updated last year
- Cog wrapper for FalconsAi / nsfw_image_detection☆18Aug 6, 2025Updated 10 months ago
- Materials for "Audiovisual Interaction w/ Machine Learning", Bachelors (416) and Graduate (616) course @ CalArts MTIID MTEC☆14May 16, 2017Updated 9 years ago
- Examples of apps built with Nendo, the AI Audio Tool Suite☆55Feb 29, 2024Updated 2 years ago
- Instant voice cloning by MyShell.☆27Apr 28, 2024Updated 2 years ago
- ☆28Aug 22, 2025Updated 9 months ago
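- A simple image search engine. Build your own image search engine in three steps and learn vector databases, image-to-image search, and text-to-image search.☆18Dec 14, 2023Updated 2 years ago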
- ☆47Oct 29, 2025Updated 7 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and uses CUDA for …☆14Oct 4, 2024Updated last year
- Cog wrapper for AI-toolkit LoRA training☆35Aug 15, 2024Updated last year
- Continuous descriptor-based control for deep audio synthesis☆23Aug 4, 2023Updated 2 years ago
- ☆85Aug 7, 2024Updated last year
- Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild☆18May 15, 2024Updated 2 years ago
- Performative Latents for Adaptive Unsupervised DDSP (PLAUD)☆15Updated this week
- ☆40May 14, 2025Updated last year
- The serverside backend created for use with the LoRA Easy Training Scripts Frontend☆24May 29, 2025Updated last year
- ☆84Jun 30, 2024Updated last year
- A Max for Live device based on nn~ for real-time latent interaction and bending in Ableton.☆20Jul 8, 2025Updated 11 months ago
- ☆32Oct 31, 2022Updated 3 years ago