lucataco / cog-xtts-v2
Cog wrapper for Coqui / xtts-v2
☆74Updated 2 months ago
Alternatives and similar repositories for cog-xtts-v2:
Users that are interested in cog-xtts-v2 are comparing it to the libraries listed below
- A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.☆41Updated last year
- Add caption to any video☆183Updated last year
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆57Updated last year
- ☆172Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆43Updated 2 months ago
- (CVPR 2023)SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆28Updated 8 months ago
- Fine tune SDXL on YouTube videos☆174Updated 6 months ago
- A curated list of amazing RunPod projects, libraries, and resources☆106Updated 6 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆59Updated 11 months ago
- Add caption to any video☆45Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated 9 months ago
- OpenClap is a file format for the age of AI content production☆116Updated 8 months ago
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆24Updated last year
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆84Updated last week
- [WIP] AI Try-On plugin for Chrome☆27Updated 11 months ago
- Gradio UI for a Cog API☆66Updated 10 months ago
- Cog wrapper for Vchitect/SEINE☆37Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆65Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆76Updated 4 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆150Updated 7 months ago
- Style-Transfer: Apply the style of an image to another image☆52Updated 10 months ago
- Voice data <= 10 mins can also be used to train a good VC model!☆11Updated last year
- ☆78Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆40Updated 4 months ago
- Cog wrapper for MagicAnimate☆30Updated last year
- LoRA inference model packaged with Cog☆74Updated last year
- Alternative to Flawless AI's TrueSync. Make lips in video match provided audio using the power of Wav2Lip and GFPGAN.☆117Updated 7 months ago
- ☆94Updated 9 months ago
- A web GUI built with Nuxt.js for outpainting with Stable Diffusion using the Replicate API.☆51Updated last year