lucataco / cog-xtts-v2Links
Cog wrapper for Coqui / xtts-v2
☆78Updated 10 months ago
Alternatives and similar repositories for cog-xtts-v2
Users that are interested in cog-xtts-v2 are comparing it to the libraries listed below
Sorting:
- Cog wrapper for Vchitect/SEINE☆37Updated last year
- Add caption to any video☆204Updated last year
- ☆175Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆60Updated last year
- Add caption to any video☆48Updated last year
- Fine tune SDXL on YouTube videos☆175Updated last year
- OpenClap is a file format for the age of AI content production☆118Updated last year
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆25Updated 2 years ago
- A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.☆46Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆81Updated 11 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated last month
- ☆83Updated last year
- Record a sample of your own voice and let AI narrate the text in your own voice.☆78Updated last year
- Talk to GPT-4 and create a story together.☆91Updated last year
- Upscale your videos up to 4k on free google colab using Real-ESRGAN☆189Updated 5 months ago
- Chat to Compose Video☆195Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 9 months ago
- [WIP] AI Try-On plugin for Chrome☆28Updated last year
- ☆79Updated last year
- The source of the demo app for fal-serverless + Next.js☆122Updated last year
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆60Updated 3 months ago
- ☆75Updated last year
- ☆28Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- Gradio UI for a Cog API☆69Updated last year
- Transcription with speaker diarization pipeline☆94Updated 2 years ago
- Instant voice cloning by MyShell.☆26Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆71Updated 2 years ago
- A browser extension that lets you chat with YouTube videos using Llama2-7b. Built using 🤗 Inference Endpoints and Vercel's AI SDK.☆163Updated 2 years ago
- A curated list of amazing RunPod projects, libraries, and resources☆123Updated last year