lucataco / cog-xtts-v2Links
Cog wrapper for Coqui / xtts-v2
☆78Updated 9 months ago
Alternatives and similar repositories for cog-xtts-v2
Users that are interested in cog-xtts-v2 are comparing it to the libraries listed below
Sorting:
- Add caption to any video☆203Updated last year
- Cog wrapper for Vchitect/SEINE☆37Updated last year
- Fine tune SDXL on YouTube videos☆176Updated last year
- A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.☆46Updated last year
- Add caption to any video☆48Updated last year
- A curated list of amazing RunPod projects, libraries, and resources☆122Updated last year
- Record a sample of your own voice and let AI narrate the text in your own voice.☆79Updated last year
- ☆175Updated last year
- OpenClap is a file format for the age of AI content production☆119Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆81Updated 11 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆216Updated last month
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆25Updated last year
- Transcription with speaker diarization pipeline☆94Updated 2 years ago
- Gradio UI for a Cog API☆70Updated last year
- Replicate Flux LoRA image editor.☆52Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- ☆83Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 9 months ago
- Talking head video AI generator☆79Updated last year
- Chat to Compose Video☆195Updated last year
- Talk to GPT-4 and create a story together.☆91Updated last year
- A demo for running comfy deploy api via nextjs☆169Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆72Updated 2 years ago
- faster-whisper as serverless endpoint☆117Updated 4 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆60Updated last year
- The source of the demo app for fal-serverless + Next.js☆122Updated last year
- A web GUI built with Nuxt.js for outpainting with Stable Diffusion using the Replicate API.☆52Updated 2 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆157Updated last year
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆61Updated 3 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆63Updated last year