chenxwh / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆9Updated last year
Related projects:
- (Windows/Linux) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) in Python (Gradio interface). Translated on 3 lang…☆37Updated this week
- An app for generating prompts☆22Updated this week
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆25Updated 11 months ago
- ☆17Updated 2 weeks ago
- Generate visual podcasts about novels using open source models☆22Updated last year
- ☆19Updated 8 months ago
- Generate video stories with AI ✨☆25Updated 2 weeks ago
- ☆53Updated 8 months ago
- Auto-video maker handling many AIs☆11Updated 6 months ago
- Viral Factory is a highly modular Gradio app that automates the production of various forms of social media content. Thanks to its comp…☆38Updated this week
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆41Updated last year
- Add caption to any video☆40Updated 7 months ago
- Generate app.py for Autogen using a simple UI☆18Updated 10 months ago
- ☆9Updated last year
- Program that enables seamless interaction with your documents through an advanced vector database and the power of Large Language Model (…☆17Updated last year
- [WIP] AI Try-On plugin for Chrome☆24Updated 6 months ago
- ☆16Updated 11 months ago
- ☆31Updated 9 months ago
- Turn text from websites into spoken audio with edge-tts and save as mp3 files☆14Updated 2 weeks ago
- Cog wrapper for collabora/WhisperSpeech☆23Updated 6 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆47Updated 4 months ago
- An auto save extension for text generated with the oobabooga WebUI☆21Updated last year
- ✨ Experience the enchantment of Story Blocks: an open-source project merging AI text generation and image synthesis to create captivating…☆48Updated 9 months ago
- The very first artist assistant☆20Updated last year
- ☆37Updated 7 months ago
- Seamless Voice Interactions with LLMs☆11Updated 10 months ago
- ☆40Updated 5 months ago
- ☆78Updated 8 months ago
- An approach to creating the perfect prompt for any image generation task.☆29Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆63Updated last year