chenxwh / seamless_communicationLinks
Foundational Models for State-of-the-Art Speech and Text Translation
☆11Updated last year
Alternatives and similar repositories for seamless_communication
Users that are interested in seamless_communication are comparing it to the libraries listed below
Sorting:
- Automatically generate engaging AI podcasts from nothing but an episode title.☆118Updated 7 months ago
- ☆13Updated 7 months ago
- ☆83Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆71Updated 2 years ago
- Explore, Install, Innovate — in 1 Click.☆30Updated this week
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆60Updated last year
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆56Updated last month
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- ☆91Updated 2 months ago
- Viral Factory is a highly modular gradio app that automates the production of various forms of social media content. Thanks to it's comp…☆53Updated last week
- ☆27Updated 2 weeks ago
- Simli WebRTC AI Agent demo☆23Updated 7 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- ☆15Updated 5 months ago
- ☆51Updated 8 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆25Updated 4 months ago
- Generate video stories with AI ✨☆33Updated 10 months ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆19Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated last month
- Rivet plugin for integration with Ollama, the tool for running LLMs locally easily☆42Updated last month
- ☆38Updated last year
- kokoro text to speech using javascript☆59Updated 5 months ago
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆51Updated 2 years ago
- Generate apppy for Autogen using a simple UI☆18Updated last year
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆25Updated last year
- Cog wrapper for Coqui / xtts-v2☆75Updated 8 months ago
- fork of litellm that is open source☆20Updated 7 months ago
- ☆44Updated 5 months ago
- ☆22Updated 9 months ago
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆27Updated 2 months ago