chenxwh / seamless_communicationLinks
Foundational Models for State-of-the-Art Speech and Text Translation
☆11Updated last year
Alternatives and similar repositories for seamless_communication
Users that are interested in seamless_communication are comparing it to the libraries listed below
Sorting:
- [WIP] AI Try-On plugin for Chrome☆28Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated 2 months ago
- kokoro text to speech using javascript☆62Updated 7 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Generate video stories with AI ✨☆32Updated last year
- ☆13Updated 8 months ago
- fork of litellm that is open source☆21Updated 8 months ago
- ☆91Updated 3 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆34Updated 11 months ago
- Rivet plugin to access E2B goodies☆10Updated 7 months ago
- ☆83Updated last year
- Automatically generate engaging AI podcasts from nothing but an episode title.☆126Updated last month
- ☆51Updated 10 months ago
- Create mini movies from text using fal.ai and ffmpeg-wasm.☆14Updated last year
- ☆19Updated last year
- Viral Factory is a highly modular gradio app that automates the production of various forms of social media content. Thanks to it's comp…☆54Updated this week
- Seamless Voice Interactions with LLMs☆12Updated last year
- An Open-Source Modular AI Assistant☆29Updated 5 months ago
- VideoDB Python SDK☆80Updated this week
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆38Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆60Updated last year
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆20Updated last year
- ☆29Updated last year
- Rivet plugin for integration with Ollama, the tool for running LLMs locally easily☆42Updated 3 months ago
- A discord bot to stay up to date with Hugging Face Daily Papers.☆13Updated last year
- Simli WebRTC AI Agent demo☆23Updated 9 months ago
- LLM powered local Search Engine☆27Updated last year
- 🤖📝 A markdown editor powered by AI (Ollama)☆64Updated 10 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆101Updated last week
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆49Updated 7 months ago