chenxwh / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11Updated last year
Alternatives and similar repositories for seamless_communication:
Users that are interested in seamless_communication are comparing it to the libraries listed below
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆45Updated last month
- Generate video stories with AI ✨☆32Updated 7 months ago
- Local & private voice controlled notepad using whisper.cpp☆24Updated last year
- ☆46Updated 4 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆27Updated 6 months ago
- Rivet plugin for integration with Ollama, the tool for running LLMs locally easily☆37Updated 11 months ago
- Fooocus App deployment using Modal.☆10Updated 6 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- Run AuraFlow on Replicate☆14Updated 8 months ago
- Seamless Voice Interactions with LLMs☆12Updated last year
- ☆10Updated last year
- Garvis: Realtime AI Voice Assistant☆37Updated 10 months ago
- An app for generating prompts☆26Updated 2 months ago
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS)☆16Updated 2 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆59Updated last year
- ☆25Updated 3 weeks ago
- An API for VoiceCraft.☆25Updated 9 months ago
- This is a sample example repo on how to extend Vapi functionalities and deploy it on Vercel Edge Functions.☆17Updated 8 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated 8 months ago
- Modern AI chatbot supporting multiple LLMs. Switch between Gemini, Mistral, Llama, Claude and ChatGPT.☆54Updated 3 weeks ago
- ☆37Updated last year
- Automatically generate engaging AI podcasts from nothing but an episode title.☆77Updated 3 months ago
- Model Context Protocol server for Replicate's API☆40Updated 3 weeks ago
- A Simple Scenes Based Movie Generation App☆50Updated 4 months ago
- ☆22Updated 5 months ago
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI☆68Updated last year
- Attend - to what matters.☆14Updated last month
- Replicate Flux LoRA image editor.☆49Updated 7 months ago
- ☆46Updated 4 months ago