chenxwh / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11Updated last year
Alternatives and similar repositories for seamless_communication
Users that are interested in seamless_communication are comparing it to the libraries listed below
- Modal LLM llama.cpp-based model deployment as part of a series on Model as a Service (MaaS)☆16Updated 5 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated this week
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- An app for generating prompts☆27Updated 5 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆31Updated 9 months ago
- Viral Factory is a highly modular gradio app that automates the production of various forms of social media content. Thanks to its comp…☆49Updated this week
- ☆19Updated 9 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆59Updated last year
- Generate video stories with AI ✨☆32Updated 9 months ago
- ☆37Updated last year
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma☆34Updated 8 months ago
- ☆29Updated last year
- Create mini movies from text using fal.ai and ffmpeg-wasm.☆13Updated 10 months ago
- ☆28Updated last year
- A quick Crew AI tutorial☆23Updated last year
- Replicate Flux LoRA image editor.☆51Updated 9 months ago
- This project aims to bring a more stable and user-friendly check GPT interface designed to allow others to implement their own GPT prompt…☆12Updated last year
- Generate app.py for Autogen using a simple UI☆18Updated last year
- 🧠 Mem4AI: An LLM-friendly memory management library.☆28Updated 7 months ago
- Local & private voice controlled notepad using whisper.cpp☆24Updated last year
- Add captions to any video☆48Updated last year
- LoRA Explorer model to test with LoRAs using Flux.1[Dev] as the base model☆49Updated 8 months ago
- ☆55Updated 7 months ago
- ☆22Updated 8 months ago
- 📦 Metadata for all the public models on Replicate, bundled up into an npm package.☆37Updated this week
- ☆27Updated last year
- Pip Package for MirageML☆25Updated last year
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆24Updated last year
- kokoro text to speech using javascript☆58Updated 4 months ago