chenxwh / seamless_communicationLinks
Foundational Models for State-of-the-Art Speech and Text Translation
☆11Updated 2 years ago
Alternatives and similar repositories for seamless_communication
Users that are interested in seamless_communication are comparing it to the libraries listed below
Sorting:
- Local & private voice controlled notepad using whisper.cpp☆25Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated 5 months ago
- Automatically generate engaging AI podcasts from nothing but an episode title.☆138Updated 4 months ago
- A curated list of amazing RunPod projects, libraries, and resources☆126Updated last year
- kokoro text to speech using javascript☆63Updated 10 months ago
- [WIP] AI Try-On plugin for Chrome☆28Updated last year
- fork of litellm that is open source☆21Updated 11 months ago
- "Just hoof it!" - A spotlight like interface to Ollama☆63Updated last year
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆34Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Live audio chats with AI using Groq Llama3-70b and Deepgram Voice☆32Updated last year
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS)☆16Updated 11 months ago
- Create mini movies from text using fal.ai and ffmpeg-wasm.☆15Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- An open-source agent toolkit that auto-syncs SDK versions, docs, and examples—built for seamless integration with LLMs, and AI agents ( M…☆43Updated 4 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated last year
- https://narrateit.streamlit.app/☆39Updated 11 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated last year
- Viral Factory is a highly modular gradio app that automates the production of various forms of social media content. Thanks to it's comp…☆55Updated 3 months ago
- Meeper 📝 - is your secretary for any in-browser conference.☆78Updated 7 months ago
- Rivet plugin to access E2B goodies☆10Updated 10 months ago
- Generate video stories with AI ✨☆35Updated last year
- ☆23Updated last year
- An application that automatically generates Python codes based on GPT (as used in ChatGPT).☆46Updated 2 years ago
- An MCP server that provides image recognition 👀 capabilities using Anthropic and OpenAI vision APIs☆32Updated 7 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated 3 months ago
- Seamless Voice Interactions with LLMs☆12Updated 2 years ago
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆51Updated 2 years ago
- Easily create LLM automation/agent workflows☆60Updated last year
- A high-performance speech recognition MCP server based on Faster Whisper, providing efficient audio transcription capabilities.☆14Updated 8 months ago