kensonhui / Realtime-Speech-to-Speech-Translation
Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd like!
☆37Updated 8 months ago
Alternatives and similar repositories for Realtime-Speech-to-Speech-Translation:
Users that are interested in Realtime-Speech-to-Speech-Translation are comparing it to the libraries listed below
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆54Updated 8 months ago
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web …☆42Updated 4 months ago
- 🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses XTTS, OpenAI, ElevenLabs or Kokoro☆219Updated this week
- Self-hosted AI voice agent☆101Updated 8 months ago
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆17Updated 11 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆37Updated 2 months ago
- Performing a RAG (Retrieval Augmented Generation) assessment using voice-to-voice query resolution. Provide the file containing the queri…☆38Updated last year
- Knotie-AI - A Completely Open-Source Inbound/Outbound AI Sales Agent which can communicate with your potential lead/customer.☆89Updated 3 months ago
- Get started using Deepgram's Live Transcription with this Flask demo app☆33Updated 2 weeks ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆208Updated 2 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…☆35Updated 5 months ago
- ☆40Updated last month
- Kno2gether Agent PlayGround☆17Updated 4 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆46Updated 4 months ago
- AI agent to automatically generate and post short videos☆85Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆59Updated 6 months ago
- FastAPI service on top of WhisperX☆92Updated this week
- Automated voice dubbing for YouTube videos using Docker, OpenVoice, and FastAPI. Translates and dubs videos with original voice timbre.☆50Updated last year
- Build a real-time AI voice assistant using Python that can handle incoming calls, transcribe speech, generate intelligent responses, and …☆39Updated 8 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆79Updated 11 months ago
- An example of using AI and AWS to generate, and deploy websites in under a minute.☆52Updated last year
- Generate broll for a video using AI☆73Updated 3 months ago
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆40Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆43Updated this week
- AI Agent for Telephony voice bot - based on vocode, twilio, deepgram, and elevenlabs. Just add your own keys and prompt.☆23Updated 8 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆66Updated last month
- Simulates talk with an AI that can express emotions☆67Updated 9 months ago
- Open source alternative of ChatGPT-4o, FREE!☆38Updated 7 months ago
- AI Video Editor: Use an LLM to stitch together multiple videos and do the rough-cut of video editing for you.☆37Updated 3 weeks ago