kensonhui / Realtime-Speech-to-Speech-Translation
Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd like!
☆31 Updated 7 months ago
Alternatives and similar repositories for Realtime-Speech-to-Speech-Translation:
Users who are interested in Realtime-Speech-to-Speech-Translation are comparing it to the repositories listed below.
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl… ☆14 Updated 9 months ago
- Self-hosted AI voice agent ☆94 Updated 7 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V… ☆29 Updated 4 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe… ☆33 Updated last month
- Get started using Deepgram's Live Transcription with this Flask demo app ☆30 Updated last week
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run. ☆33 Updated this week
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web … ☆31 Updated 3 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di… ☆186 Updated last month
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆126 Updated 9 months ago
- Simulates talk with an AI that can express emotions ☆58 Updated 7 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆92 Updated 11 months ago
- ☆36 Updated this week
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆51 Updated 7 months ago
- FastAPI service on top of WhisperX ☆72 Updated this week
- 🎙️ Speak with AI - Run locally using Ollama, OpenAI or xAI - Speech uses XTTS, OpenAI or ElevenLabs ☆180 Updated this week
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆99 Updated last month
- WIP exploration using Twilio Media Streams and Generative AI ☆39 Updated last year
- Knotie-AI - A Completely Open-Source Inbound/Outbound AI Sales Agent which can communicate with your potential lead/customer. ☆78 Updated 2 months ago
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too… ☆123 Updated 7 months ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out... ☆42 Updated 7 months ago
- Build a real-time AI voice assistant using Python that can handle incoming calls, transcribe speech, generate intelligent responses, and … ☆34 Updated 7 months ago
- AI voice assistant web app built using SpeechRecognition, pyttsx3, and streamlit open-source libraries ☆12 Updated last year
- ☆64 Updated last month
- GroqChat: Local ChatGPT-like environment in your browser using the best open model Llama 3.1 series on Groq, the fastest inference engine. ☆80 Updated 7 months ago
- Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-tex… ☆57 Updated 2 months ago
- AI agent to automatically generate and post short videos ☆77 Updated last year
- Real Time AI Voice Assistant using Node.js ☆51 Updated 10 months ago
- WhisperX API implementation ☆25 Updated 10 months ago
- Use ChatGPT over Twilio to create an AI phone agent (works for incoming or outgoing calls). ☆106 Updated last year
- Modern AI chatbot supporting multiple LLMs. Switch between Gemini, Mistral, Llama, Claude and ChatGPT. ☆54 Updated last week