kensonhui / Realtime-Speech-to-Speech-TranslationLinks
Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd like!
β56Updated last year
Alternatives and similar repositories for Realtime-Speech-to-Speech-Translation
Users that are interested in Realtime-Speech-to-Speech-Translation are comparing it to the libraries listed below
Sorting:
- Self-hosted AI voice agentβ120Updated last year
- ποΈ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses XTTS, OpenAI, ElevenLabs or Kokoroβ352Updated last month
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β333Updated 5 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archiβ¦β213Updated 2 weeks ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.β141Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.β58Updated last year
- Efficient approach to speaker diarization using voice characteristics extractionβ105Updated 5 months ago
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web β¦β62Updated last year
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capabilityβ44Updated 2 years ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)β103Updated 3 months ago
- Groq-Powered Real-Time Voice Assistantβ225Updated last year
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding tooβ¦β157Updated 7 months ago
- FastAPI service on top of WhisperXβ156Updated last week
- Get started using Deepgram's Live Transcription with this Flask demo appβ41Updated last month
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.β161Updated last week
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Speβ¦β41Updated 10 months ago
- Kno2gether Agent PlayGroundβ23Updated 11 months ago
- Real-time Speech To Text using Faster Whisper.β59Updated last year
- List of curated use cases built using Sesame's CSM 1Bβ73Updated 6 months ago
- Automatically generate engaging AI podcasts from nothing but an episode title.β138Updated 4 months ago
- Simulates talk with an AI that can express emotionsβ82Updated 5 months ago
- β56Updated 8 months ago
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features inclβ¦β19Updated last year
- Live-Transcription (STT) with Whisper PoCβ201Updated last year
- Voxella π - AI Video Translation and Dubbing App: Seamlessly translate and dub videos into multiple languages with Voxella. This powerfuβ¦β16Updated 2 years ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.β70Updated last year
- Docs for Ultravoxβ43Updated this week
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.β89Updated 10 months ago
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meetβ¦β61Updated 7 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.β88Updated last week