kensonhui / Realtime-Speech-to-Speech-Translation
Real-time audio-to-audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd like!
★48 · Updated last year
Alternatives and similar repositories for Realtime-Speech-to-Speech-Translation
Users that are interested in Realtime-Speech-to-Speech-Translation are comparing it to the libraries listed below
- 🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses XTTS, OpenAI, ElevenLabs or Kokoro ★315 · Updated last month
- FastAPI service on top of WhisperX ★128 · Updated last week
- Self-hosted AI voice agent ★113 · Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di… ★288 · Updated 2 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi… ★189 · Updated 4 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ★100 · Updated 2 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ★56 · Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ★136 · Updated last year
- Simulates talk with an AI that can express emotions ★78 · Updated 2 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription) ★95 · Updated 3 weeks ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. ★68 · Updated 11 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ★149 · Updated last week
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability ★43 · Updated last year
- Simli WebRTC AI Agent demo ★23 · Updated 9 months ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming ★305 · Updated 2 months ago
- Real-time Speech To Text using Faster Whisper. ★58 · Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ★233 · Updated 3 weeks ago
- Supporting code from my related video ★40 · Updated last year
- Build a real-time AI voice assistant using Python that can handle incoming calls, transcribe speech, generate intelligent responses, and … ★54 · Updated last year
- Have a natural voice conversation with an LLM ★255 · Updated 9 months ago
- ★51 · Updated 5 months ago
- Live-Transcription (STT) with Whisper PoC ★194 · Updated last year
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V… ★42 · Updated 10 months ago
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web … ★55 · Updated 9 months ago
- List of curated use cases built using Sesame's CSM 1B ★73 · Updated 3 months ago
- ★68 · Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run. ★74 · Updated this week
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl… ★18 · Updated last year
- The subtitles and translations are generated in real-time and displayed as pop-ups. ★174 · Updated 2 years ago
- Kno2gether Agent PlayGround ★21 · Updated 8 months ago