kensonhui / Realtime-Speech-to-Speech-TranslationLinks
Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd like!
β51Updated last year
Alternatives and similar repositories for Realtime-Speech-to-Speech-Translation
Users that are interested in Realtime-Speech-to-Speech-Translation are comparing it to the libraries listed below
Sorting:
- ποΈ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses XTTS, OpenAI, ElevenLabs or Kokoroβ349Updated last week
- Simulates talk with an AI that can express emotionsβ82Updated 5 months ago
- Self-hosted AI voice agentβ119Updated last year
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archiβ¦β207Updated 7 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.β139Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β316Updated 4 months ago
- Efficient approach to speaker diarization using voice characteristics extractionβ104Updated 5 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Speβ¦β41Updated 9 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.β157Updated last month
- FastAPI service on top of WhisperXβ152Updated last week
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streamingβ309Updated 5 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)β102Updated 3 months ago
- Kno2gether Agent PlayGroundβ22Updated 11 months ago
- List of curated use cases built using Sesame's CSM 1Bβ73Updated 5 months ago
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web β¦β61Updated 11 months ago
- Have a natural voice conversation with an LLMβ259Updated last month
- Live-Transcription (STT) with Whisper PoCβ200Updated last year
- A complete voice AI frontend app for LiveKit Agents with Next.jsβ429Updated last week
- β55Updated 7 months ago
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding tooβ¦β155Updated 6 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.β86Updated this week
- Get started using Deepgram's Live Transcription with this Flask demo appβ40Updated last week
- OpenAI compatible API for Dia-1.6Bβ37Updated 6 months ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detectionβ115Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β240Updated 3 months ago
- β69Updated last year
- LipSyncr is a lip reading web app based on the LipNet model that can lip read videos.β70Updated 2 years ago
- Supporting code from my related videoβ41Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.β58Updated last year
- Simli WebRTC AI Agent demoβ23Updated 11 months ago