kensonhui / Realtime-Speech-to-Speech-TranslationLinks
Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd like!
β47Updated last year
Alternatives and similar repositories for Realtime-Speech-to-Speech-Translation
Users that are interested in Realtime-Speech-to-Speech-Translation are comparing it to the libraries listed below
Sorting:
- ποΈ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses XTTS, OpenAI, ElevenLabs or Kokoroβ304Updated 2 weeks ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β274Updated last month
- Self-hosted AI voice agentβ114Updated last year
- FastAPI service on top of WhisperXβ121Updated last week
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.β138Updated last week
- Real-time Speech To Text using Faster Whisper.β59Updated last year
- Efficient approach to speaker diarization using voice characteristics extractionβ98Updated 2 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.β56Updated last year
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archiβ¦β175Updated 4 months ago
- Live-Transcription (STT) with Whisper PoCβ190Updated last year
- Simulates talk with an AI that can express emotionsβ77Updated 2 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β225Updated last week
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)β93Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.β136Updated last year
- Groq-Powered Real-Time Voice Assistantβ223Updated 9 months ago
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web β¦β52Updated 8 months ago
- Conversational voice AI agentsβ363Updated this week
- π€π An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized β¦β147Updated last year
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Speβ¦β41Updated 6 months ago
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capabilityβ43Updated last year
- The subtitles and translations are generated in real-time and displayed as pop-ups.β173Updated 2 years ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.β68Updated 10 months ago
- Knotie-AI - A Completely Open-Source Inbound/Outbound AI Sales Agent which can communicate with your potential lead/customer.β121Updated 7 months ago
- Get started using Deepgram's Live Transcription with this Flask demo appβ39Updated 2 weeks ago
- Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streamingβ304Updated 2 months ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (Vβ¦β41Updated 9 months ago
- List of curated use cases built using Sesame's CSM 1Bβ69Updated 2 months ago
- Starter project for building real-time AI Voice Assistantsβ41Updated 11 months ago
- Have a natural voice conversation with an LLMβ253Updated 8 months ago
- β50Updated 4 months ago