kensonhui / Realtime-Speech-to-Speech-Translation
Real-time audio-to-audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd like!
☆18 Updated 3 months ago
Related projects
Alternatives and complementary repositories for Realtime-Speech-to-Speech-Translation
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di… ☆63 Updated this week
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability ☆30 Updated 11 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆43 Updated 3 months ago
- 🎙️ Speak with AI - Run locally using Ollama, OpenAI or xAI - Speech uses XTTS, OpenAI or ElevenLabs ☆96 Updated this week
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow ☆22 Updated 2 months ago
- Get started using Deepgram's Live Transcription with this Flask demo app ☆23 Updated this week
- Live-Transcription (STT) with Whisper PoC ☆155 Updated 5 months ago
- Self-hosted AI voice agent ☆63 Updated 2 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆110 Updated 5 months ago
- WIP exploration using Twilio Media Streams and Generative AI ☆35 Updated 9 months ago
- A simple chat app with vision using Next.js, Vercel AI SDK, and GPT-4V. ☆13 Updated last year
- AI Voice Assistant: talk to an AI agent that handles event scheduling, managing contacts, accessing your knowledge base and web searching… ☆13 Updated 3 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe… ☆27 Updated last month
- Efficient approach to speaker diarization using voice characteristics extraction ☆68 Updated 6 months ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM. ☆11 Updated 7 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆53 Updated 10 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch… ☆64 Updated 2 months ago
- Chat with your Documents(PDF, TXT, DOCX, ODT, PPTX etc), Websites and Youtube Chat too!, CSV files. Uses langchain, Ollama, Groq, Gemini,… ☆46 Updated 6 months ago
- Simple Chainlit UI for running llms locally using Ollama and LangChain ☆99 Updated 3 months ago
- An example application to help you get started with Deepgram text-to-speech ☆9 Updated 3 months ago
- Lightweight, standalone, multi-platform, and privacy focused local LLM chat interface with optional encryption ☆56 Updated this week
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2. ☆97 Updated 11 months ago
- Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Container) ☆98 Updated last month
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆69 Updated last month
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc… ☆15 Updated 11 months ago
- Simple front-end interface for querying a local Ollama API server ☆23 Updated 11 months ago
- AI Agents with Google's Gemini Pro and Gemini Pro Vision Models ☆22 Updated 10 months ago