lperezmo / real-time-translator
A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Speech (gTTS) to play out the translation.
☆33Updated last month
Alternatives and similar repositories for real-time-translator:
Users that are interested in real-time-translator are comparing it to the libraries listed below
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆31Updated 7 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆51Updated 7 months ago
- Takes a youtube video, clones the voice and re-creates that video in a different language☆104Updated last year
- Python script for my article and Youtube video on building a streamlit app to use whisper for speech-to-text transcription☆15Updated 2 years ago
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆36Updated last year
- A real time offline transcriber with gui, based on OpenAI whisper☆14Updated last year
- Get started using Deepgram's Live Transcription with this Flask demo app☆30Updated last week
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- I built a Voice Assistant with ChatGPT, Whisper API, Gradio, and TTS APIs☆53Updated last year
- An innovative AI conversation API leveraging Google's Gemini for multimodal understanding. Combines FastAPI, Langchain, and Redis for rob…☆40Updated 10 months ago
- RealVoiceGPT is a web application that lets you have voice conversations with ChatGPT. The project uses ElevenLabs AI text to speech to g…☆31Updated last year
- Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.☆34Updated 10 months ago
- Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description☆75Updated last year
- ☆12Updated last year
- CopperAI offers a hands-free, voice-to-voice interaction system with a Large Language Model (LLM)☆32Updated last year
- Early Alpha: Interact with OpenAI's Latest Assistant API through Natural Language.☆12Updated last year
- Self-hosted AI voice agent☆92Updated 6 months ago
- 🧠 Mem4AI: A LLM Friendly memory management library.☆20Updated 4 months ago
- A tutorial about cloning gosameday.com☆29Updated last year
- LLM Siri with OpenAI, Perplexity, Ollama, Llama2, Mistral, Mixtral & Langchain☆59Updated last year
- Your own GPT-powered Personal Assistant to whom you can ORDER or INSTRUCT to do some task or search for something using your VOICE comman…☆20Updated last year
- This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses P…☆56Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆42Updated 6 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- SeamlessM4t-Translator: Utilizing the powerful Seamless M4t Facebook model in the backend, this project facilitates seamless translation …☆11Updated last year
- An experimental proof-of-concept script to automatically dub videos to English with the help of local TTS, voice cloning, audio separatio…☆12Updated 10 months ago
- ChatGPT powered Google Home / Alexa type system☆49Updated last year
- Summarize Youtube Videos and Generate Timestamps Efficiently using LLM [Google Gemini Pro, OpenAI ChatGPT]☆67Updated 8 months ago
- Voice activated Python interface for Bard AI. Implements open sourced reverse engineered Bard API, local text to speech and OpenAI Whispe…☆62Updated last year
- ☆22Updated 7 months ago