lperezmo / real-time-translator
A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Speech (gTTS) to play out the translation.
☆23Updated 8 months ago
Related projects: ⓘ
- Takes a youtube video, clones the voice and re-creates that video in a different language☆82Updated 6 months ago
- This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses P…☆54Updated 9 months ago
- Modern AI chatbot supporting multiple LLMs. Switch between Gemini, Mistral, Llama, Claude and ChatGPT.☆46Updated last month
- I built a Voice Assistant with ChatGPT, Whisper API, Gradio, and TTS APIs☆48Updated last year
- Python script for my article and Youtube video on building a streamlit app to use whisper for speech-to-text transcription☆12Updated last year
- CopperAI offers a hands-free, voice-to-voice interaction system with a Large Language Model (LLM)☆28Updated 9 months ago
- Web app enabling users to either record or upload audio files. Then utilizing OpenAI API (Whisper, GPT4) generates transcriptions, summar…☆55Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆43Updated last month
- Self-hosted AI voice agent☆50Updated 3 weeks ago
- Chat AI (↓↓Cuộn trang để xem thêm↓↓)☆72Updated this week
- Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio tra…☆47Updated last year
- A smart AI voice assistant with multi-language support and long-term memory. Currently best for Swedish and English. Compatible with Wind…☆20Updated 9 months ago
- LLM Siri with OpenAI, Perplexity, Ollama, Llama2, Mistral, Mixtral & Langchain☆53Updated 8 months ago
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆26Updated last year
- A voice AI assistant, built with Next.js, Node, WebSockets, OpenAI, Deepgram, and ElevenLabs☆12Updated last year
- RealVoiceGPT is a web application that lets you have voice conversations with ChatGPT. The project uses ElevenLabs AI text to speech to g…☆23Updated last year
- creates tools for open interpreter to use☆20Updated 10 months ago
- An intellligent AI assistant that can do anything!☆49Updated 4 months ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆18Updated last year
- AvaChat - is a realtime AI chat demo with animated talking heads - it uses Large Language Models (GPT, API2D GPT4, Cluade) as text inputs…☆63Updated 4 months ago
- A modify of AutoGPT to AutoCluade. Use the 100k api.☆43Updated 11 months ago
- An innovative AI conversation API leveraging Google's Gemini for multimodal understanding. Combines FastAPI, Langchain, and Redis for rob…☆35Updated 3 months ago
- Input a YouTube video link or upload a video file and get a video with subtitles.☆93Updated 3 weeks ago
- Chatbot with a 3D avatar that can answer interview questions in your behalf. It can speak and understand English, German and Albanian. Ba…☆18Updated 3 months ago
- A simple chat app with vision using Next.js, Vercel AI SDK, and GPT-4V.☆13Updated 9 months ago
- ✨ Experience the enchantment of Story Blocks: an open-source project merging AI text generation and image synthesis to create captivating…☆48Updated 9 months ago
- Subtitle Videos and add text motion graphics - https://www.supertranslate.ai/☆181Updated last year
- A tutorial about cloning gosameday.com☆27Updated 7 months ago
- Unlock GPT-4-32K & Claude-2-100K API Instantly With Open Router☆13Updated last year
- Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description☆70Updated 9 months ago