KevKibe / RealTime-Voice-Translation-using-Whisper
The application allows users to record speech, transcribe it using the Whisper ASR (Automatic Speech Recognition) model, translate the transcribed text into a selected language, and play back the translated text using the Elevenlabs TTS (Text-to-Speech) engine.
☆13Updated last year
Alternatives and similar repositories for RealTime-Voice-Translation-using-Whisper:
Users that are interested in RealTime-Voice-Translation-using-Whisper are comparing it to the libraries listed below
- Thin Plate Spline Motion Model - ONNX. Extended version for FaceSwap - HeadSwap - PartSwap☆10Updated 4 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆33Updated last week
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆68Updated 7 months ago
- One-shot face animation using webcam, capable of running in real time.☆35Updated 8 months ago
- ✨ Experience the enchantment of Story Blocks: an open-source project merging AI text generation and image synthesis to create captivating…☆57Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆33Updated 2 years ago
- Takes a youtube video, clones the voice and re-creates that video in a different language☆103Updated 11 months ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- an improved version of Real-time-voice-cloning☆48Updated 11 months ago
- A GUI for roop, supports replacing faces specified in videos☆68Updated last year
- Automated short video generated using Artificial intelligence tools☆35Updated 3 months ago
- Generate video stories with AI ✨☆31Updated 5 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆44Updated last week
- ☆40Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆14Updated 4 months ago
- Gradio_demo.py with Blinking on Still Mode Video Creation☆12Updated last year
- Auto-Video maker handling many AI's☆10Updated 11 months ago
- GUI to sync video mouth movements to match audio, utilizing wav2lip-hq. Completed as part of a technical interview.☆11Updated 9 months ago
- Video Translation with LipSync with OpenAi's whisper for ASR, YourTTS for TTS, and Wav2lip for lip sync.☆15Updated last year
- Extract handwritten information like name, student ID and then recognize them with CRNN-CTC-Attention. Using lexicon search on class list…☆20Updated 2 months ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆40Updated 5 months ago
- Viral Factory is a highly modular gradio app that automates the production of various forms of social media content. Thanks to it's comp…☆43Updated 2 months ago
- ☆37Updated last year
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- Ai generated music video with Riffusion and Gradio☆19Updated 2 years ago
- AI_Video_Shorts_Creator is a python-based tool that uses OpenAI's GPT-4 power to automatically analyze videos, extract the most interesti…☆19Updated last year
- Uses ChatGPT, TTS, and Stable Diffusion to automatically generate videos☆29Updated last year
- The code for some apps built with Sieve.☆74Updated 2 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆56Updated 4 months ago