KevKibe / RealTime-Voice-Translation-using-Whisper
The application allows users to record speech, transcribe it using the Whisper ASR (Automatic Speech Recognition) model, translate the transcribed text into a selected language, and play back the translated text using the Elevenlabs TTS (Text-to-Speech) engine.
☆13Updated last year
Alternatives and similar repositories for RealTime-Voice-Translation-using-Whisper:
Users that are interested in RealTime-Voice-Translation-using-Whisper are comparing it to the libraries listed below
- Thin Plate Spline Motion Model - ONNX. Extended version for FaceSwap - HeadSwap - PartSwap☆9Updated 3 months ago
- Gradio_demo.py with Blinking on Still Mode Video Creation☆12Updated last year
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- AudioLDM text to audio colab☆19Updated last year
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆14Updated 3 months ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆29Updated 3 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆33Updated last week
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆66Updated 6 months ago
- One-shot face animation using webcam, capable of running in real time.☆34Updated 7 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆54Updated 3 months ago
- An AI try-on application for generating photos with AI character wearing the same clothes as the one in the input photo.☆13Updated last year
- an improved version of Real-time-voice-cloning☆45Updated 10 months ago
- ☆40Updated last year
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆12Updated 4 years ago
- ☆30Updated last year
- A collection of handy helpers for AI art generation, AI writing and other experimental tools☆51Updated 3 months ago
- Video Translation with LipSync with OpenAi's whisper for ASR, YourTTS for TTS, and Wav2lip for lip sync.☆15Updated last year
- Auto-Video maker handling many AI's☆12Updated 10 months ago
- ☆30Updated 11 months ago
- ☆37Updated 11 months ago
- GUI to sync video mouth movements to match audio, utilizing wav2lip-hq. Completed as part of a technical interview.☆11Updated 8 months ago
- Generate video stories with AI ✨☆29Updated 4 months ago
- optimized wav2lip☆19Updated last year
- AI_Video_Shorts_Creator is a python-based tool that uses OpenAI's GPT-4 power to automatically analyze videos, extract the most interesti…☆17Updated last year
- ☆55Updated last year
- Visual Clip Picker: Trimming Clips by Face Recognition☆39Updated last year
- ☆46Updated 10 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆32Updated 2 years ago