ibro45 / Speech-to-Speech-Translator
The Babel Fish v0.01 is a translator that can identify the language spoken from the user's audio input and translate it into English speech
☆15Updated 4 years ago
Alternatives and similar repositories for Speech-to-Speech-Translator
Users that are interested in Speech-to-Speech-Translator are comparing it to the libraries listed below
Sorting:
- Virtual news production using Tacotron2 and Wav2Lip☆11Updated last year
- an improved version of Real-time-voice-cloning☆50Updated last year
- Code for the project: "Audio-Driven Video-Synthesis of Personalised Moderations"☆20Updated last year
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 8 months ago
- optimized wav2lip☆19Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- ☆11Updated 3 years ago
- Translated vocal synthesis - Clone a voice and output speech in another language☆25Updated 3 years ago
- Use DFL Dataset creator to create a dataset according to Yaw and Pitch of a destination dataset.☆9Updated 2 years ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆35Updated 2 years ago
- Uses ChatGPT, TTS, and Stable Diffusion to automatically generate videos☆29Updated 2 years ago
- Wav2Lip UHQ Improvement with ControlNet 1.1☆73Updated last year
- GUI to sync video mouth movements to match audio, utilizing wav2lip-hq. Completed as part of a technical interview.☆11Updated last year
- Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning☆12Updated last month
- code for paper "Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion" in the conference of IJCAI 2021☆8Updated 3 years ago
- ☆83Updated 10 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 6 months ago
- Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)☆54Updated 2 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆53Updated 3 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆33Updated last year
- ☆16Updated last month
- ☆148Updated last year
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆14Updated 4 years ago
- Create training data for training a voice cloner for bark text to speech.☆45Updated last year
- ☆12Updated last year
- ☆74Updated 2 years ago
- ☆16Updated last year
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- ☆11Updated 2 years ago