adhadse / Deepdubpy
A complete end-to-end deep learning system for generating high-quality, human-like English speech for Korean dramas (WIP)
☆13 · Updated 2 years ago
Alternatives and similar repositories for Deepdubpy:
Users who are interested in Deepdubpy are comparing it to the repositories listed below
- Official PyTorch implementation of TTS Style Transfer ☆24 · Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference ☆28 · Updated 8 months ago
- Finally, some decent sample sentences ☆22 · Updated last year
- ☆12 · Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker … ☆20 · Updated 4 months ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ☆12 · Updated last year
- A simple voice conversion tool ☆17 · Updated 2 years ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆33 · Updated 2 years ago
- Automatically generate a lip-synced avatar based on a transcript and audio ☆14 · Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Updated last year
- Generate an accompaniment part with chords using an evolutionary algorithm. ☆8 · Updated 2 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity ☆12 · Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 · Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 · Updated last year
- Automatic parallel speech database extractor from dubbed movies ☆26 · Updated 5 months ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python ☆18 · Updated last year
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec" ☆12 · Updated this week
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆46 · Updated 2 years ago
- **ICASSP 2022** "Toward Degradation-Robust Voice Conversion": using speech enhancement and end-to-end denoising training to improve degrada… ☆23 · Updated 2 years ago
- ☆14 · Updated last year
- StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and imp… ☆10 · Updated last year
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples ☆23 · Updated 4 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech ☆22 · Updated 5 months ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 · Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets ☆14 · Updated 3 years ago
- Wake-up word emotion recognition [APSIPA 2022] ☆17 · Updated 2 years ago
- GPT for FACodec ☆13 · Updated 9 months ago
- Audio, NLP, ML with huggingface, nvidia/nemo, speechbrain ☆9 · Updated last year