adhadse / Deepdubpy
A complete end-to-end deep learning system to generate high-quality, human-like English speech for Korean dramas (WIP)
☆11Updated 2 years ago
Related projects:
- Official PyTorch implementation of TTS Style Transfer☆24Updated 2 years ago
- ☆56Updated this week
- Finally, some decent sample sentences☆21Updated 9 months ago
- ☆23Updated last year
- Generate an accompaniment part with chords using an evolutionary algorithm.☆8Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆27Updated 4 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art zero-shot multi-speaker functionality. A web API built with the…☆46Updated last year
- Codebase and project page for EDMSound☆29Updated 10 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆39Updated 2 months ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆19Updated 3 years ago
- ☆26Updated last year
- Implementation of Google's USM speech model in PyTorch☆23Updated last week
- Streamlit app to visualize and edit TTS datasets☆14Updated 2 years ago
- ☆35Updated 3 weeks ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 2 years ago
- A simple voice conversion tool☆15Updated 2 years ago
- ☆23Updated this week
- ☆14Updated last year
- ☆44Updated this week
- ☆21Updated last year
- Demo for 2022 ICASSP☆64Updated 2 years ago
- ☆14Updated last year
- Project page for our paper "DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".☆10Updated 3 years ago
- This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.☆12Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- A minimalistic, Streamlit-based automatic speech recognition web app powered by OpenAI's Whisper "State of the Art" models☆65Updated last year
- ☆56Updated last year
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆15Updated 9 months ago
- Easily turn large sets of audio URLs into an audio dataset.☆20Updated last year