Unsupervised video dubbing project
☆40Sep 11, 2020Updated 5 years ago
Alternatives and similar repositories for unsupervised-video-dubbing
Users that are interested in unsupervised-video-dubbing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- ☆14Dec 25, 2025Updated 3 months ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- DSADCSR FOR AIM2019 Extreme Super-Resolution Challenge - Track 1: Fidelity☆13May 27, 2020Updated 5 years ago
- official code for "EgoVSR: Towards High-Quality Egocentric Video Super-Resolution"☆15Jul 26, 2023Updated 2 years ago
- [MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆16Nov 3, 2023Updated 2 years ago
- A simple script for extracting plain text from arxiv dataset: https://www.kaggle.com/Cornell-University/arxiv☆15Dec 7, 2020Updated 5 years ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation☆46Sep 6, 2024Updated last year
- Efficient Space-time Super Resolution using Flow and Mask Upsampling☆10Aug 29, 2021Updated 4 years ago
- Automatic parallel speech database extractor from dubbed movies☆26Aug 20, 2024Updated last year
- The repository for the submission "Visualizing the Impact of Feature Attribution Baselines"☆17Mar 16, 2023Updated 3 years ago
- [CVPR 2023] Official code for paper: Learning to Dub Movies via Hierarchical Prosody Models.☆112Jun 21, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 2 years ago
- The code for AIM2022 compressed image super-resolution☆15Apr 27, 2023Updated 2 years ago
- ☆26Dec 4, 2024Updated last year
- Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks"☆13Oct 25, 2021Updated 4 years ago
- Github for the conference paper GLOD-Gaussian Likelihood OOD detector☆16Apr 18, 2022Updated 3 years ago
- 单独维护的中文TTS☆34Oct 28, 2022Updated 3 years ago
- ☆21Jul 24, 2022Updated 3 years ago
- Audio-conditioned video texture generation☆24Sep 16, 2022Updated 3 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆37Feb 11, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Incorporating Transformer Designs into Convolutions for Lightweight Image Super-Resolution☆24May 11, 2023Updated 2 years ago
- ☆24Feb 24, 2021Updated 5 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 7 months ago
- 精简版NEZHA模型权重☆21Dec 23, 2020Updated 5 years ago
- A tool for extracting chunks from Penn Chinese Treebank☆18Jan 12, 2018Updated 8 years ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Nov 23, 2023Updated 2 years ago
- End-To-End SpeechSynthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- Swish Activation - PyTorch CUDA Implementation☆37Oct 10, 2019Updated 6 years ago
- ☆87Feb 9, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆23Sep 26, 2022Updated 3 years ago
- A open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx☆15Dec 16, 2023Updated 2 years ago
- This repo is text to speech with learnable audio encoder without alignment with transcript reference☆54Sep 20, 2025Updated 6 months ago
- The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."☆1,109Sep 25, 2023Updated 2 years ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆61Sep 5, 2025Updated 6 months ago
- Talking Face Generation system☆19Oct 16, 2023Updated 2 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year