High-resolution clothing swap, virtual try-on, and object replacement for people in 4K video
☆56Sep 9, 2022Updated 3 years ago
Alternatives and similar repositories for Virtual-try-on-4kVideo
Users that are interested in Virtual-try-on-4kVideo are comparing it to the libraries listed below.
- A PyTorch-based VITS-BigVGAN Chinese TTS model with an added prosody prediction model.☆197Sep 15, 2022Updated 3 years ago
- A TTS model based on VITS, FastSpeech2, and VISinger☆24Mar 9, 2023Updated 3 years ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆55Mar 18, 2024Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- ☆19Mar 22, 2024Updated 2 years ago
- vits chinese, tts chinese, tts mandarin: the easiest-to-train speech synthesis system ever, with the best audio quality☆221Jun 7, 2025Updated 9 months ago
- Run pytorch models on GPU Android with Vulkan backend☆10Aug 15, 2023Updated 2 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Jul 16, 2022Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆147Aug 22, 2022Updated 3 years ago
- ☆11May 7, 2022Updated 3 years ago
- ☆77Apr 26, 2022Updated 3 years ago
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning☆81Jan 3, 2024Updated 2 years ago
- A differentiable version of SPTK☆196Feb 26, 2026Updated 3 weeks ago
- An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆125Nov 4, 2020Updated 5 years ago
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training☆136Feb 18, 2023Updated 3 years ago
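- This project implements Wav2lip video lip-shape synthesis based on SadTalkers. It generates lip shapes driven by speech from a video file, and applies a configurable enhancement to the synthesized lip (face) region to improve the clarity of the generated lips. It uses the DAIN deep-learning frame-interpolation algorithm to fill in frames of the generated video, smoothing the motion transitions of the synthesized lips between frames, so that the synthesized lip…☆2,009Jun 4, 2023Updated 2 years ago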
- ☆88Nov 1, 2022Updated 3 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023☆86Oct 10, 2023Updated 2 years ago
- PersonaTalk Hack☆15Jan 10, 2025Updated last year
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109May 1, 2022Updated 3 years ago
- Official implementation of SawSing (ISMIR'22)☆272Aug 28, 2022Updated 3 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Nov 3, 2022Updated 3 years ago
- Official repository of "SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos"☆21Nov 29, 2023Updated 2 years ago
- Audio samples of our paper "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network" (accepted by ICASSP2020).☆11Apr 14, 2020Updated 5 years ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆44Oct 28, 2024Updated last year
- Singing Voice Synthesis based on VITS, different from VISinger☆196Nov 13, 2023Updated 2 years ago
- Pytorch Implementation of WaveNODE☆64Sep 4, 2020Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- ☆39Apr 15, 2024Updated last year
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆153Jun 17, 2022Updated 3 years ago
- A minimum inference engine for DiffSinger☆37Apr 5, 2024Updated last year
- code and demo for our CVPR2022 paper "ClothFormer: Taming Video Virtual Try-on in All Module"☆127Apr 27, 2022Updated 3 years ago
- ☆22Oct 10, 2024Updated last year
- The code for the paper "Speech Driven Talking Face Generation from a Single Image and an Emotion Condition"☆172Apr 9, 2023Updated 2 years ago