High-resolution clothing swap, virtual try-on, and object replacement for people in 4K video
☆56Sep 9, 2022Updated 3 years ago
Alternatives and similar repositories for Virtual-try-on-4kVideo
Users who are interested in Virtual-try-on-4kVideo are comparing it to the libraries listed below.
- A PyTorch-based VITS-BigVGAN Chinese TTS model with an added prosody prediction model.☆198Sep 15, 2022Updated 3 years ago
- A TTS model based on VITS, FastSpeech2, and VISinger☆24Mar 9, 2023Updated 3 years ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆56Mar 18, 2024Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- ☆19Mar 22, 2024Updated 2 years ago
- vits chinese, tts chinese, tts mandarin: the easiest-to-train, best-sounding speech synthesis system ever☆221Jun 7, 2025Updated last year
- Run pytorch models on GPU Android with Vulkan backend☆10Aug 15, 2023Updated 2 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Jul 16, 2022Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆151Aug 22, 2022Updated 3 years ago
- ☆76Apr 26, 2022Updated 4 years ago
- ☆11May 7, 2022Updated 4 years ago
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning☆82Jan 3, 2024Updated 2 years ago
- A differentiable version of SPTK☆201Jun 2, 2026Updated 2 weeks ago
- An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆125Nov 4, 2020Updated 5 years ago
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training☆136Feb 18, 2023Updated 3 years ago
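- This project implements Wav2lip video lip synthesis based on SadTalkers. It generates lip movements driven by speech from a video file, and applies a configurable enhancement to the face region to sharpen the synthesized lip (face) area and improve the clarity of the generated lips. It uses DAIN, a deep-learning frame interpolation algorithm, to fill in frames of the generated video and smooth the transitions between synthesized lip movements, so that the synthesized lip…☆2,005Jun 4, 2023Updated 3 years ago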
- ☆88Nov 1, 2022Updated 3 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023☆86Oct 10, 2023Updated 2 years ago
- PersonaTalk Hack☆16Jan 10, 2025Updated last year
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109May 1, 2022Updated 4 years ago
- Official implementation of SawSing (ISMIR'22)☆275Aug 28, 2022Updated 3 years ago
- Yin pitch estimator in PyTorch☆119Nov 7, 2022Updated 3 years ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Nov 3, 2022Updated 3 years ago
- Official repository of "SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos"☆19Nov 29, 2023Updated 2 years ago
- Audio samples of our paper "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network" (accepted by ICASSP2020).☆11Apr 14, 2020Updated 6 years ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆45Oct 28, 2024Updated last year
- Singing Voice Synthesis based on VITS, different from VISinger☆197Nov 13, 2023Updated 2 years ago
- Pytorch Implementation of WaveNODE☆64Sep 4, 2020Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- ☆40Apr 15, 2024Updated 2 years ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆157Jun 17, 2022Updated 4 years ago
- A minimum inference engine for DiffSinger☆37Apr 5, 2024Updated 2 years ago
- code and demo for our CVPR2022 paper "ClothFormer: Taming Video Virtual Try-on in All Module"☆128Apr 27, 2022Updated 4 years ago
- ☆22Oct 10, 2024Updated last year
- The code for the paper "Speech Driven Talking Face Generation from a Single Image and an Emotion Condition"☆172Apr 9, 2023Updated 3 years ago