End2End Virtual Try-on with Visual Reference, CVPR2026
☆58 · Nov 19, 2025 · Updated 3 months ago
Alternatives and similar repositories for RefVTON
Users who are interested in RefVTON are comparing it to the repositories listed below
- Code for the paper Proactive Hearing Assistants that Isolate Egocentric Conversations ☆43 · Nov 19, 2025 · Updated 3 months ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars" ☆41 · Nov 20, 2025 · Updated 3 months ago
- [AAAI 2026] UltraGen ☆77 · Feb 1, 2026 · Updated 3 weeks ago
- https://little-misfit.github.io/GRAG-Image-Editing/ ☆115 · Nov 27, 2025 · Updated 3 months ago
- ☆86 · Feb 4, 2026 · Updated 3 weeks ago
- [ICLR 2026] NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks ☆135 · Oct 20, 2025 · Updated 4 months ago
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit ☆106 · Feb 21, 2026 · Updated last week
- Reflection Removal through Efficient Adaptation of Diffusion Transformers ☆120 · Dec 5, 2025 · Updated 2 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026] ☆71 · Jan 15, 2026 · Updated last month
- UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning ☆158 · Jun 2, 2025 · Updated 8 months ago
- Wan 2.5 AI Video Generator - Transform text & images into HD videos with synchronized audio ☆78 · Sep 25, 2025 · Updated 5 months ago
- [SIGGRAPH Asia 2025] Official Implementation of "ConsistEdit: Highly Consistent and Precise Training-free Visual Editing" ☆68 · Dec 2, 2025 · Updated 2 months ago
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories ☆78 · Feb 17, 2026 · Updated last week
- Scaling Zero-Shot Reference-to-Video Generation ☆62 · Dec 11, 2025 · Updated 2 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models ☆90 · Sep 11, 2025 · Updated 5 months ago
- A Unified Visual Generator with Interleaved OmniModal Context ☆185 · Feb 10, 2026 · Updated 2 weeks ago
- VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control ☆315 · Updated this week
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation" ☆69 · Updated this week
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control ☆167 · Dec 11, 2025 · Updated 2 months ago
- Inference server for MioTTS, a lightweight and fast LLM-based TTS model. ☆97 · Feb 14, 2026 · Updated 2 weeks ago
- Animate Any Character in Any World ☆90 · Jan 9, 2026 · Updated last month
- LFSMIM: A Low-Frequency Spectral Masked Image Modeling Method for Hyperspectral Image Classification ☆12 · Mar 7, 2024 · Updated last year
- Transferring Genshin PVs into a freehand style with Diffusion Model. ☆10 · Jun 5, 2024 · Updated last year
- A collection of Python tools used to create GGUF files and upload them to Hugging Face ☆17 · Feb 20, 2026 · Updated last week
- Copilot with deepseek and more... ☆13 · Mar 7, 2025 · Updated 11 months ago
- Official implementation of "VideoMaMa: Mask-Guided Video Matting via Generative Prior", CVPR 2026 ☆279 · Feb 7, 2026 · Updated 3 weeks ago
- ToonOut, a fork of BiRefNet focused on background removal for anime images. We open-source our dataset & our weights. See our paper at: h… ☆82 · Sep 10, 2025 · Updated 5 months ago
- China University of Mining and Technology undergraduate thesis Word template, 2023 edition