[AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing
☆25Nov 20, 2025Updated 3 months ago
Alternatives and similar repositories for Zero2Hero
Users that are interested in Zero2Hero are comparing it to the libraries listed below
- Official implementation of “ACE: Anti-Editing Concept Erasure in Text-to-Image Models”☆14Jan 5, 2026Updated 2 months ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆21Jul 26, 2025Updated 7 months ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Mar 21, 2024Updated last year
- ☆17Feb 20, 2025Updated last year
- ☆31Jul 16, 2025Updated 7 months ago
- [CVPR 2025] GPS as a Control Signal for Image Generation☆25Mar 18, 2025Updated 11 months ago
- [ACM Multimedia 2024] Shape-Guided Clothing Warping for Virtual Try-On☆29May 14, 2025Updated 9 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Aug 26, 2025Updated 6 months ago
- ☆26Jun 20, 2024Updated last year
- ☆13Dec 9, 2020Updated 5 years ago
- the official code of "Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation" (ECCV2024)☆13Jan 14, 2025Updated last year
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆39Jun 9, 2025Updated 8 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Non-official implementation of the paper: DenseLiDAR: A Real-time Pseudo Dense Depth Guided Depth Completion Network (ICRA 2021)☆12Dec 23, 2024Updated last year
- [ICCV 2025] "Player-Centric Multimodal Prompt Generation for Large Language Model Based Identity-Aware Basketball Video Captioning".☆17Dec 11, 2025Updated 2 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k☆11Mar 14, 2024Updated last year
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)☆48Apr 10, 2025Updated 10 months ago
- ☆47Apr 20, 2025Updated 10 months ago
- official code for our IJCV paper "Relation-Guided Adversarial Learning for Data-Free Knowledge Transfer"☆10Dec 27, 2024Updated last year
- Official implementation of AAAI-2024 paper "Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain"☆13Jun 17, 2024Updated last year
- CVPR 2025 Accepted Papers☆24Dec 20, 2025Updated 2 months ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- ☆11Nov 30, 2025Updated 3 months ago
- 2018-2024 in-depth completion of top papers, open source code summary! (Continuous update)☆13Sep 1, 2024Updated last year
- [ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks☆30Feb 5, 2026Updated last month
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 8 months ago
- ☆22Feb 3, 2026Updated last month
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics☆22Nov 18, 2025Updated 3 months ago
- ☆14May 20, 2025Updated 9 months ago
- A collection of papers on zero-shot/data-free knowledge distillation from 2019 to 2021☆11Sep 8, 2021Updated 4 years ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆20Sep 24, 2025Updated 5 months ago
- ☆18May 15, 2025Updated 9 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 4 months ago
- [ICCV 2023] Code for "Multi-task View Synthesis with Neural Radiance Fields"☆11Oct 2, 2023Updated 2 years ago
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Jan 16, 2023Updated 3 years ago
- RealESRGAN high order degradation pipeline☆11Mar 20, 2025Updated 11 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year