SNU-DRL / HRV
Official PyTorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models" (ICLR 2025)
☆11Updated 5 months ago
Alternatives and similar repositories for HRV
Users who are interested in HRV are comparing it to the libraries listed below
- ☆12Updated 6 months ago
- SuperGS: Super-Resolution 3D Gaussian Splatting Enhanced by Variational Residual Features and Uncertainty-Augmented Learning☆11Updated 8 months ago
- unofficial☆12Updated last year
- ☆25Updated 10 months ago
- [ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆61Updated 7 months ago
- ☆25Updated 7 months ago
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆56Updated 4 months ago
- ☆14Updated last year
- Official implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)☆82Updated 6 months ago
- UniCon: A Simple Approach to Unifying Diffusion-based Conditional Generation (ICLR 2025)☆36Updated 7 months ago
- ☆20Updated 10 months ago
- [ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model☆82Updated 6 months ago
- Image Tokenizer Needs Post-Training☆24Updated 4 months ago
- ☆20Updated last month
- Code implementation for: From Virtual Games to Real-World Play☆46Updated 7 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Updated last year
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation☆33Updated 3 months ago
- MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE☆48Updated this week
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (ICLR 2026)☆41Updated 7 months ago
- Codebase for the paper HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Updated last year
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory☆62Updated 3 weeks ago
- ☆52Updated last year
- ☆42Updated 3 weeks ago
- ☆133Updated 10 months ago
- Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning (CVPR 2025)☆33Updated 8 months ago
- [ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation☆24Updated last year
- ☆25Updated last year
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …☆29Updated 2 weeks ago
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Updated last year
- ☆53Updated last year