SNU-DRL / HRV
Official PyTorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models" (ICLR 2025)
☆10Updated 3 months ago
Alternatives and similar repositories for HRV
Users that are interested in HRV are comparing it to the libraries listed below
- unofficial☆12Updated last year
- SuperGS: Super-Resolution 3D Gaussian Splatting Enhanced by Variational Residual Features and Uncertainty-Augmented Learning☆10Updated 6 months ago
- ☆13Updated 5 months ago
- ☆26Updated 8 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Updated last year
- Code for the paper "ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions" published at CVPR 2025☆19Updated 9 months ago
- Image Tokenizer Needs Post-Training☆24Updated 2 months ago
- [ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆61Updated 5 months ago
- ☆14Updated last year
- This repository contains the implementation of the paper: "ChatCam: Empowering Camera Control through Conversational AI", NeurIPS 2024.☆19Updated last year
- ☆24Updated 5 months ago
- [ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model☆79Updated 4 months ago
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation☆32Updated 2 months ago
- ☆11Updated last year
- Official code for the paper: Can3Tok (ICCV2025)☆32Updated 3 months ago
- [NeurIPS 2025]《SD-VLM: Spatial Measuring and Understanding with Depth-encoded Vision Language Models》☆29Updated last month
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO☆73Updated 2 weeks ago
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆52Updated 3 months ago
- Code implementation for: From Virtual Games to Real-World Play☆44Updated 5 months ago
- [⭐️ WACV 2025 Oral ⭐️] PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition☆26Updated 6 months ago
- [ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation☆23Updated last year
- ☆20Updated 8 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆24Updated 8 months ago
- Sora Generates Videos with Stunning Geometrical Consistency☆52Updated last year
- An operation trying to do the opposite of F.grid_sample☆20Updated 2 years ago
- Official implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)☆78Updated 4 months ago
- ☆35Updated last year
- ☆51Updated last year
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Updated 2 years ago
- Official implementation of LaVin-DiT☆49Updated 10 months ago