SNU-DRL / HRVLinks
Official Pytorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models" (ICLR 2025)
☆9Updated 8 months ago
Alternatives and similar repositories for HRV
Users that are interested in HRV are comparing it to the libraries listed below
Sorting:
- unofficial☆10Updated 9 months ago
- SuperGS: Super-Resolution 3D Gaussian Splatting Enhanced by Variational Residual Features and Uncertainty-Augmented Learning☆10Updated 2 months ago
- ☆10Updated 3 weeks ago
- ☆9Updated 6 months ago
- [ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation☆17Updated last week
- [ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding☆9Updated 3 months ago
- [⭐️ WACV 2025 Oral ⭐️] PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition☆17Updated 2 months ago
- ☆23Updated last month
- ☆14Updated 9 months ago
- ☆14Updated 4 months ago
- ☆14Updated 9 months ago
- [ICLR 25'] InstantSplamp: Fast and Generalizable Stenography Framework for Generative Gaussian Splatting☆20Updated 4 months ago
- ☆11Updated last week
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Updated 10 months ago
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆48Updated 3 months ago
- Code for the paper "ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions" published at CVPR 2025☆18Updated 4 months ago
- ☆11Updated 4 months ago
- ☆19Updated 4 months ago
- This repository contains the implementation of the paper: "ChatCam: Empowering Camera Control through Conversational AI", NeurIPS 2024.☆17Updated 8 months ago
- ☆13Updated 4 months ago
- ☆20Updated 2 weeks ago
- Code implementation for: From Virtual Games to Real-World Play☆37Updated last month
- An unofficial implementation of Tensor4D with support for the D-NeRF dataset☆12Updated last year
- ☆92Updated last month
- Flash Sculptor: Modular 3D Worlds from Objects☆32Updated 3 months ago
- ☆23Updated 4 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆22Updated 4 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆37Updated this week
- Official implementation of LaVin-DiT☆39Updated 6 months ago
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆40Updated this week