[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
☆17Feb 11, 2025Updated last year
Alternatives and similar repositories for VGDiffZero
Users that are interested in VGDiffZero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated last year
- ☆31Jun 14, 2024Updated last year
- 📚 Collection of token-level model compression resources.☆193Sep 3, 2025Updated 6 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆76Feb 9, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆10Jun 21, 2024Updated last year
- ☆49Mar 3, 2024Updated 2 years ago
- Code for "TAG: Guidance-free Open-Vocabulary Semantic Segmentation"☆15Jul 13, 2024Updated last year
- [ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching☆214Mar 14, 2025Updated last year
- ☆12Aug 25, 2022Updated 3 years ago
- ☆12Sep 12, 2024Updated last year
- Code for Paper Predicting Osteoarthritis Progression via Unsupervised Adversarial Representation Learning☆14Aug 11, 2022Updated 3 years ago
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆49Feb 10, 2026Updated last month
- Unofficial Implementation of Null-text Inversion (https://arxiv.org/abs/2211.09794)☆12Nov 20, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆29Oct 8, 2024Updated last year
- ☆14Sep 11, 2025Updated 6 months ago
- [NeurIPS 2025] U-REPA: Aligning Diffusion U-Nets to ViTs☆34Dec 15, 2025Updated 3 months ago
- ☆13Jul 3, 2024Updated last year
- REALM: A Real-to-Sim Validated Benchmark for Generalization in Robotic Manipulation☆50Mar 18, 2026Updated last week
- Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations is a ServiceNow Research project that was started at Elemen…☆13Jul 31, 2023Updated 2 years ago
- ☆13May 15, 2025Updated 10 months ago
- PyTorch code for our paper "Binarized Dual Residual Network for 3D Whole-body Human Mesh Recovery"☆15Dec 2, 2023Updated 2 years ago
- ☆17Aug 17, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- 3D Telecommunications project utilizing Holoportation technology to provide live volumetric capture. Used in one case to increase the re…☆20Feb 20, 2026Updated last month
- Official implementation of the paper "Hero-SR: One-Step Diffusion for Super-Resolution with Human Perception Priors".☆36Mar 19, 2025Updated last year
- LLaVA-Next for STVG☆18Dec 5, 2025Updated 3 months ago
- [CVPR 2023] OCTET: Object-aware Counterfactual Explanations☆19Dec 12, 2024Updated last year
- A new dataset for fusion network training and evaluation☆19Jul 18, 2025Updated 8 months ago
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆44Feb 10, 2026Updated last month
- 一款即插即用的知识蒸馏工具包☆11May 16, 2022Updated 3 years ago
- The source code for paper:Two-level Consistency Metric for Infrared and Visible Image Fusion☆11Jun 30, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆31Nov 27, 2025Updated 4 months ago
- ☆16Mar 24, 2025Updated last year
- This repository will list some codes of image fusion evaluation metrics, including: EN CC SD SCD MI FMI PSNR SF SSIM Qabf MG Q…☆15Jan 9, 2022Updated 4 years ago
- Code for A Dual Domain Multi-exposure Image Fusion Network Based on the Spatial-frequency Integration.☆12Jul 25, 2024Updated last year
- 第九届中国软件杯视频全量分析“一等奖”&第十届中国软件杯A2百度paddlepaddle跟踪赛道“二等奖”☆10Jul 10, 2023Updated 2 years ago
- ☆18Mar 21, 2025Updated last year
- SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability☆17May 8, 2025Updated 10 months ago