[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
☆17Feb 11, 2025Updated last year
Alternatives and similar repositories for VGDiffZero
Users that are interested in VGDiffZero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated last year
- [EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models☆82Apr 10, 2026Updated last week
- 📚 Collection of token-level model compression resources.☆194Sep 3, 2025Updated 7 months ago
- [ACCV 2024] Official PyTorch implementation of "Diffusion Model Compression for Image-to-Image Translation"☆22Aug 31, 2025Updated 7 months ago
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆76Feb 9, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Jun 21, 2024Updated last year
- ☆49Mar 3, 2024Updated 2 years ago
- Pytorch implementation and comparison of Fourier Feature Networks and Sinusoidal Representation Networks☆13Jun 27, 2020Updated 5 years ago
- [ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching☆215Mar 14, 2025Updated last year
- ☆14Dec 20, 2022Updated 3 years ago
- Code for the paper: "U2Net: A General Framework with Spatial-Spectral-Integrated Double U-Net for Image Fusion", ACM MM 2023☆24Nov 14, 2023Updated 2 years ago
- Code for paper "PoseEmbroider:Towards a 3D, Visual, Semantic-aware Human Pose Representation" (ECCV 2024)☆18Nov 18, 2024Updated last year
- Code for Paper Predicting Osteoarthritis Progression via Unsupervised Adversarial Representation Learning☆14Aug 11, 2022Updated 3 years ago
- Unofficial Implementation of Null-text Inversion (https://arxiv.org/abs/2211.09794)☆12Nov 20, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆30Mar 18, 2026Updated last month
- ☆10Dec 3, 2024Updated last year
- ☆22Jun 19, 2024Updated last year
- ☆29Oct 8, 2024Updated last year
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆49Feb 10, 2026Updated 2 months ago
- ☆16Sep 11, 2025Updated 7 months ago
- [NeurIPS 2025] U-REPA: Aligning Diffusion U-Nets to ViTs☆35Dec 15, 2025Updated 4 months ago
- Implementation of the Doubly Stochastic Neighbor Embedding on Spheres algorithm published by Yao Lu in Sep. 2016 (Source : https://arxiv.…☆15Apr 8, 2018Updated 8 years ago
- ☆13Jul 3, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations is a ServiceNow Research project that was started at Elemen…☆13Jul 31, 2023Updated 2 years ago
- REALM: A Real-to-Sim Validated Benchmark for Generalization in Robotic Manipulation☆49Apr 10, 2026Updated last week
- ☆23Nov 21, 2025Updated 4 months ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 8 months ago
- ☆13May 15, 2025Updated 11 months ago
- ☆17Aug 17, 2021Updated 4 years ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- ☆14Jun 19, 2024Updated last year
- ☆14Oct 19, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- the official code of "Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation" (ECCV2024)☆13Jan 14, 2025Updated last year
- LLaVA-Next for STVG☆19Dec 5, 2025Updated 4 months ago
- Dual Progressive Prototype Network for Generalized Zero-Shot Learning☆25Dec 9, 2021Updated 4 years ago
- 🧩 [AAAI 2024] ISP-Teacher: Image Signal Process with Disentanglement Regularization for Unsupervised Domain Adaptive Dark Object Detecti…☆35Apr 7, 2026Updated last week
- A new dataset for fusion network training and evaluation☆19Jul 18, 2025Updated 9 months ago
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆44Feb 10, 2026Updated 2 months ago
- 一款即插即用的知识蒸馏工具包☆11May 16, 2022Updated 3 years ago