[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
☆17Feb 11, 2025Updated last year
Alternatives and similar repositories for VGDiffZero
Users who are interested in VGDiffZero are comparing it to the libraries listed below.
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated last year
- [AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models☆42Jan 27, 2026Updated 3 months ago
- [EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models☆124Apr 29, 2026Updated last week
- 📚 Collection of token-level model compression resources.☆195Sep 3, 2025Updated 8 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- Code for CVPR23 Highlight "I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification"…☆20Aug 1, 2023Updated 2 years ago
- [ACCV 2024] Official PyTorch implementation of "Diffusion Model Compression for Image-to-Image Translation"☆22Aug 31, 2025Updated 8 months ago
- ☆10Jun 21, 2024Updated last year
- ☆49Mar 3, 2024Updated 2 years ago
- ☆14Dec 20, 2022Updated 3 years ago
- ☆12Aug 25, 2022Updated 3 years ago
- ☆15Nov 21, 2020Updated 5 years ago
- ☆14Sep 12, 2024Updated last year
- Code for paper "PoseEmbroider:Towards a 3D, Visual, Semantic-aware Human Pose Representation" (ECCV 2024)☆18Nov 18, 2024Updated last year
- Code for Paper Predicting Osteoarthritis Progression via Unsupervised Adversarial Representation Learning☆14Aug 11, 2022Updated 3 years ago
- Unofficial Implementation of Null-text Inversion (https://arxiv.org/abs/2211.09794)☆12Nov 20, 2022Updated 3 years ago
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆31Mar 18, 2026Updated last month
- ☆22Jun 19, 2024Updated last year
- ☆30Oct 8, 2024Updated last year
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆52Feb 10, 2026Updated 2 months ago
- ☆16Sep 11, 2025Updated 7 months ago
- [NeurIPS 2025] U-REPA: Aligning Diffusion U-Nets to ViTs☆35Dec 15, 2025Updated 4 months ago
- Implementation of the Doubly Stochastic Neighbor Embedding on Spheres algorithm published by Yao Lu in Sep. 2016 (Source : https://arxiv.…☆15Apr 8, 2018Updated 8 years ago
- ☆13Jul 3, 2024Updated last year
- Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations is a ServiceNow Research project that was started at Elemen…☆13Jul 31, 2023Updated 2 years ago
- ☆24Nov 21, 2025Updated 5 months ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 8 months ago
- ☆13May 15, 2025Updated 11 months ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- 3D Telecommunications project utilizing Holoportation technology to provide live volumetric capture. Used in one case to increase the re…☆21Apr 15, 2026Updated 3 weeks ago
- Official implementation of the paper "Hero-SR: One-Step Diffusion for Super-Resolution with Human Perception Priors".☆36Mar 19, 2025Updated last year
- ☆14Oct 19, 2025Updated 6 months ago
- the official code of "Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation" (ECCV2024)☆13Jan 14, 2025Updated last year
- LLaVA-Next for STVG☆19Dec 5, 2025Updated 5 months ago
- [CVPR 2023] OCTET: Object-aware Counterfactual Explanations☆19Dec 12, 2024Updated last year
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆46Feb 10, 2026Updated 2 months ago
- The source code for paper:Two-level Consistency Metric for Infrared and Visible Image Fusion☆11Jun 30, 2023Updated 2 years ago
- ☆32Nov 27, 2025Updated 5 months ago
- https://avocado-captioner.github.io/☆34Oct 16, 2025Updated 6 months ago