xuyang-liu16 / VGDiffZero
[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders
☆13Updated this week
Alternatives and similar repositories for VGDiffZero:
Users that are interested in VGDiffZero are comparing it to the libraries listed below
- ☆25Updated 7 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆114Updated 3 weeks ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆33Updated last year
- [TPAMI2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆19Updated 4 months ago
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆64Updated 9 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆95Updated 3 weeks ago
- Liquid: Language Models are Scalable Multi-modal Generators☆60Updated last month
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Pekin…☆67Updated 2 months ago
- [NeurIPS 2024] Visual Perception by Large Language Model’s Weights☆35Updated 3 months ago
- Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language☆22Updated this week
- ☆14Updated this week
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆80Updated 10 months ago
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆89Updated 7 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆67Updated 3 months ago
- 📚 Collection of token reduction for model compression resources.☆20Updated this week
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆103Updated this week
- XQ-GAN🚀: An Open-source Image Tokenization Framework for Autoregressive Generation☆179Updated last month
- The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation☆17Updated last month
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)☆23Updated 2 months ago
- ☆117Updated 7 months ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated last month
- Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆37Updated 3 weeks ago
- [ECCV2024]The official implementation of the DiffPNG paper in PyTorch.☆11Updated 3 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆66Updated 3 months ago
- CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts☆48Updated 4 months ago
- This is the official implementation for ControlVAR.☆91Updated last month
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions☆121Updated last month
- ☆31Updated 3 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆105Updated this week
- ☆114Updated 7 months ago