songw-zju / PixelThinkLinks
The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)
☆37Updated 4 months ago
Alternatives and similar repositories for PixelThink
Users that are interested in PixelThink are comparing it to the libraries listed below
Sorting:
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆90Updated 6 months ago
- Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆55Updated 4 months ago
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆36Updated last week
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆40Updated 6 months ago
- LENS: Learning to Segment Anything with Unified Reinforced Reasoning☆44Updated last month
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆68Updated 3 months ago
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆46Updated 2 weeks ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆29Updated 5 months ago
- CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms☆24Updated 4 months ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆38Updated last year
- ☆30Updated last year
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Updated last year
- [ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"☆47Updated 8 months ago
- ☆39Updated 3 months ago
- ☆13Updated 9 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆75Updated last year
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆23Updated 3 weeks ago
- [CVPR 2025] Test-Time Visual In-Context Tuning☆25Updated 6 months ago
- [IJCV 2024]☆16Updated 10 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆53Updated last year
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs☆82Updated 2 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆43Updated last year
- [ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects☆51Updated last year
- Segment Anything with Deictic Prompting☆27Updated 4 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆19Updated 11 months ago
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆32Updated 4 months ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆53Updated 3 months ago
- [ICLR'25] Reconstructive Visual Instruction Tuning☆119Updated 6 months ago
- This is the project for 'USG'.☆27Updated 6 months ago
- "Visual Prompt Selection for In-Context Learning Segmentation Framework"☆15Updated 9 months ago