PolyU-ChenLab / UniPixelLinks
๐ฎ UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)
โ66Updated this week
Alternatives and similar repositories for UniPixel
Users that are interested in UniPixel are comparing it to the libraries listed below
Sorting:
- Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoningโ122Updated 3 months ago
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolutionโ52Updated 7 months ago
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".โ169Updated 9 months ago
- โ85Updated last month
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perceptionโ126Updated 3 months ago
- Vision Manus: Your versatile Visual AI assistantโ276Updated last month
- โ47Updated 9 months ago
- Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understandingโ203Updated 8 months ago
- โ45Updated 2 months ago
- โ69Updated last year
- Official repo of Griffon series including v1(ECCV 2024), v2(ICCV 2025), G, and R, and also the RL tool Vision-R1.โ236Updated last month
- [NeurIPS2025 Spotlight ๐ฅ ] Official implementation of ๐ธ "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Languโฆโ220Updated last week
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeedโ101Updated 11 months ago
- Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"โ109Updated this week
- Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"โ47Updated 3 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"โ40Updated 6 months ago
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.โ59Updated 6 months ago
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"โ253Updated 9 months ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Groundingโ62Updated 11 months ago
- โ13Updated 9 months ago
- New generation of CLIP with fine grained discrimination capability, ICML2025โ305Updated last week
- [NeurIPS 2024 Spotlight โญ๏ธ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)โ101Updated 2 months ago
- โ86Updated last year
- Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"โ143Updated 2 months ago
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Modelโ97Updated last year
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-trainingโ87Updated 2 months ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anythingโ69Updated last year
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examplesโ62Updated 11 months ago
- Recognize Any Regionsโ122Updated 9 months ago
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasetsโ50Updated 2 months ago