songw-zju/PixelThink

The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)

☆40

Alternatives and similar repositories for PixelThink

Users who are interested in PixelThink are comparing it to the repositories listed below.

songw-zju / Scribble2Scene
The official implementation of "Label-efficient Semantic Scene Completion with Scribble Annotations" (IJCAI 2024)
☆14 · Jul 27, 2024 · Updated last year
songw-zju / PointLoRA
The official implementation of "PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning" (CVPR 2025)
☆28 · Oct 31, 2025 · Updated 4 months ago
xiaolul2 / Interp3D
[ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."
☆25 · Jan 21, 2026 · Updated last month
MasterHow / OccFiner
Offboard Occupancy Refinement with Hybrid Propagation for Autonomous Driving
☆16 · Feb 10, 2025 · Updated last year
yangluo7 / V-ReasonBench
☆31 · Feb 18, 2026 · Updated 2 weeks ago
earth-insights / DescribeEarth
DescribeEarth: Describe Anything for Remote Sensing Images
☆23 · Feb 24, 2026 · Updated last week
As-Time-Goes-By / OmniSegNet
☆18 · Jan 5, 2026 · Updated 2 months ago
NJU-LHRS / ScoreRS
Code and updates for the ScoreRS project.
☆41 · Sep 19, 2025 · Updated 5 months ago
Fayeben / ADAS
A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation
☆13 · Dec 22, 2022 · Updated 3 years ago
lisat-bair / LISAt_code
☆28 · Sep 2, 2025 · Updated 6 months ago
ItIsFriday / PcdSeg
☆12 · Nov 28, 2022 · Updated 3 years ago
earth-insights / awesome-MLLM-for-image-segmentation
Paper list for LLM/MLLM-based image segmentation
☆47 · Dec 24, 2025 · Updated 2 months ago
worldbench / awesome-3d-in-the-wild
🌐 A Roadmap for 3D Scene Understanding in the Wild
☆23 · Dec 19, 2025 · Updated 2 months ago
Hatins / FAOD-master
☆23 · Feb 27, 2026 · Updated last week
MasterHow / OneOcc
An official implementation for "OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera"
☆29 · Nov 6, 2025 · Updated 4 months ago
Divadi / MTC_RCNN
☆15 · Jul 9, 2021 · Updated 4 years ago
ltriess / kitti_scan_unfolding
Python Implementation for KITTI Scan Unfolding
☆16 · Jun 20, 2025 · Updated 8 months ago
congvvc / InstructSeg
[ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"
☆53 · Feb 10, 2025 · Updated last year
turingmotors / ACT-Bench
ACT-Bench – We Evaluate Action-Fidelity of World Models for Autonomous Driving
☆26 · Dec 23, 2024 · Updated last year
fscdc / ReasonMap
[CVPR 2026] ReasonMap: Towards Fine-Grained Visual Reasoning from Transit Maps
☆77 · Feb 22, 2026 · Updated 2 weeks ago
EnVision-Research / Scale-BEV
☆54 · May 1, 2025 · Updated 10 months ago
dianzl / SODFormer
☆57 · May 18, 2024 · Updated last year
Joanna-0421 / COSMIC
[NeurIPS 2024] COSMIC: Compress Satellite Images Efficiently via Diffusion Compensation
☆23 · Oct 2, 2024 · Updated last year
worldbench / Calib3D
[WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding
☆70 · Dec 6, 2025 · Updated 3 months ago
llijiang / GuidedContrast
☆28 · Oct 29, 2022 · Updated 3 years ago
ldkong1205 / OpenESS
[CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies
☆72 · Aug 22, 2025 · Updated 6 months ago
earth-insights / SegEarth-R1
SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model
☆141 · Jan 21, 2026 · Updated last month
ldkong1205 / awesome-3d-da
A curated list of awesome 3D domain adaptation resources
☆27 · Dec 31, 2021 · Updated 4 years ago
DylanOrange / geal
[CVPR 2025] GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency
☆44 · Nov 2, 2025 · Updated 4 months ago
cfzd / UniFusion
☆26 · Mar 20, 2023 · Updated 2 years ago
Yuanshi9815 / LiteFocus
[Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.
☆34 · Mar 11, 2025 · Updated 11 months ago
SuperZ-Liu / PolarBEV
The official code of PolarBEV (CoRL2022).
☆56 · Sep 17, 2022 · Updated 3 years ago
FanScy / BEVInstructor
[ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models
☆30 · Jul 16, 2024 · Updated last year
ltriess / semantic_kitti_stats
Get some nice plots with statistics about the Semantic KITTI dataset
☆27 · Jun 21, 2022 · Updated 3 years ago
LeapLabTHU / Text4Point
☆37 · Jan 18, 2023 · Updated 3 years ago
justchenhao / SILI_CD
Official Pytorch Implementation of “Continuous Cross-resolution Remote Sensing Image Change Detection”
☆33 · Nov 26, 2023 · Updated 2 years ago
hzykent / LiDAL
Implementation of ECCV2022 paper - LiDAL: Inter-frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation
☆34 · Nov 22, 2022 · Updated 3 years ago
HongkLin / TIDE
[CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes
☆54 · Apr 9, 2025 · Updated 11 months ago
GaavaMa / Causal-Diffusion-Policy
☆29 · Aug 6, 2025 · Updated 7 months ago