Towards Pixel-Level VLM Perception via Simple Points Prediction
☆97Feb 9, 2026Updated last month
Alternatives and similar repositories for SimpleSeg
Users that are interested in SimpleSeg are comparing it to the libraries listed below
Sorting:
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆37Mar 13, 2026Updated last week
- ☆85Oct 10, 2025Updated 5 months ago
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆25Feb 14, 2026Updated last month
- [CVPR 2026] SegEarth-R2: Towards Comprehensive Language-guided Segmentation for Remote Sensing Images☆43Jan 24, 2026Updated last month
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026)☆131Feb 6, 2026Updated last month
- Open Ended Medical Reinforcement Learning☆35Updated this week
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆47Mar 7, 2026Updated last week
- ☆60Feb 6, 2026Updated last month
- ☆18Nov 16, 2025Updated 4 months ago
- ☆13Feb 2, 2025Updated last year
- ☆12Oct 7, 2024Updated last year
- official implementation of Splat Feature Solver: https://arxiv.org/abs/2508.12216☆38Feb 4, 2026Updated last month
- ☆32Jan 30, 2026Updated last month
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆34Feb 4, 2026Updated last month
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking☆11Sep 3, 2024Updated last year
- Project Page for ICLR'26-CoPRS, offering training overview, inference code, and downloadable links.☆20Mar 11, 2026Updated last week
- [NeurIPS 2025] Frame In-N-Out: Unbounded Controllable Image-to-Video Generation☆31Jan 5, 2026Updated 2 months ago
- Geo-OLMs Repo: Accepted to ACM COMPASS 2025☆20Jun 17, 2025Updated 9 months ago
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆55Jan 28, 2026Updated last month
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 2 months ago
- ☆27Jun 3, 2025Updated 9 months ago
- A unified robotic manipulation learning framework☆21Sep 4, 2025Updated 6 months ago
- CoV: Chain-of-View Prompting for Spatial Reasoning☆52Jan 23, 2026Updated last month
- TBD☆49Mar 13, 2026Updated last week
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆44Jan 23, 2026Updated last month
- Green-VLA: Staged Vision-Language-Action Model for Generalist Robots☆109Mar 5, 2026Updated 2 weeks ago
- Official implementation of DA²: Depth Anything in Any Direction☆253Dec 9, 2025Updated 3 months ago
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- ☆43Sep 1, 2025Updated 6 months ago
- official code of KDGraph for road graph extraction☆15Mar 29, 2025Updated 11 months ago
- [ICCV 2025] CompleteMe: Reference-based Human Image Completion☆26Jan 20, 2026Updated 2 months ago
- Beyond KV Caching: Shared Attention for Efficient LLMs☆20Jul 19, 2024Updated last year
- ☆31May 26, 2025Updated 9 months ago
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆38Updated this week
- GEOSatDB is a semantic representation of Earth observation satellites and sensors that can be used to easily discover available Earth obs…☆15Aug 6, 2024Updated last year
- ☆228Jul 17, 2025Updated 8 months ago
- Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery. Foundation Model for Field Boundary Delineation.☆109Jan 16, 2026Updated 2 months ago
- Official repo for StyleMe3D☆28Apr 22, 2025Updated 10 months ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year