Visual-AI / PancapLinks
[NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text
☆33Updated last month
Alternatives and similar repositories for Pancap
Users that are interested in Pancap are comparing it to the libraries listed below
Sorting:
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆72Updated last year
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆92Updated last year
- Official code of ACM MM2024 paper- Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection☆24Updated last year
- [ICML 2023] MTPD: Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation☆15Updated 2 years ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆64Updated 3 weeks ago
- Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.☆30Updated last month
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆25Updated last year
- [ICCV 2023] Pytorch implementation of "Category-aware Allocation Transformer for Weakly Supervised Object Localization".☆14Updated 2 years ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆30Updated last year
- An Examination of the Compositionality of Large Generative Vision-Language Models☆19Updated last year
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]☆174Updated 3 months ago
- [ICRA 2026] Official implemetation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"☆47Updated this week
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆75Updated last year
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models☆161Updated last year
- This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning".☆13Updated last month
- [ECCV2024] PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery☆30Updated 9 months ago
- ☆14Updated last year
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆215Updated 9 months ago
- Official repository of paper "WeDetect: Fast Open-Vocabulary Object Detection as Retrieval"☆109Updated last month
- [NeurIPS 2025] VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation☆65Updated 4 months ago
- [ICCV2023] CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection☆18Updated 9 months ago
- ☆53Updated last year
- [ECCV 2024 Best Paper Candidate] Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Vi…☆94Updated 6 months ago
- This is the official impletations of the EMNLP Findings paper, VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatia…☆24Updated last year
- ☆37Updated last year
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆148Updated 3 weeks ago
- [CVPR2025] Rethinking Query-based Transformer for Continual Image Segmentation☆41Updated 6 months ago
- The official code for Relational Context Learning for Human-Object Interaction Detection, CVPR2023.☆54Updated 2 years ago
- [TPAMI 2025] Towards Visual Grounding: A Survey☆291Updated 2 months ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆72Updated last year