wysnzzzz / DITView external linksLinks
☆18Nov 15, 2024Updated last year
Alternatives and similar repositories for DIT
Users that are interested in DIT are comparing it to the libraries listed below
Sorting:
- ☆11Mar 11, 2025Updated 11 months ago
- [2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation☆19Nov 8, 2025Updated 3 months ago
- ☆33Feb 29, 2024Updated last year
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆22Nov 17, 2025Updated 2 months ago
- [ICML2024]The official implementation of SemiRES in PyTorch.☆33Jun 20, 2024Updated last year
- [CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation☆32Oct 18, 2024Updated last year
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Jul 11, 2024Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆19Apr 9, 2025Updated 10 months ago
- (TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"☆23Mar 14, 2025Updated 11 months ago
- CatMAE☆14Dec 13, 2023Updated 2 years ago
- The code of Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks☆25Apr 10, 2024Updated last year
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆70Apr 7, 2024Updated last year
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…☆40Apr 20, 2025Updated 9 months ago
- ☆23Jan 24, 2024Updated 2 years ago
- ☆19Oct 9, 2024Updated last year
- [ICCV 2023] Official code release of our paper "Referring Image Segmentation Using Text Supervision"☆73Oct 13, 2024Updated last year
- ☆45Feb 4, 2022Updated 4 years ago
- Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)☆22Sep 27, 2023Updated 2 years ago
- This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"☆26Aug 24, 2023Updated 2 years ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆27Oct 13, 2024Updated last year
- ☆30Jan 18, 2026Updated 3 weeks ago
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Mar 20, 2025Updated 10 months ago
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆18Jul 22, 2024Updated last year
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆25Dec 8, 2024Updated last year
- ☆26Mar 26, 2025Updated 10 months ago
- [ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation☆58Aug 1, 2025Updated 6 months ago
- ☆60Aug 12, 2024Updated last year
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆29Nov 13, 2025Updated 3 months ago
- [ICCV2023] Isomer: Isomerous Transformer for Zero-Shot Video Object Segmentation☆30Nov 21, 2023Updated 2 years ago
- [NeurIPS‘24] Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attention☆27Jun 15, 2025Updated 7 months ago
- Segment Anything with Deictic Prompting☆27May 13, 2025Updated 9 months ago
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆41Aug 4, 2025Updated 6 months ago
- LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning☆195Apr 16, 2024Updated last year
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].☆35Nov 2, 2024Updated last year
- Official Implementation of ECCV2024 paper: SLAck☆29Sep 18, 2024Updated last year
- Official PyTorch implementation of PiClick: Picking the desired mask in click-based interactive segmentation.☆26Jul 2, 2024Updated last year
- Segment This Thing is an efficient image segmentation models that uses a biologically-inspired foveated tokenization to reduce inference …☆56Jun 16, 2025Updated 7 months ago
- 🔥 [CVPR 2024] The official repo for Zero-Painter!☆70Jun 8, 2024Updated last year
- This repo is the official pytorch implementation of the paper: CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-V…☆40Sep 10, 2025Updated 5 months ago