yilin-bao / unofficial-SiameseMAE
unofficial pytorch implement for Siamese-Masked Autoencoder
☆9Updated last year
Alternatives and similar repositories for unofficial-SiameseMAE:
Users that are interested in unofficial-SiameseMAE are comparing it to the libraries listed below
- A list of referring video object segmentation papers☆30Updated this week
- ☆17Updated last month
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆14Updated 8 months ago
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆31Updated last year
- All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment☆16Updated last month
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆69Updated 6 months ago
- ☆16Updated 5 months ago
- ☆57Updated 7 months ago
- Awesome video instance segmentation papers☆38Updated last week
- ☆49Updated 6 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆28Updated last year
- cliptrase☆34Updated 7 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆48Updated 7 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆67Updated 5 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆72Updated this week
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆70Updated 3 months ago
- ☆16Updated 3 months ago
- An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆80Updated 2 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆79Updated last week
- Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆22Updated 3 months ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆60Updated 11 months ago
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference☆152Updated 5 months ago
- The official implementation of paper Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model. If you find our code or paper use…☆46Updated last year
- ☆20Updated 7 months ago
- High Quality Video Reasoning Segmentation☆18Updated 3 weeks ago
- [CVPR 2024] Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective☆18Updated 7 months ago
- Official PyTorch Code for "ATPrompt: Textual Prompt Learning with Embedded Attributes"☆29Updated 3 months ago
- ☆66Updated 6 months ago
- Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)☆24Updated 9 months ago
- NTIRE 2025 Challenge on 1-st Cross-Domain Few-Shot Object Detection @ CVPR 2025☆35Updated last week