lygsbw / UMG-CLIPLinks
☆11Updated 8 months ago
Alternatives and similar repositories for UMG-CLIP
Users that are interested in UMG-CLIP are comparing it to the libraries listed below
Sorting:
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆26Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆53Updated 2 months ago
- ☆27Updated 8 months ago
- ☆34Updated last year
- ☆44Updated 6 months ago
- ☆32Updated last year
- ☆112Updated last year
- [NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception☆43Updated last year
- ☆29Updated 6 months ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆60Updated 8 months ago
- A Siamese self-supervised pretraining approach for the Transformer architecture in DETR☆37Updated 2 years ago
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Updated last year
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- Open-vocabulary Semantic Segmentation☆33Updated last year
- ☆23Updated last year
- Official implementation of TagAlign☆35Updated 7 months ago
- (ICLR 2024, CVPR 2024) SparseFormer☆74Updated 8 months ago
- Unifying Visual Perception by Dispersible Points Learning (ECCV 2022)☆51Updated 2 years ago
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆56Updated 8 months ago
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆81Updated last month
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆85Updated last year
- Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention (CVPR 2023)☆32Updated 2 years ago
- This is the official PyTorch implementation of ASAG (ICCV 2023).☆18Updated last year
- Distilling the powerful segment anything models into lightweight ones for efficient segmentation.☆30Updated 2 years ago
- Teach-DETR: Better Training DETR with Teachers☆31Updated last year
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆50Updated 6 months ago
- [CBMI2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".☆27Updated 2 months ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆18Updated last year
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆98Updated last year