seermer / RTGen
☆13Updated 8 months ago
Alternatives and similar repositories for RTGen:
Users that are interested in RTGen are comparing it to the libraries listed below
- ☆41Updated 3 months ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆28Updated last year
- ☆29Updated 6 months ago
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024☆34Updated 6 months ago
- [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆80Updated 7 months ago
- ☆42Updated last year
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆69Updated 8 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆48Updated 7 months ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆44Updated this week
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆60Updated 3 weeks ago
- cliptrase☆34Updated 6 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆67Updated 5 months ago
- Official implementation of TagAlign☆34Updated 3 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆73Updated 5 months ago
- ☆9Updated last year
- [CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detect…☆51Updated 6 months ago
- Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"☆89Updated last year
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆51Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆77Updated this week
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆54Updated 5 months ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆80Updated 11 months ago
- [NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer".☆54Updated 9 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆183Updated last year
- ☆24Updated 8 months ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆41Updated last year
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆57Updated last year
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)☆25Updated 4 months ago
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆116Updated 11 months ago
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆45Updated 3 weeks ago