hustvl / WeakCLIP
[IJCV 2024]
☆14Updated 4 months ago
Alternatives and similar repositories for WeakCLIP:
Users that are interested in WeakCLIP are comparing it to the libraries listed below
- ☆16Updated last year
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆12Updated 11 months ago
- ☆25Updated 2 months ago
- Official repo for our ECCV'24 paper: Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.☆33Updated 6 months ago
- ☆54Updated 2 weeks ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆41Updated 6 months ago
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆29Updated 11 months ago
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆39Updated 2 months ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Updated last year
- [ACM MM 2024] WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition☆47Updated 6 months ago
- Open-Vocabulary Panoptic Segmentation☆23Updated 6 months ago
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆45Updated 3 weeks ago
- ☆36Updated 3 weeks ago
- ☆10Updated 4 months ago
- ☆33Updated last week
- Segment Anything with Deictic Prompting☆25Updated 4 months ago
- The offical implemention of JM3D.☆29Updated last year
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆19Updated last month
- [CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories☆26Updated 2 weeks ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Updated 8 months ago
- ☆49Updated 6 months ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆37Updated last year
- [AAAI 2024] Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection☆11Updated 2 months ago
- This is the official implementation of "LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels" (Accepted at C…☆28Updated 3 weeks ago
- [AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…☆39Updated last year
- [ICCV2023] NoiseDet: Learning from Noisy Data for Semi-Superivsed 3D Object Detection☆21Updated 2 years ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆79Updated last month
- [CVPR 2025] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning"☆63Updated 3 weeks ago