NVIDIA-AI-IOT / clip-distillation
Zero-label image classification via OpenCLIP knowledge distillation
☆138Updated 2 years ago
Alternatives and similar repositories for clip-distillation
Users who are interested in clip-distillation are comparing it to the libraries listed below
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance☆118Updated last year
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆85Updated 7 months ago
- Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]☆60Updated 8 months ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆106Updated last year
- ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection (CVPR2023)☆54Updated 2 years ago
- ☆128Updated 2 years ago
- Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning☆127Updated 5 months ago
- Official codes of ICCV2023 paper: <<FemtoDet: an object detection baseline for energy versus performance tradeoffs>>☆66Updated last year
- Detection Transformers with Assignment☆262Updated 2 years ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆86Updated last year
- SSA + FastSAM/Semantic Fast Segment Anything, or Fast Semantic Segment Anything☆114Updated last week
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆85Updated 2 years ago
- Baby-DALL3: Annotation anything in visual tasks and Generate anything just all in one-pipeline with GPT-4 (a small baby of DALL·E 3).☆85Updated 2 years ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆225Updated last year
- Accelerate segment anything model inference using Tensorrt 8.6.1.6☆103Updated 2 years ago
- ☆38Updated 3 years ago
- 2nd place solution to Google Universal Image Embedding Challenge!☆43Updated 3 years ago
- Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.☆78Updated last month
- ☆76Updated 3 years ago
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space☆92Updated this week
- A Siamese self-supervised pretraining approach for the Transformer architecture in DETR☆37Updated 2 years ago
- [CVPR2022] "Progressive End-to-End Object Detection in Crowded Scenes" on Deformable-DETR.☆32Updated 3 years ago
- CounTR: Transformer-based Generalised Visual Counting☆119Updated last year
- ☆54Updated 3 years ago
- Combining "segment-anything" with MOT, it creates the era of "MOTS"☆155Updated 2 years ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆172Updated last month
- [WACV 2026] Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆44Updated last month
- ☆24Updated last year
- [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design☆220Updated 2 years ago
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆72Updated 2 years ago