mirlansmind / awesome-dinov2-extensions
This repo contains extensions to the DINOv2 model by Meta and awesome applications built on top of it.
☆38Updated last year
Related projects
Alternatives and complementary repositories for awesome-dinov2-extensions
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆57Updated 3 weeks ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆40Updated last month
- [CVPR2022] "Progressive End-to-End Object Detection in Crowded Scenes" on Deformable-DETR.☆28Updated 2 years ago
- ☆62Updated 11 months ago
- ☆23Updated 3 weeks ago
- 1st-place solution for the Webly-supervised Fine-grained Recognition competition at https://www.cvmart.net/race/10412/base☆33Updated last year
- Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts (CVPR 2024)☆65Updated last month
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆129Updated 11 months ago
- A CLI program for image retrieval using DINOv2☆59Updated last year
- Baby-DALL3: Annotate anything in visual tasks and generate anything, all in one pipeline, with GPT-4 (a small baby of DALL·E 3).☆82Updated last year
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆37Updated 4 months ago
- Open-vocabulary Semantic Segmentation☆34Updated 9 months ago
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes☆59Updated 2 months ago
- Distilling the powerful segment anything models into lightweight ones for efficient segmentation.☆29Updated last year
- DATE: Dual Assignment for End-to-End Fully Convolutional Object Detection☆41Updated last year
- MobileSAM already integrated into Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds☆34Updated last year
- ☆38Updated 2 years ago
- ☆83Updated 3 months ago
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆82Updated last year
- Recognize Any Regions☆118Updated last month
- [TPAMI2024 / ICME2023] Code for my paper "Body-Part Joint Detection and Association via Extended Object Representation"☆33Updated 10 months ago
- EfficientViT is a new family of vision models for efficient high-resolution vision.☆22Updated last year
- ☆10Updated 10 months ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆73Updated 7 months ago
- Combining "segment-anything" with MOT to create the era of "MOTS"☆146Updated last year
- YOLO-World + EfficientViT SAM☆76Updated 9 months ago
- Code for the paper "Visual Recognition by Request".☆44Updated 2 years ago
- Unofficial implementation of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆31Updated 2 years ago
- Zero-label image classification via OpenCLIP knowledge distillation☆114Updated last year