CVHub520 / efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
☆24Updated last year
Alternatives and similar repositories for efficientvit:
Users that are interested in efficientvit are comparing it to the libraries listed below
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated 11 months ago
- SAM-CLIP module for use with Autodistill.☆13Updated last year
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆83Updated last year
- Codebase for the Recognize Anything Model (RAM)☆73Updated last year
- Code of paper "A new baseline for edge detection: Make Encoder-Decoder great again"☆37Updated last month
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆34Updated last year
- ☆13Updated 3 years ago
- MobileSAM already integrated into Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds☆36Updated last year
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆58Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 6 months ago
- ☆63Updated last year
- Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts (CVPR 2024)☆74Updated 5 months ago
- ☆33Updated last year
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆34Updated last month
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆32Updated 9 months ago
- Distilling the powerful segment anything models into lightweight ones for efficient segmentation.☆29Updated last year
- Code for Learning to Zoom and Unzoom (CVPR 2023)☆47Updated last year
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆40Updated 5 months ago
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!☆40Updated last year
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆130Updated last year
- ☆31Updated 5 months ago
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance☆85Updated 7 months ago
- ☆32Updated last year
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆15Updated 8 months ago
- Vision-oriented multimodal AI☆49Updated 8 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year