CVHub520 / efficientvitLinks
EfficientViT is a new family of vision models for efficient high-resolution vision.
☆29Updated 2 years ago
Alternatives and similar repositories for efficientvit
Users that are interested in efficientvit are comparing it to the libraries listed below
Sorting:
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆85Updated 2 years ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated last year
- Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]☆60Updated 7 months ago
- Adobe-EntitySeg dataset☆42Updated 2 years ago
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆80Updated 5 months ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆19Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Updated last year
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆57Updated last year
- Zero-label image classification via OpenCLIP knowledge distillation☆136Updated 2 years ago
- SAM-CLIP module for use with Autodistill.☆15Updated last year
- ☆35Updated last year
- Timm model explorer☆42Updated last year
- Codebase for the Recognize Anything Model (RAM)☆85Updated last year
- ☆34Updated 4 months ago
- Add MobileSAM support for Inpaint anything using Segment Anything and inpainting models.☆54Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆18Updated last year
- Image Prompter for Gradio☆91Updated last year
- YOLO-World + EfficientViT SAM☆106Updated last year
- Official Code for Tracking Any Object Amodally☆120Updated last year
- Download flickr8k, flickr30k image caption datasets☆30Updated last year
- EdgeSAM model for use with Autodistill.☆29Updated last year
- Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts (CVPR 2024)☆100Updated 2 months ago
- NoisyNN: Exploring the impact of information entropy change in learning systems☆22Updated 10 months ago
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆15Updated last month
- VimTS: A Unified Video and Image Text Spotter☆78Updated 11 months ago
- Image/Instance Retrieval using CLIP, A self supervised Learning Model☆29Updated 2 years ago
- ☆13Updated 3 years ago
- Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]☆54Updated 3 years ago
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆32Updated last year
- ☆20Updated 2 years ago