CVHub520 / efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
☆24 Updated last year
Alternatives and similar repositories for efficientvit:
Users interested in efficientvit compare it to the repositories listed below.
- Official PyTorch implementation of Self-emerging Token Labeling ☆33 Updated last year
- Official PyTorch implementation for "iFormer: Integrating ConvNet and Transformer for Mobile Application" [ICLR 2025] ☆42 Updated last month
- SAM-CLIP module for use with Autodistill. ☆15 Updated last year
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi… ☆84 Updated last year
- EfficientSAM + YOLO World base model for use with Autodistill. ☆10 Updated last year
- Codebase for the Recognize Anything Model (RAM) ☆78 Updated last year
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding" ☆32 Updated 11 months ago
- Official training and inference code of Amodal Expander, proposed in Tracking Any Object Amodally ☆17 Updated 9 months ago
- Code release for the CVPR'23 paper titled "PartDistillation: Learning Parts from Instance Segmentation" ☆58 Updated last year
- ☆13 Updated 3 years ago
- ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀 ☆21 Updated last year
- HunyuanDiT with TensorRT and libtorch ☆17 Updated 11 months ago
- Code for the paper "A new baseline for edge detection: Make Encoder-Decoder great again" ☆38 Updated this week
- Image/instance retrieval using CLIP, a self-supervised learning model ☆28 Updated last year
- ☆27 Updated 6 months ago
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models ☆68 Updated this week
- This repository is for the first survey on SAM & SAM2 for Videos. ☆47 Updated last week
- Auto segmentation label generation with SAM (Segment Anything) + Grounding DINO ☆19 Updated 2 months ago
- ☆34 Updated last year
- [CVPR2022] "Progressive End-to-End Object Detection in Crowded Scenes" on Deformable-DETR. ☆32 Updated 2 years ago
- ContourFormer: Real-Time Contour-Based End-to-End Instance Segmentation Transformer ☆9 Updated 3 weeks ago
- DEYOv1.5 ☆23 Updated 9 months ago
- ☆65 Updated last year
- EdgeSAM model for use with Autodistill. ☆26 Updated 10 months ago
- ☆48 Updated 2 years ago
- Zero-label image classification via OpenCLIP knowledge distillation ☆125 Updated last year
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024) ☆43 Updated 5 months ago
- Official implementation of OneNet ☆16 Updated 5 months ago
- ☆28 Updated 3 months ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. ☆53 Updated last year