WesleyHsieh0806 / Amodal-Expander
Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally
☆14Updated 6 months ago
Alternatives and similar repositories for Amodal-Expander:
Users that are interested in Amodal-Expander are comparing it to the libraries listed below
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆59Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated 9 months ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆67Updated 2 years ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆33Updated 3 years ago
- ☆16Updated 7 months ago
- ☆29Updated last month
- ☆34Updated 11 months ago
- SAM-CLIP module for use with Autodistill.☆13Updated last year
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆24Updated last year
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆97Updated 8 months ago
- EfficientViT is a new family of vision models for efficient high-resolution vision.☆24Updated last year
- Detectron2 Toolbox and Benchmark for V3Det☆16Updated 7 months ago
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆83Updated last year
- ☆13Updated 3 years ago
- Auto Segmentation label generation with SAM (Segment Anything) + Grounding DINO☆16Updated last year
- MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"☆12Updated 6 months ago
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆32Updated 7 months ago
- Our public repo ranked 1st 🏆🏆 at MMSports2023 challenge on segmentation task☆16Updated last year
- EdgeSAM model for use with Autodistill.☆26Updated 7 months ago
- [CVPR2022] "Progressive End-to-End Object Detection in Crowded Scenes" on Deformable-DETR.☆30Updated 2 years ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation☆41Updated 7 months ago
- Official PyTorch code for HILA☆28Updated 2 years ago
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!☆17Updated last week
- arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs"☆14Updated last month
- Training with Product Digital Twins for AutoRetail Checkout☆17Updated last year
- ☆52Updated last year
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance☆76Updated 6 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆33Updated 6 months ago
- OLA-VLM: Elevating Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆45Updated last month
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Updated last year