BingfengYan / VISAMLinks
Combining "segment-anything" with MOT, it create the era of "MOTS"
☆156Updated 2 years ago
Alternatives and similar repositories for VISAM
Users that are interested in VISAM are comparing it to the libraries listed below
Sorting:
- Implementation of Tracking Every Thing in the Wild, ECCV 2022☆96Updated last year
- Code for CMaskTrack R-CNN (proposed in Occluded Video Instance Segmentation)☆76Updated 3 years ago
- Official implementation of the paper "Progressive End-to-End Object Detection in Crowded Scenes"☆96Updated 3 years ago
- Video Mask Transfiner for High-Quality Video Instance Segmentation (ECCV'2022)☆30Updated 3 years ago
- DVIS: Decoupled Video Instance Segmentation Framework☆157Updated last year
- OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]☆110Updated last year
- Detection Transformers with Assignment☆264Updated 2 years ago
- [ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"☆236Updated last year
- [ICCV 2023] AlignDet: Aligning Pre-training and Fine-tuning in Object Detection.☆146Updated 2 years ago
- Recognize Any Regions☆122Updated last year
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆105Updated 2 years ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆89Updated last year
- [CVPR2022] "Progressive End-to-End Object Detection in Crowded Scenes" on Deformable-DETR.☆32Updated 3 years ago
- PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022☆169Updated 3 years ago
- ☆78Updated 2 years ago
- This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point …☆176Updated 2 years ago
- ☆63Updated 2 years ago
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Updated 2 years ago
- CO-MOT: Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object Tracking☆97Updated 3 weeks ago
- CounTR: Transformer-based Generalised Visual Counting☆120Updated last year
- [ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection☆99Updated 2 years ago
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆63Updated 2 years ago
- Baby-DALL3: Annotation anything in visual tasks and Generate anything just all in one-pipeline with GPT-4 (a small baby of DALL·E 3).☆85Updated 2 years ago
- using clip and sam to segment any instance you specify with text prompt of any instance names☆184Updated 2 years ago
- Learning Open-World Object Proposals without Learning to Classify☆203Updated 3 years ago
- [CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"☆208Updated 2 years ago
- A Siamese self-supervised pretraining approach for the Transformer architecture in DETR☆37Updated 2 years ago
- [ECCV2024] Official implementation of the paper "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dat…☆101Updated 10 months ago
- Associating Objects with Transformers for Video Object Segmentation☆145Updated last year
- ☆147Updated last year