niki-amini-naieni / CountVidLinks
Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.
☆79Updated 3 months ago
Alternatives and similar repositories for CountVid
Users that are interested in CountVid are comparing it to the libraries listed below
Sorting:
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆94Updated 7 months ago
- Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆41Updated 4 months ago
- Official code for "No time to train! Training-Free Reference-Based Instance Segmentation"☆252Updated 2 months ago
- Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning☆126Updated 4 months ago
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space☆90Updated 9 months ago
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"☆109Updated 4 months ago
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes☆90Updated 5 months ago
- YOLO-UniOW: Efficient Universal Open-World Object Detection☆163Updated 9 months ago
- ☆142Updated 6 months ago
- YOLO-World + EfficientViT SAM☆106Updated last year
- Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)☆599Updated this week
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆103Updated last year
- ☆94Updated last year
- The Missing Point in Vision Transformers for Universal Image Segmentation☆55Updated 5 months ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆261Updated 6 months ago
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…☆465Updated 2 months ago
- CounTR: Transformer-based Generalised Visual Counting☆118Updated last year
- ☆52Updated last year
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆169Updated 2 weeks ago
- Official code for NetTrack [CVPR 2024]☆106Updated last year
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆63Updated last year
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25)☆49Updated 10 months ago
- [ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Obj…☆68Updated 3 months ago
- ☆69Updated last year
- ☆50Updated 3 months ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆87Updated last year
- 🏄 [ICLR 2025] OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer☆78Updated 2 months ago
- Implementation of paper - DEYO: DETR with YOLO for End-to-End Object Detection☆96Updated last year
- This repo aims to include materials (papers, codes, slides) about SAM2 (segment anything in images and videos). We are continuously impro…☆116Updated last month
- This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point …☆173Updated 2 years ago