SebastianJanampa / DETRPoseLinks
DETRPose: Real-time end-to-end transformer model for multi-person pose estimation
☆26Updated last month
Alternatives and similar repositories for DETRPose
Users that are interested in DETRPose are comparing it to the libraries listed below
Sorting:
- CAVIS: Context-Aware Video Instance Segmentation☆88Updated last week
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆58Updated 5 months ago
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"☆101Updated last month
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆31Updated last year
- ☆29Updated 4 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆40Updated 6 months ago
- Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]☆55Updated 4 months ago
- Official Code for: "DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency"☆24Updated 3 months ago
- Focusing on Tracks for Online Multi-Object Tracking☆62Updated last week
- [NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception☆43Updated last year
- [T-PAMI 2025] EMOv2: Pushing 5M Vision Model Frontier☆46Updated 7 months ago
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25)☆47Updated 8 months ago
- ☆30Updated 2 years ago
- Official Code for Tracking Any Object Amodally☆118Updated last year
- ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection (CVPR2023)☆52Updated last year
- Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.☆64Updated 3 weeks ago
- ☆193Updated 2 months ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation☆68Updated 2 weeks ago
- The Missing Point in Vision Transformers for Universal Image Segmentation☆49Updated 2 months ago
- 🏄 [ICLR 2025] OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer☆70Updated last week
- This repository is for the first survey on SAM & SAM2 for Videos.☆52Updated 3 months ago
- EfficientViT is a new family of vision models for efficient high-resolution vision.☆26Updated last year
- [CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation☆32Updated 9 months ago
- MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"☆20Updated last year
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆27Updated 11 months ago
- ☆189Updated 2 months ago
- Official PyTorch implementation for TCSVT 23 "Detect Any Shadow: Segment Anything for Video Shadow Detection"☆61Updated 8 months ago
- ☆30Updated 6 months ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆18Updated last year
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆45Updated this week