Raojiyong / KITPose
[IJCV 2025] The project is an official implementation of our paper "Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose Estimation"
☆13Updated 4 months ago
Alternatives and similar repositories for KITPose
Users who are interested in KITPose are comparing it to the libraries listed below
- ☆36Updated 5 months ago
- A paper list of some recent Mamba-based CV works.☆363Updated this week
- ✨✨Latest Papers on Vision Mamba and Related Areas☆352Updated 3 months ago
- A continuously updated project to track the latest progress in the field of multi-modal object tracking. This project focuses solely on s…☆231Updated this week
- [CVPR25] Official implementation of "MobileMamba: Lightweight Multi-Receptive Visual Mamba Network."☆246Updated 4 months ago
- [CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Ref…☆202Updated 3 months ago
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆313Updated 5 months ago
- Vision Mamba: A Comprehensive Survey and Taxonomy☆95Updated 10 months ago
- [CVPR23] Visual Prompt Multi-Modal Tracking☆291Updated 4 months ago
- ☆32Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆75Updated 6 months ago
- [PRCV-2024] State Space Model based Frame-Event Tracking☆40Updated last week
- [CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels☆303Updated 2 weeks ago
- 🚀【CVPR 2024】Delving into the Trajectory Long-tail Distribution for Muti-object Tracking☆87Updated 3 weeks ago
- ☆26Updated 2 months ago
- [ACCV 2024 (Oral, Best Application Paper)] Official Implementation of NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tra…☆12Updated 5 months ago
- Implementation of the CVPR 2024 paper "A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation"☆19Updated 6 months ago
- LSNet: See Large, Focus Small [CVPR 2025]☆238Updated 3 months ago
- ☆111Updated last year
- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆21Updated last year
- ☆48Updated 2 months ago
- Contains the Mamba code and the corresponding explanatory videos on Bilibili☆90Updated last year
- An official repo for the ECCV 2024 paper "Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching"☆93Updated 5 months ago
- ☆123Updated 3 months ago
- 🦕 [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆168Updated 2 weeks ago
- This is a cross-modal benchmark for industrial anomaly detection.☆12Updated 2 months ago
- ☆27Updated last year
- [CVPR 2025] Multiple Object Tracking as ID Prediction☆311Updated last week
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition☆63Updated 6 months ago
- [arXiv 25] Official repo of "UEMM-Air: Enable UAVs to Undertake More Multi-modal Tasks"☆24Updated last month