qubvel / rt-poseLinks
Real-time pose estimation pipeline with π€ Transformers
β61Updated 6 months ago
Alternatives and similar repositories for rt-pose
Users that are interested in rt-pose are comparing it to the libraries listed below
Sorting:
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.β126Updated last year
- Inference and fine-tuning examples for vision models from π€ Transformersβ158Updated 3 months ago
- Official Code for Tracking Any Object Amodallyβ118Updated last year
- β24Updated 9 months ago
- β60Updated last year
- β78Updated 4 months ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmarkβ149Updated 3 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.β65Updated 11 months ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.β49Updated 10 months ago
- An Open-Source Annotated Thermal Human Pose Datasetβ20Updated 2 months ago
- EdgeSAM model for use with Autodistill.β27Updated last year
- β42Updated 6 months ago
- Eye explorationβ27Updated 5 months ago
- β201Updated 3 months ago
- An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.β28Updated 5 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.β84Updated last week
- Dataset and Code for CVSports at CVPR 2024 paper "AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements"β44Updated last year
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. π₯ [Paper + Code + Demo]β740Updated last month
- Repo for event-based binary image reconstruction.β33Updated last year
- Edge Weight Prediction For Category-Agnostic Pose Estimationβ43Updated 2 months ago
- β43Updated last year
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"β303Updated last month
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densiβ¦β21Updated last year
- β48Updated 4 months ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentationβ68Updated last week
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"β99Updated last month
- This is the code of our paper "Video-Based Human Pose Regression via Decoupled Space-Time Aggregation".β127Updated 3 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, modelβ¦β36Updated last year
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision modelsβ123Updated 2 weeks ago
- Official code of DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction (3DV 2025))β161Updated 6 months ago