qubvel / rt-pose
Real-time pose estimation pipeline with 🤗 Transformers
★49 · Updated last month
Alternatives and similar repositories for rt-pose:
Users who are interested in rt-pose are comparing it to the libraries listed below.
- Inference and fine-tuning examples for vision models from 🤗 Transformers ★70 · Updated this week
- EdgeSAM model for use with Autodistill. ★26 · Updated 9 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ★116 · Updated 7 months ago
- An Open-Source Annotated Thermal Human Pose Dataset ★16 · Updated 5 months ago
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking" ★49 · Updated this week
- ★38 · Updated last month
- ★39 · Updated 3 months ago
- ★36 · Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ★68 · Updated last week
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ★62 · Updated 7 months ago
- ★57 · Updated 3 months ago
- The official repository of the RePoGen paper ★47 · Updated 8 months ago
- Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation ★49 · Updated 2 weeks ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ★41 · Updated 6 months ago
- ★32 · Updated 3 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ★35 · Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ★62 · Updated 5 months ago
- ★60 · Updated last year
- A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimation ★47 · Updated 5 months ago
- Edge Weight Prediction For Category-Agnostic Pose Estimation ★41 · Updated 3 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ★55 · Updated last year
- Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ★84 · Updated this week
- Create topological graph for image segments. ★21 · Updated 5 months ago
- ★66 · Updated 2 months ago
- Official Code for Tracking Any Object Amodally ★116 · Updated 8 months ago
- Eye exploration ★25 · Updated last month
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ★66 · Updated last year
- Dataset and Code for the paper "AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements" ★37 · Updated 9 months ago
- This is the official repo for the implementation of Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera (AAAI 2025). ★20 · Updated 5 months ago
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models ★115 · Updated last week