qubvel / rt-poseLinks
Real-time pose estimation pipeline with π€ Transformers
β61Updated 5 months ago
Alternatives and similar repositories for rt-pose
Users that are interested in rt-pose are comparing it to the libraries listed below
Sorting:
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.β125Updated 11 months ago
- Inference and fine-tuning examples for vision models from π€ Transformersβ154Updated 2 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.β64Updated 11 months ago
- β60Updated last year
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.β48Updated 10 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.β79Updated last week
- Eye explorationβ28Updated 5 months ago
- Official Code for Tracking Any Object Amodallyβ118Updated last year
- β40Updated 6 months ago
- β191Updated 3 months ago
- β78Updated 3 months ago
- EdgeSAM model for use with Autodistill.β27Updated last year
- This is the code of our paper "Video-Based Human Pose Regression via Decoupled Space-Time Aggregation".β128Updated 2 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editingβ69Updated last year
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmarkβ138Updated 3 months ago
- 6D Rotation Representation for Unconstrained Head Pose Estimationβ14Updated last year
- β41Updated 5 months ago
- An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.β27Updated 4 months ago
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within secondsβ132Updated last week
- SAM Annotaton Toolβ37Updated last year
- β43Updated last year
- Take your LLM to the optometrist.β32Updated last week
- Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imagβ¦β116Updated 2 years ago
- β45Updated 4 months ago
- SmolVLM2 Demoβ159Updated 3 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained modeβ¦β73Updated 9 months ago
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision modelsβ120Updated 2 months ago
- An SDK for Transformers + YOLO and other SSD family modelsβ63Updated 5 months ago
- Edge Weight Prediction For Category-Agnostic Pose Estimationβ42Updated last month
- Dataset and Code for CVSports at CVPR 2024 paper "AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements"β44Updated last year