qubvel / rt-pose
Real-time pose estimation pipeline with 🤗 Transformers
★59 Updated 3 months ago
Alternatives and similar repositories for rt-pose
Users who are interested in rt-pose are comparing it to the libraries listed below.
- Inference and fine-tuning examples for vision models from 🤗 Transformers ★147 Updated last month
- ★23 Updated 7 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ★64 Updated 9 months ago
- Repo for event-based binary image reconstruction. ★32 Updated last year
- EdgeSAM model for use with Autodistill. ★26 Updated 11 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ★124 Updated 9 months ago
- Eye exploration ★28 Updated 3 months ago
- Official Code for Tracking Any Object Amodally ★118 Updated 10 months ago
- An integration of Segment Anything Model, Molmo, and Whisper to segment objects using voice and natural language. ★26 Updated 3 months ago
- An Open-Source Annotated Thermal Human Pose Dataset ★18 Updated this week
- Official PyTorch implementation of "6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry," ECCV 2024 ★86 Updated last month
- ★60 Updated last year
- Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ★132 Updated last month
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ★70 Updated this week
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ★69 Updated last year
- Edge Weight Prediction For Category-Agnostic Pose Estimation ★41 Updated last week
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ★36 Updated last year
- Minimal code and examples for inferencing Sapiens foundation human models in PyTorch ★139 Updated 7 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ★80 Updated last year
- ★34 Updated 2 weeks ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ★71 Updated 7 months ago
- An SDK for Transformers + YOLO and other SSD family models ★62 Updated 4 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ★12 Updated 10 months ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ★47 Updated 8 months ago
- ★37 Updated 2 months ago
- 6D Rotation Representation for Unconstrained Head Pose Estimation ★13 Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ★66 Updated last year
- ★163 Updated last month
- Using the moondream VLM with optical flow for promptable object tracking ★57 Updated 3 months ago
- PromptDepthAnything example ★138 Updated 4 months ago