Shohruh72 / SixDRepNetLinks
6D Rotation Representation for Unconstrained Head Pose Estimation
☆14Updated last year
Alternatives and similar repositories for SixDRepNet
Users that are interested in SixDRepNet are comparing it to the libraries listed below
Sorting:
- The Facial Landmark Preprocessing Toolkit.☆14Updated 3 weeks ago
- ☆23Updated 8 months ago
- This Repository demostrates various examples using YOLO☆13Updated last year
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆47Updated 9 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆64Updated 10 months ago
- EdgeSAM model for use with Autodistill.☆27Updated last year
- Focusing on Tracks for Online Multi-Object Tracking☆31Updated last week
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆69Updated last year
- Edge Weight Prediction For Category-Agnostic Pose Estimation☆41Updated last month
- The official repository of the RePoGen paper☆48Updated this week
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Updated last year
- ☆46Updated last year
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densi…☆21Updated last year
- BoT-SORT + YOLOX implemented using only onnxruntime, Numpy and scipy, without cython_bbox and PyTorch. Fast human tracker. OSNet is not u…☆36Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆56Updated last year
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25)☆42Updated 6 months ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- ☆41Updated 4 months ago
- STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion☆52Updated 6 months ago
- Repo for event-based binary image reconstruction.☆33Updated last year
- ☆35Updated last year
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆125Updated 10 months ago
- The official implementation of the paper "ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations".☆40Updated 5 months ago
- Dataset and Code for CVSports at CVPR 2024 paper "AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements"☆44Updated last year
- An Open-Source Annotated Thermal Human Pose Dataset☆19Updated 3 weeks ago
- ☆74Updated 2 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆60Updated 4 months ago
- ☆74Updated 2 months ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 7 months ago
- ☆16Updated last year