fkryan / gazelle
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆561Updated last week
Alternatives and similar repositories for gazelle:
Users that are interested in gazelle are comparing it to the libraries listed below
- [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos☆878Updated last week
- [CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos☆1,279Updated last week
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"☆542Updated this week
- Efficient Track Anything☆527Updated 3 months ago
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆665Updated 8 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,005Updated this week
- SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images☆765Updated 2 months ago
- [CVPR 2025] Code for Segment Any Motion in Videos☆297Updated 3 weeks ago
- SynthMoCap Datasets☆439Updated last month
- Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆390Updated this week
- Tracking Any Point (TAP)☆1,474Updated last week
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models☆1,198Updated 3 weeks ago
- Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch☆138Updated 5 months ago
- YOLOE: Real-Time Seeing Anything☆1,155Updated 3 weeks ago
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆262Updated 4 months ago
- WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild☆282Updated last month
- [ECCV'24] Kalman-Inspired Feature Propagation for Video Face Super-Resolution☆345Updated 7 months ago
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms☆154Updated this week
- [CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation☆849Updated 5 months ago
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆934Updated 3 months ago
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction☆409Updated 10 months ago
- 🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos☆1,060Updated last week
- [CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation☆1,045Updated last week
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)☆480Updated 5 months ago
- ScoreHMR: Score-Guided Diffusion for 3D Human Recovery (CVPR 2024)☆420Updated 11 months ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,265Updated 5 months ago
- Pippo: High-Resolution Multi-View Humans from a Single Image☆517Updated 3 weeks ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆475Updated 4 months ago
- [CVPR 2025] Video Depth without Video Models☆483Updated last month
- Python scripts for the Segment Anythin 2 (SAM2) model in ONNX☆241Updated 7 months ago