fkryan / gazelle
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025)
☆516Updated last week
Alternatives and similar repositories for gazelle:
Users who are interested in gazelle are comparing it to the repositories listed below:
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆910Updated last month
- WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild☆247Updated 2 weeks ago
- Efficient Track Anything☆489Updated 2 months ago
- Official PyTorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)☆467Updated 3 months ago
- Run Segment Anything Model 2 on a live video stream☆316Updated last month
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆911Updated last month
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆258Updated 3 months ago
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆632Updated 6 months ago
- DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos☆1,196Updated 2 weeks ago
- Minimal code and examples for running inference with Sapiens foundation human models in PyTorch☆132Updated 4 months ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,230Updated 4 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆172Updated this week
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆454Updated 3 months ago
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction☆629Updated last month
- [ECCV'24] Kalman-Inspired Feature Propagation for Video Face Super-Resolution☆341Updated 6 months ago
- [ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces☆663Updated 3 months ago
- SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images☆727Updated last month
- [Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos☆303Updated 6 months ago
- [CVPR'25] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision☆784Updated 3 months ago
- Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset☆170Updated 2 months ago
- Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention"☆484Updated 7 months ago
- DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion☆1,210Updated 3 months ago
- [NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar☆428Updated 3 weeks ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)☆338Updated 6 months ago
- PyTorch demo code and models for Multi-HMR☆303Updated 2 months ago
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi …☆295Updated 3 months ago