fkryan / gazelleLinks
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆582Updated last month
Alternatives and similar repositories for gazelle
Users that are interested in gazelle are comparing it to the libraries listed below
Sorting:
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,076Updated 2 weeks ago
- Run Segment Anything Model 2 on a live video stream☆420Updated this week
- Efficient Track Anything☆553Updated 5 months ago
- Tracking Any Point (TAP)☆1,544Updated this week
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆956Updated 4 months ago
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆295Updated last month
- YOLOE: Real-Time Seeing Anything☆1,325Updated last month
- [ICCV 2023] Tracking Anything with Decoupled Video Segmentation☆1,389Updated last month
- ☆163Updated last month
- RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.☆369Updated 3 months ago
- [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos☆1,026Updated last month
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"☆589Updated last month
- WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild☆307Updated 3 weeks ago
- Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆133Updated last month
- [CVPR 2025] Code for Segment Any Motion in Videos☆358Updated 2 months ago
- Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆1,142Updated last month
- [CVPR25 Oral (Top 3.3%)] Official code for paper "Reconstructing Humans with a Biomechanically Accurate Skeleton".☆400Updated this week
- 👀 | MobileGaze: Real-Time Gaze Estimation models using ResNet 18/34/50, MobileNet v2 and MobileOne s0-s4 | In PyTorch >> ONNX Runtime I…☆81Updated last week
- Pytorch demo code and models for Multi-HMR☆338Updated 5 months ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,298Updated last month
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆688Updated 9 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆298Updated last month
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)☆489Updated 6 months ago
- [ECCV'24] Kalman-Inspired Feature Propagation for Video Face Super-Resolution☆434Updated 3 weeks ago
- [Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos☆325Updated 9 months ago
- [CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos☆1,320Updated last month
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆413Updated 2 months ago
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction☆416Updated 11 months ago
- ☆1,013Updated last month
- The official PyTorch implementation of L2CS-Net for gaze estimation and tracking☆394Updated last year