fkryan / gazelle
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆570Updated last month
Alternatives and similar repositories for gazelle
Users that are interested in gazelle are comparing it to the libraries listed below
Sorting:
- Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆1,065Updated last week
- YOLOE: Real-Time Seeing Anything☆1,227Updated 2 weeks ago
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"☆563Updated 3 weeks ago
- WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild☆300Updated this week
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆678Updated 9 months ago
- ☆144Updated last month
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆943Updated 3 months ago
- 👀 | MobileGaze: Real-Time Gaze Estimation models using ResNet 18/34/50, MobileNet v2 and MobileOne s0-s4 | In PyTorch >> ONNX Runtime I…☆76Updated last month
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,071Updated last week
- [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos☆952Updated 3 weeks ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,043Updated 3 weeks ago
- [CVPR 2025] Code for Segment Any Motion in Videos☆341Updated last month
- [Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos☆318Updated 8 months ago
- Efficient Track Anything☆544Updated 4 months ago
- Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆124Updated last month
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆289Updated last week
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆263Updated 5 months ago
- [IROS24] Offical Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Inegrated into Nerfstudio☆296Updated 4 months ago
- The official PyTorch implementation of L2CS-Net for gaze estimation and tracking☆387Updated last year
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,273Updated 2 weeks ago
- Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset..☆183Updated last month
- Tracking Any Point (TAP)☆1,521Updated last week
- Pytorch demo code and models for Multi-HMR☆335Updated 5 months ago
- [CVPR25 Oral (Top 3.3%)] Official code for paper "Reconstructing Humans with a Biomechanically Accurate Skeleton".☆372Updated 3 weeks ago
- OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning☆214Updated this week
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)☆485Updated 5 months ago
- Easy and fast 2d human and animal multi pose estimation using SOTA ViTPose [Y. Xu et al., 2022] Real-time performances and multiple skele…☆184Updated 3 weeks ago
- Pippo: High-Resolution Multi-View Humans from a Single Image☆553Updated last month
- Run Segment Anything Model 2 on a live video stream☆387Updated 3 months ago
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆230Updated 2 weeks ago