fkryan / gazelleLinks
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆765Updated 4 months ago
Alternatives and similar repositories for gazelle
Users that are interested in gazelle are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆565Updated 4 months ago
- ☆210Updated 4 months ago
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆1,020Updated 7 months ago
- Efficient Track Anything☆625Updated 7 months ago
- [ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆1,318Updated 2 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,188Updated last month
- YOLOE: Real-Time Seeing Anything [ICCV 2025]☆1,659Updated 2 months ago
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]☆768Updated 2 months ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,334Updated 4 months ago
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction☆427Updated last year
- Run Segment Anything Model 2 on a live video stream☆487Updated 3 months ago
- MiVOLO age & gender transformer neural network☆427Updated last week
- RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.☆418Updated 6 months ago
- [IROS24] Offical Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Inegrated into Nerfstudio☆308Updated 2 months ago
- [CVPRW'24] SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)☆327Updated 4 months ago
- 👀 | MobileGaze: Real-Time Gaze Estimation models using ResNet 18/34/50, MobileNet v2 and MobileOne s0-s4 | In PyTorch >> ONNX Runtime I…☆104Updated this week
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"☆636Updated 4 months ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆154Updated 4 months ago
- Real-time pose estimation pipeline with 🤗 Transformers☆62Updated 6 months ago
- Tracking Any Point (TAP)☆1,651Updated last month
- [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos☆1,345Updated this week
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆712Updated last year
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,542Updated last week
- Code repository for the paper "On the Benefits of 3D Pose and Tracking for Human Action Recognition", (CVPR 2023)☆278Updated last year
- [CVPR 2025] Code for Segment Any Motion in Videos☆404Updated 2 months ago
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)☆505Updated 9 months ago
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆309Updated 2 months ago
- WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild☆353Updated last month
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆266Updated 8 months ago
- Python scripts for the Segment Anythin 2 (SAM2) model in ONNX☆262Updated last year