fkryan / gazelleLinks
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆814Updated 9 months ago
Alternatives and similar repositories for gazelle
Users that are interested in gazelle are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆862Updated this week
- ☆268Updated 9 months ago
- Efficient Track Anything☆772Updated last year
- YOLOE: Real-Time Seeing Anything [ICCV 2025]☆2,011Updated 7 months ago
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction☆441Updated last year
- [ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆1,446Updated 7 months ago
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"☆683Updated 9 months ago
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]☆842Updated 7 months ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,361Updated 8 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,327Updated 6 months ago
- [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos☆1,702Updated 3 months ago
- Tracking Any Point (TAP)☆1,784Updated last week
- Run Segment Anything Model 2 on a live video stream☆562Updated 7 months ago
- [IROS24] Offical Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Inegrated into Nerfstudio☆320Updated 7 months ago
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆774Updated last year
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆1,081Updated last year
- RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.☆500Updated last month
- MobileGaze: Real-Time Gaze Estimation models using ResNet 18/34/50, MobileNet v2 and MobileOne s0-s4 | In PyTorch >> ONNX Runtime Infere…☆163Updated last week
- Real-time pose estimation pipeline with 🤗 Transformers☆66Updated 11 months ago
- [CVPR 2025] Code for Segment Any Motion in Videos☆455Updated 7 months ago
- UniFace: A Comprehensive Library for Face Analysis: Detection, Recognition, Landmark Analysis, Face Parsing, Gaze Estimation, Age, and Ge…☆525Updated this week
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆2,113Updated last week
- ScoreHMR: Score-Guided Diffusion for 3D Human Recovery (CVPR 2024)☆429Updated last year
- [CVPRW'24] SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)☆367Updated 8 months ago
- [CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos☆1,500Updated 2 months ago
- The official PyTorch implementation of L2CS-Net for gaze estimation and tracking☆466Updated last year
- MiVOLO age & gender transformer neural network☆461Updated 2 months ago
- SynthMoCap Datasets☆469Updated 6 months ago
- The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints …☆2,544Updated last month
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]☆742Updated 7 months ago