fkryan / gazelleLinks
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆806Updated 8 months ago
Alternatives and similar repositories for gazelle
Users that are interested in gazelle are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆842Updated last week
- ☆261Updated 8 months ago
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]☆823Updated 6 months ago
- YOLOE: Real-Time Seeing Anything [ICCV 2025]☆1,948Updated 5 months ago
- Efficient Track Anything☆752Updated 11 months ago
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction☆439Updated last year
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,357Updated 7 months ago
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"☆671Updated 8 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,308Updated 4 months ago
- RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.☆483Updated 2 weeks ago
- [ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆1,431Updated 5 months ago
- Tracking Any Point (TAP)☆1,758Updated 2 months ago
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆1,070Updated 11 months ago
- MobileGaze: Real-Time Gaze Estimation models using ResNet 18/34/50, MobileNet v2 and MobileOne s0-s4 | In PyTorch >> ONNX Runtime Infere…☆151Updated last week
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆768Updated last year
- [IROS24] Offical Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Inegrated into Nerfstudio☆321Updated 6 months ago
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,826Updated 3 months ago
- [CVPR 2025] Code for Segment Any Motion in Videos☆448Updated 6 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]☆741Updated 6 months ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆173Updated 2 months ago
- [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos☆1,620Updated 2 months ago
- Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.☆243Updated 5 months ago
- MiVOLO age & gender transformer neural network☆451Updated 3 weeks ago
- SynthMoCap Datasets☆469Updated 5 months ago
- Muggled SAM: Segmentation without the magic☆177Updated this week
- The official PyTorch implementation of L2CS-Net for gaze estimation and tracking☆450Updated last year
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)☆534Updated last year
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms☆2,185Updated last week
- Real-time pose estimation pipeline with 🤗 Transformers☆65Updated 10 months ago
- The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints …☆2,260Updated this week