fkryan / gazelle
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆790Updated 6 months ago
Alternatives and similar repositories for gazelle
Users who are interested in gazelle are comparing it to the libraries listed below.
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆616Updated 6 months ago
- ☆223Updated 6 months ago
- Efficient Track Anything☆658Updated 10 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]☆804Updated 4 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,272Updated 3 months ago
- [ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆1,380Updated 4 months ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,347Updated 6 months ago
- YOLOE: Real-Time Seeing Anything [ICCV 2025]☆1,876Updated 4 months ago
- 👀 | MobileGaze: Real-Time Gaze Estimation models using ResNet 18/34/50, MobileNet v2 and MobileOne s0-s4 | In PyTorch >> ONNX Runtime I…☆126Updated this week
- MiVOLO age & gender transformer neural network☆440Updated 2 months ago
- RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.☆457Updated this week
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆750Updated last year
- ☆27Updated last month
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆1,050Updated 9 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]☆742Updated 5 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆432Updated 2 weeks ago
- [CVPR 2025] Code for Segment Any Motion in Videos☆427Updated 4 months ago
- [ICCV 2023] Tracking Anything with Decoupled Video Segmentation☆1,443Updated 6 months ago
- SynthMoCap Datasets☆466Updated 3 months ago
- Tracking Any Point (TAP)☆1,724Updated 3 weeks ago
- Real-time pose estimation pipeline with 🤗 Transformers☆64Updated 9 months ago
- [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos☆1,525Updated last month
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆172Updated 3 weeks ago
- [CVPRW'24] SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)☆342Updated 6 months ago
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,718Updated last month
- The official PyTorch implementation of L2CS-Net for gaze estimation and tracking☆430Updated last year
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction☆435Updated last year
- [ ICCV 2025 ] FaceXFormer: A Unified Transformer for Facial Analysis☆302Updated 2 months ago
- Frontier Multimodal Foundation Models for Image and Video Understanding☆1,030Updated 2 months ago
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆267Updated 10 months ago