fkryan / gazelleLinks
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆781Updated 6 months ago
Alternatives and similar repositories for gazelle
Users that are interested in gazelle are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆606Updated 5 months ago
- YOLOE: Real-Time Seeing Anything [ICCV 2025]☆1,821Updated 3 months ago
- Efficient Track Anything☆654Updated 9 months ago
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]☆795Updated 4 months ago
- ☆219Updated 6 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,241Updated 2 months ago
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆1,044Updated 8 months ago
- [ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆1,358Updated 3 months ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,346Updated 5 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]☆740Updated 4 months ago
- RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.☆446Updated 7 months ago
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,680Updated last month
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction☆433Updated last year
- MiVOLO age & gender transformer neural network☆437Updated last month
- [CVPRW'24] SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)☆339Updated 5 months ago
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆728Updated last year
- 👀 | MobileGaze: Real-Time Gaze Estimation models using ResNet 18/34/50, MobileNet v2 and MobileOne s0-s4 | In PyTorch >> ONNX Runtime I…☆122Updated last month
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆164Updated 3 weeks ago
- Real-time pose estimation pipeline with 🤗 Transformers☆64Updated 8 months ago
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆162Updated 2 months ago
- RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tun…☆3,582Updated 2 weeks ago
- Python scripts for the Segment Anythin 2 (SAM2) model in ONNX☆269Updated last year
- Frontier Multimodal Foundation Models for Image and Video Understanding☆1,004Updated 2 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆421Updated 2 weeks ago
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆330Updated 3 weeks ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)☆360Updated last year
- The official PyTorch implementation of L2CS-Net for gaze estimation and tracking☆427Updated last year
- [ ICCV 2025 ] FaceXFormer: A Unified Transformer for Facial Analysis☆295Updated last month
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆476Updated 7 months ago
- [ICCV 2023] Tracking Anything with Decoupled Video Segmentation☆1,436Updated 5 months ago