fkryan / gazelleLinks
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆752Updated 3 months ago
Alternatives and similar repositories for gazelle
Users that are interested in gazelle are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆534Updated 3 months ago
- ☆204Updated 3 months ago
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]☆748Updated last month
- Efficient Track Anything☆610Updated 7 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,160Updated 2 weeks ago
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆1,000Updated 6 months ago
- [ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆1,303Updated last month
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,328Updated 3 months ago
- RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.☆402Updated 5 months ago
- code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction☆422Updated last year
- [ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"☆706Updated 11 months ago
- [IROS24] Offical Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Inegrated into Nerfstudio☆307Updated last month
- SynthMoCap Datasets☆456Updated last month
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆303Updated last month
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,489Updated this week
- Run Segment Anything Model 2 on a live video stream☆468Updated 2 months ago
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"☆632Updated 3 months ago
- [CVPR 2025] Code for Segment Any Motion in Videos☆398Updated 2 months ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆362Updated last month
- Tracking Any Point (TAP)☆1,619Updated 2 weeks ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆149Updated 3 months ago
- This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]☆736Updated 2 months ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆441Updated 4 months ago
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3☆265Updated 7 months ago
- SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction☆131Updated 2 weeks ago
- [CVPRW'24] SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap (CVPR24 - CVSports workshop)☆320Updated 3 months ago
- Frontier Multimodal Foundation Models for Image and Video Understanding☆927Updated 2 months ago
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)☆502Updated 8 months ago
- MiVOLO age & gender transformer neural network☆420Updated last year
- Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch☆147Updated 9 months ago