Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
☆824Apr 19, 2025Updated 10 months ago
Alternatives and similar repositories for gazelle
Users that are interested in gazelle are comparing it to the libraries listed below
Sorting:
- Sharingan: A Transformer Architecture for Multi-Person Gaze Following☆28Nov 11, 2024Updated last year
- [IEEE SPL] End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context☆76Mar 18, 2024Updated last year
- Text Behind Video. Enjoy it is completely free.☆31Feb 15, 2025Updated last year
- The official PyTorch implementation of L2CS-Net for gaze estimation and tracking☆473Feb 2, 2024Updated 2 years ago
- MobileGaze: Real-Time Gaze Estimation models using ResNet 18/34/50, MobileNet v2 and MobileOne s0-s4 | In PyTorch >> ONNX Runtime Infere…☆170Feb 14, 2026Updated 2 weeks ago
- Efficient Track Anything☆781Jan 6, 2025Updated last year
- The public reproducible analysis code used for the gaze project☆11Feb 21, 2026Updated 2 weeks ago
- Converting Google Maps Screenshot to 3D Model☆21Jun 12, 2025Updated 8 months ago
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"☆7,045Mar 18, 2025Updated 11 months ago
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆877Jan 27, 2026Updated last month
- Automatically extract documents from images and perspectively correct them with classic computer-vision algorithms. In maintenance mode. …☆86Aug 24, 2025Updated 6 months ago
- High-resolution models for human tasks.☆5,296Nov 18, 2024Updated last year
- Terminal on Browser☆28Jun 28, 2025Updated 8 months ago
- [CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos…☆1,411Sep 21, 2025Updated 5 months ago
- A experimental cli tool to encrypt & decrypt files/directories.☆35Dec 30, 2025Updated 2 months ago
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection☆6,227Feb 26, 2025Updated last year
- [ECCV 2024] 3DGazeNet: Generalizing Gaze Estimation with Weak-Supervision from Synthetic Views☆121Jan 21, 2025Updated last year
- [ECCV 2024 Oral 🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces ------------------------ [ICCVW 2025] ID-Consistent, Preci…☆786Oct 10, 2025Updated 4 months ago
- YouTube History Analyzer☆37Jan 31, 2026Updated last month
- Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset..☆197Apr 15, 2025Updated 10 months ago
- Cross-platform Search Engine and File Explorer for Multimedia☆31Feb 16, 2025Updated last year
- ☆51Apr 25, 2025Updated 10 months ago
- Pippo: High-Resolution Multi-View Humans from a Single Image☆632Apr 4, 2025Updated 11 months ago
- CVPR2025☆910May 14, 2025Updated 9 months ago
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"☆682Apr 21, 2025Updated 10 months ago
- Web app for reading and analyzing exported WhatsApp chat files with a clean, intuitive interface and powerful search and analytics☆36Dec 17, 2024Updated last year
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆367Sep 25, 2025Updated 5 months ago
- CoTracker is a model for tracking any point (pixel) on a video.☆4,840Jan 21, 2025Updated last year
- Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers"☆63Mar 3, 2025Updated last year
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,340Jul 23, 2025Updated 7 months ago
- [CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation☆1,492Nov 2, 2025Updated 4 months ago
- Tracking Any Point (TAP)☆1,804Jan 22, 2026Updated last month
- Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch☆155Feb 17, 2026Updated 2 weeks ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,364May 1, 2025Updated 10 months ago
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆46Sep 1, 2024Updated last year
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation☆261Jan 30, 2025Updated last year
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆3,771Nov 28, 2025Updated 3 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆37Oct 7, 2025Updated 5 months ago
- Stega Shade CLI is a user-friendly command-line interface tool designed for image-based steganography. With a focus on simplicity and sec…☆42Jul 19, 2025Updated 7 months ago