☆17Feb 20, 2025Updated last year
Alternatives and similar repositories for tppgaze
Users that are interested in tppgaze are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆25Dec 4, 2025Updated 3 months ago
- [BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization☆20Sep 11, 2024Updated last year
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆63May 9, 2025Updated 10 months ago
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering☆54Jul 14, 2025Updated 8 months ago
- [CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval☆34Sep 12, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Constrained Levy Exploration (CLE) generates a scanpath computing eye movements as Levy flight on a saliency map.☆18Aug 31, 2022Updated 3 years ago
- [ECCV'24] Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities☆52Jul 2, 2025Updated 8 months ago
- Project Aria Social Eye Tracking Model☆60Jan 12, 2026Updated 2 months ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆67Jul 29, 2025Updated 7 months ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024☆67Aug 10, 2024Updated last year
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025☆30Apr 8, 2025Updated 11 months ago
- [ICLR 2026] "Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals"☆35Mar 6, 2026Updated 2 weeks ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆43Mar 15, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is the official repository for the paper "Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing".☆30Mar 28, 2024Updated last year
- [IJCAI 2025] Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives☆32Nov 25, 2025Updated 4 months ago
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆16Jul 1, 2025Updated 8 months ago
- Robust Single Sample Face Recognition by Sparsity-Driven Sub-Dictionary Learning Using Deep Features☆18Sep 8, 2020Updated 5 years ago
- Categorization for Eye Tracking - simplified☆32Jun 19, 2025Updated 9 months ago
- ☆13Dec 12, 2022Updated 3 years ago
- CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"☆23Jun 25, 2025Updated 9 months ago
- Python implementation of STAR-FC saccade generator☆16Aug 31, 2024Updated last year
- Visual Search in Natural Scenes benchmark☆19Sep 19, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆32Oct 21, 2025Updated 5 months ago
- the indexer and search engine for irchiver, see https://irchiver.com for license and other information☆14Dec 2, 2021Updated 4 years ago
- [ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning☆160Aug 8, 2025Updated 7 months ago
- Barcode Scanning for MAUI?☆10Dec 9, 2022Updated 3 years ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆23May 24, 2023Updated 2 years ago
- [WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆91Aug 23, 2025Updated 7 months ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago
- This is the reference implementation of our NeurIPS 2023 paper "Add and Thin: Diffusion for Temporal Point Processes"☆22Mar 4, 2024Updated 2 years ago
- Materials for a training workshop on 'Introduction to Julia for Computational Science'☆12Jul 9, 2025Updated 8 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- PyTorch code for BMVC 2019 paper: Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters☆20Jan 4, 2023Updated 3 years ago
- Code and model for the paper "ScanGAN360: A Generative Model of Realistic Scanpaths for 360º Images"☆32Aug 16, 2022Updated 3 years ago
- PyTorch code for the paper: "Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation"☆19Aug 5, 2021Updated 4 years ago
- ☆20Dec 12, 2022Updated 3 years ago
- A simplistic web app for annotating emotions in human speech video recordings.☆28Oct 13, 2014Updated 11 years ago
- A simple android/ Diaspora-Webclient☆47Jun 13, 2014Updated 11 years ago
- [TIP2025] The implementation of "Uncertainty Guided Refinement for Fine-grained Salient Object Detection"☆17Apr 20, 2025Updated 11 months ago