☆16Feb 20, 2025Updated last year
Alternatives and similar repositories for tppgaze
Users that are interested in tppgaze are comparing it to the libraries listed below
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆24Dec 4, 2025Updated 3 months ago
- [BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization☆20Sep 11, 2024Updated last year
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆63May 9, 2025Updated 9 months ago
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering☆54Jul 14, 2025Updated 7 months ago
- [CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval☆34Sep 12, 2025Updated 5 months ago
- [ECCV'24] Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities☆52Jul 2, 2025Updated 8 months ago
- Project Aria Social Eye Tracking Model☆58Jan 12, 2026Updated last month
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆65Jul 29, 2025Updated 7 months ago
- Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024☆67Aug 10, 2024Updated last year
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆43Mar 15, 2024Updated last year
- CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"☆23Jun 25, 2025Updated 8 months ago
- Constrained Levy Exploration (CLE) generates a scanpath computing eye movements as Levy flight on a saliency map.☆18Aug 31, 2022Updated 3 years ago
- [IJCAI 2025] Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives☆29Nov 25, 2025Updated 3 months ago
- Official implementation of the paper "Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals"☆31Jun 16, 2025Updated 8 months ago
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025☆31Apr 8, 2025Updated 10 months ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆23May 24, 2023Updated 2 years ago
- Code and model for the paper "ScanGAN360: A Generative Model of Realistic Scanpaths for 360º Images"☆32Aug 16, 2022Updated 3 years ago
- ☆18Sep 23, 2025Updated 5 months ago
- This is the official repository for the paper "Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing".☆30Mar 28, 2024Updated last year
- Categorization for Eye Tracking - simplified☆32Jun 19, 2025Updated 8 months ago
- [TIP2025] The implementation of "Uncertainty Guided Refinement for Fine-grained Salient Object Detection"☆16Apr 20, 2025Updated 10 months ago
- [WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆89Aug 23, 2025Updated 6 months ago
- Barcode Scanning for MAUI?☆10Dec 9, 2022Updated 3 years ago
- Material of the European Summer School on Computational and Mathematical Modeling of Cognition☆11Jul 19, 2022Updated 3 years ago
- [KGC '24] This application is for visualisation of Knowledge Graphs. We employ a novel technique which uses an LLM-based agent for triple e…☆11Apr 17, 2024Updated last year
- AbationGraph® is a time-series knowledge graph database for real-time data analysis☆16Nov 23, 2023Updated 2 years ago
- Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN (IJCAI 2017 and TPAMI)☆11Jan 17, 2019Updated 7 years ago
- An Introductory Jupyter Notebook to Manipulate Ontologies with Owlready2☆11Jan 10, 2020Updated 6 years ago
- Contains the code for reproducing the experiments and results of the paper "Neural Superstatistics: A Bayesian Method for Estimating Dyna…☆14Aug 18, 2023Updated 2 years ago
- ☆10Nov 2, 2023Updated 2 years ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆21Nov 25, 2024Updated last year
- Audio-Visual Perception of Omnidirectional Video for Virtual Reality Applications☆15Feb 22, 2023Updated 3 years ago
- ☆11May 24, 2024Updated last year
- Package to Train LANs (Likelihood approximation networks)☆13Feb 10, 2026Updated 3 weeks ago
- Source code of the paper "The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields".☆17Mar 3, 2025Updated last year
- ☆17Jul 24, 2025Updated 7 months ago
- Materials for a training workshop on 'Introduction to Julia for Computational Science'☆12Jul 9, 2025Updated 7 months ago