aimagelab/ScanDiff

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aimagelab/ScanDiff)

aimagelab / ScanDiff

This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV 2025

☆27

Alternatives and similar repositories for ScanDiff

Users that are interested in ScanDiff are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cvlab-stonybrook / few-shot-scanpath
View on GitHub
☆16Oct 25, 2025Updated 8 months ago
aimagelab / ReT-2
View on GitHub
Recurrence Meets Transformers for Universal Multimodal Retrieval
☆15Dec 15, 2025Updated 7 months ago
chenxy99 / GazeXplain
View on GitHub
[ECCV 2024 Oral] GazeXplain - Official PyTorch Implementation
☆17Feb 24, 2025Updated last year
phuselab / CLE
View on GitHub
Constrained Levy Exploration (CLE) generates a scanpath computing eye movements as Levy flight on a saliency map.
☆18Aug 31, 2022Updated 3 years ago
aimagelab / DiCO
View on GitHub
[BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization
☆20Sep 11, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
cvlab-stonybrook / Gazeformer
View on GitHub
Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)
☆44Updated this week
aimagelab / DICE
View on GitHub
[ICCV 2025] What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
☆15Nov 3, 2025Updated 8 months ago
UARK-AICV / CTScanGaze
View on GitHub
☆19Dec 4, 2025Updated 7 months ago
aimagelab / CoDE
View on GitHub
[ECCV'24] Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities
☆52Jul 2, 2025Updated last year
aimagelab / VHS
View on GitHub
[CVPR2026 Findings] VHS: Verifier on Hidden States, an efficient inference-time scaling verification framework for DiT-based image genera…
☆16Mar 25, 2026Updated 3 months ago
VLR-CVC / vlm-training
View on GitHub
large scale pre-training VLMs
☆25Jul 6, 2026Updated 2 weeks ago
aimagelab / pacscore
View on GitHub
[CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
☆66Jul 29, 2025Updated 11 months ago
facebookresearch / projectaria_eyetracking
View on GitHub
Project Aria Social Eye Tracking Model
☆67Jan 12, 2026Updated 6 months ago
EIT-NLP / Layer_Select_Fuse_for_MLLM
View on GitHub
[CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…
☆49Oct 29, 2025Updated 8 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
dhg-wei / MCL
View on GitHub
(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
☆28Sep 27, 2024Updated last year
EricDengbowen / QAGNet
View on GitHub
Official repository for CVPR 2024 paper "Advancing Saliency Ranking with Human Fixations: Dataset, Models and Benchmarks".
☆21Jun 21, 2024Updated 2 years ago
aimagelab / HySAC
View on GitHub
Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025
☆31Apr 8, 2025Updated last year
olga-zats / DIFF_MANTA
View on GitHub
[CVPR 2025] MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Anticipation
☆27Jun 13, 2025Updated last year
David-Ef / salient360Toolbox
View on GitHub
Toolbox for processing, visualising, comparing and generating data related to gaze in 360 contexts (VR notably)
☆28Jun 12, 2025Updated last year
ykotseruba / pySTAR-FC
View on GitHub
Python implementation of STAR-FC saccade generator
☆16Aug 31, 2024Updated last year
xiangjieSui / ScanDMM
View on GitHub
[2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images
☆23May 24, 2023Updated 3 years ago
nikhilchandak / answer-matching
View on GitHub
Code for 'Answer Matching Outperforms Multiple Choice for Language Model Evaluation' paper
☆18Jul 4, 2025Updated last year
roy402 / VSGM
View on GitHub
Enhance robot task understanding ability through visual semantic graph
☆10May 20, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
anirgalgali / residual-dynamics
View on GitHub
Code for Galgali et al, 2023
☆14Jan 11, 2023Updated 3 years ago
coveooss / ecommerce-query-embeddings
View on GitHub
☆12Mar 24, 2021Updated 5 years ago
Singularity0104 / NExT-Vid
View on GitHub
Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
☆22Dec 24, 2025Updated 6 months ago
LorenzoGianassi / Land-Diffuser
View on GitHub
The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…
☆13Dec 23, 2023Updated 2 years ago
HiLab-git / VLM-CPL
View on GitHub
☆23Aug 14, 2025Updated 11 months ago
lorebianchi98 / Talk2DINO
View on GitHub
[ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabular…
☆193Nov 10, 2025Updated 8 months ago
alrojo / biRNN-CRF
View on GitHub
Researching the forward-backward algorithm
☆11Aug 3, 2018Updated 7 years ago
dimipapa / cookingprograms
View on GitHub
Code and data for "Learning Program Representations for Food Images and Cooking Recipes" (oral at CVPR 2022)
☆15Mar 30, 2022Updated 4 years ago
adithyaprem / Hierarchical-Image-Matting-Model-for-Blood-Vessel-Segmentation-in-Fundus-images
View on GitHub
This is a python implementation of Hierarchical Image Matting Model for Segmentation.
☆11Jun 21, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
aimagelab / COGT
View on GitHub
[ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding
☆10Apr 15, 2025Updated last year
jlin816 / rewards-from-language
View on GitHub
Code and data for "Inferring Rewards from Language in Context" [ACL 2022].
☆16May 22, 2022Updated 4 years ago
qim-center / qim3d
View on GitHub
Python library for volumetric data
☆16Updated this week
marukosan93 / ORPDAD
View on GitHub
This is the official code repository of our dataset and ECCV 2024 paper entitled "Oulu Remote-photoplethysmography Physical Domain Attac…
☆14Jul 9, 2025Updated last year
LorenzoAgnolucci / Keyframes-GAN
View on GitHub
[IEEE TMM 2023] This is the official repo of the paper "Perceptual Quality Improvement in Videoconferencing using Keyframes-based GAN".
☆17Dec 10, 2024Updated last year
yixchen / YouRefIt_ERU
View on GitHub
☆20Jul 5, 2023Updated 3 years ago
ExplainableML / flair
View on GitHub
[CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations
☆147Mar 12, 2026Updated 4 months ago