phuselab/tppgaze

☆17

Alternatives and similar repositories for tppgaze

Users who are interested in tppgaze are comparing it to the libraries listed below.

aimagelab / DiCO
[BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization
☆20 · Sep 11, 2024 · Updated last year
aimagelab / awesome-human-visual-attention
This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…
☆66 · May 9, 2025 · Updated last year
aimagelab / ReflectiVA
[CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
☆56 · Jul 14, 2025 · Updated last year
aimagelab / ReT-2
Recurrence Meets Transformers for Universal Multimodal Retrieval
☆15 · Dec 15, 2025 · Updated 7 months ago
aimagelab / ReT
[CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
☆37 · Sep 12, 2025 · Updated 10 months ago
aimagelab / DICE
[ICCV 2025] What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
☆15 · Nov 3, 2025 · Updated 8 months ago
MrZilinXiao / AutoVER
[ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.
☆14 · Mar 2, 2024 · Updated 2 years ago
aimagelab / safe-clip
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024
☆67 · Aug 10, 2024 · Updated last year
aimagelab / awesome-captioning-evaluation
[IJCAI 2025] Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives
☆36 · Nov 25, 2025 · Updated 7 months ago
aimagelab / VHS
[CVPR2026 Findings] VHS: Verifier on Hidden States, an efficient inference-time scaling verification framework for DiT-based image genera…
☆16 · Mar 25, 2026 · Updated 3 months ago
facebookresearch / projectaria_eyetracking
Project Aria Social Eye Tracking Model
☆67 · Updated this week
chenxy99 / GazeXplain
[ECCV 2024 Oral] GazeXplain - Official PyTorch Implementation
☆17 · Feb 24, 2025 · Updated last year
VLR-CVC / vlm-training
large scale pre-training VLMs
☆25 · Jul 6, 2026 · Updated 2 weeks ago
cvlab-stonybrook / HAT
CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"
☆24 · Jun 25, 2025 · Updated last year
aimagelab / Ti-MGD
This is the official repository for the paper "Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing".
☆31 · Mar 28, 2024 · Updated 2 years ago
aimagelab / MissRAG
[ICCV 2025] MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models
☆26 · May 12, 2026 · Updated 2 months ago
EIT-NLP / Layer_Select_Fuse_for_MLLM
[CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…
☆49 · Oct 29, 2025 · Updated 8 months ago
aimagelab / MaPeT
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
☆16 · Jul 1, 2025 · Updated last year
Arhosseini77 / SUM
[WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling
☆100 · Aug 23, 2025 · Updated 11 months ago
aimagelab / HySAC
Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025
☆31 · Apr 8, 2025 · Updated last year
aimagelab / LoCoNav
☆13 · Dec 12, 2022 · Updated 3 years ago
msu-video-group / NTIRE26_Saliency_Prediction
CVPR-NTIRE 2026 Challenge on Video Saliency Prediction
☆17 · Mar 20, 2026 · Updated 4 months ago
ykotseruba / pySTAR-FC
Python implementation of STAR-FC saccade generator
☆16 · Aug 31, 2024 · Updated last year
nikhilchandak / answer-matching
Code for 'Answer Matching Outperforms Multiple Choice for Language Model Evaluation' paper
☆18 · Jul 4, 2025 · Updated last year
brownhci / irchiver-backend
the indexer and search engine for irchiver, see https://irchiver.com for license and other information
☆15 · Dec 2, 2021 · Updated 4 years ago
lorebianchi98 / Talk2DINO
[ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabular…
☆194 · Nov 10, 2025 · Updated 8 months ago
aimagelab / LLaVA-MORE
[ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
☆160 · Aug 8, 2025 · Updated 11 months ago
xiangjieSui / ScanDMM
[2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images
☆23 · May 24, 2023 · Updated 3 years ago
NeuroLIAA / visions
Visual Search in Natural Scenes benchmark
☆20 · Sep 19, 2024 · Updated last year
LorenzoGianassi / Land-Diffuser
The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…
☆13 · Dec 23, 2023 · Updated 2 years ago
francescotonini / object-aware-gaze-target-detection
Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)
☆45 · Dec 5, 2024 · Updated last year
Cambridge-ICCS / Summer-School-Julia-Tutorial
Materials for a training workshop on 'Introduction to Julia for Computational Science'
☆12 · Jul 9, 2025 · Updated last year
bucky2177 / dRiftDM
dRiftDM
☆15 · Jun 6, 2026 · Updated last month
aimagelab / DynamicConv-agent
PyTorch code for BMVC 2019 paper: Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters
☆20 · Jan 4, 2023 · Updated 3 years ago
kerenfu / RDVS
☆41 · Mar 23, 2026 · Updated 4 months ago
JanaJarecki / summer-school-computational-mathematical-modeling-of-cognition
Material of the European Summer School on Computational and Mathematical Modeling of Cognition
☆11 · Jul 19, 2022 · Updated 4 years ago
ganjiro / OfflineMania
[COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games"
☆12 · Jul 15, 2024 · Updated 2 years ago
aimagelab / VATr
☆89 · Mar 7, 2025 · Updated last year
lorebianchi98 / FG-OVD
[CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…
☆68 · Apr 4, 2025 · Updated last year