This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models"
☆24Apr 27, 2025Updated last year
Alternatives and similar repositories for IllusionVQA
Users that are interested in IllusionVQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- Unofficial implementation of Meta's MovieGen models☆16Nov 25, 2025Updated 6 months ago
- MNI152NLin2009cAsym☆14Jan 6, 2026Updated 4 months ago
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or…☆168Sep 27, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models☆20Feb 20, 2025Updated last year
- Production-ready Supabase self-hosting with Docker Compose, Swarm & Portainer. Complete wiki documentation, automated setup scripts, and …☆39Oct 5, 2025Updated 7 months ago
- Code Repository for Computer Graphics Theory and Sessional!☆16Oct 17, 2022Updated 3 years ago
- 3D-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors☆13Jun 19, 2025Updated 11 months ago
- Codebase for EnterpriseOps-Gym from ServiceNow☆91May 8, 2026Updated 3 weeks ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- CUDA, CuDNN, NVIDIA Driver, and PyTorch Installation for Ubuntu☆12Feb 27, 2025Updated last year
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year
- ☆22Sep 16, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated last year
- ☆18May 9, 2024Updated 2 years ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆13Jun 24, 2024Updated last year
- PyTorch implementation of the paper "SR-ITM-GAN: Learning 4k UHD HDR with a Generative Adversarial Network“ published at IEEE ACCESS☆15Nov 17, 2020Updated 5 years ago
- Official implementation of the paper "What Makes for a Good Stereoscopic Image" CVPRW 2025☆19May 27, 2025Updated last year
- ☆14Apr 7, 2025Updated last year
- ☆10Jan 11, 2024Updated 2 years ago
- Temporal-controlled Frame Swap for Generating High-Fidelity Stereo Driving Data for Autonomy Analysis (BMVC2023)☆12Jun 9, 2023Updated 2 years ago
- SAVL: Scene-Adaptive UAV Visual Localization Using Sparse Feature Extraction and Incremental Descriptor Mapping☆15Apr 26, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆21Jan 5, 2026Updated 4 months ago
- PyTorch Implementation of AutoPruner☆23Jun 11, 2020Updated 5 years ago
- ☆33Feb 17, 2026Updated 3 months ago
- A Python implementation of Adobe's Creative Cloud Lightroom API☆13Apr 15, 2025Updated last year
- ☆29Sep 2, 2025Updated 8 months ago
- [SC2023] POMELO: Fine-grained Population Mapping from Coarse Census Counts and Open Geodata☆13Aug 5, 2024Updated last year
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Jan 13, 2022Updated 4 years ago
- Train and finutune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated last year
- About This repository is a curated collection of the most exciting and influential CVPR 2026 papers. 🔥 [Paper + Code + Demo]☆252Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official repository of "A Hitchhiker's Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning" published in NeurIPS'20…☆12Feb 7, 2025Updated last year
- [CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning☆34May 25, 2025Updated last year
- ☆12Jan 6, 2025Updated last year
- Official implementation of the paper "Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance" (WACV 2025)☆17Mar 5, 2025Updated last year
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- Refactor your code with local LLM in VSCode☆13Mar 14, 2024Updated 2 years ago
- PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images☆16Dec 4, 2024Updated last year