This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models"
☆24Apr 27, 2025Updated 11 months ago
Alternatives and similar repositories for IllusionVQA
Users that are interested in IllusionVQA are comparing it to the libraries listed below.
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- Unofficial implementation of Meta's MovieGen models☆16Nov 25, 2025Updated 4 months ago
- LR0.FM: Low-Resolution Zero-shot Classification Benchmark For Foundation Models☆16Aug 29, 2025Updated 7 months ago
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or…☆164Sep 27, 2025Updated 6 months ago
- OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models☆19Feb 20, 2025Updated last year
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…☆336Oct 14, 2025Updated 6 months ago
- Production-ready Supabase self-hosting with Docker Compose, Swarm & Portainer. Complete wiki documentation, automated setup scripts, and …☆38Oct 5, 2025Updated 6 months ago
- 3D-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors☆11Jun 19, 2025Updated 10 months ago
- Codebase for EnterpriseOps-Gym from ServiceNow☆81Mar 25, 2026Updated 3 weeks ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- CUDA, CuDNN, NVIDIA Driver, and PyTorch Installation for Ubuntu☆12Feb 27, 2025Updated last year
- Codebase for the paper HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year
- ☆22Sep 16, 2025Updated 7 months ago
- Code for "Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement"☆11Apr 18, 2024Updated 2 years ago
- hydra-pl-wandb-sample-project is NN experiment management code using hydra, pytorch-lightning, and wandb.☆11Nov 22, 2021Updated 4 years ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated last year
- Siggraph 2025 Journal track☆26Aug 13, 2025Updated 8 months ago
- ☆18May 9, 2024Updated last year
- Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN (IJCAI 2017 and TPAMI)☆11Jan 17, 2019Updated 7 years ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆13Jun 24, 2024Updated last year
- PyTorch implementation of the paper "SR-ITM-GAN: Learning 4k UHD HDR with a Generative Adversarial Network" published at IEEE ACCESS☆14Nov 17, 2020Updated 5 years ago
- Official implementation of the paper "What Makes for a Good Stereoscopic Image" CVPRW 2025☆19May 27, 2025Updated 10 months ago
- ☆14Apr 7, 2025Updated last year
- SAVL: Scene-Adaptive UAV Visual Localization Using Sparse Feature Extraction and Incremental Descriptor Mapping☆14Aug 6, 2025Updated 8 months ago
- [SC2023] POMELO: Fine-grained Population Mapping from Coarse Census Counts and Open Geodata☆13Aug 5, 2024Updated last year
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Jan 13, 2022Updated 4 years ago
- Train and fine-tune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated last year
- Info about me☆14Jul 28, 2020Updated 5 years ago
- [CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning☆33May 25, 2025Updated 10 months ago
- ☆12Jan 6, 2025Updated last year
- Implementing Meta AI's VGGT as a FiftyOne Remote Zoo Model☆20Jun 20, 2025Updated 9 months ago
- Refactor your code with local LLM in VSCode☆13Mar 14, 2024Updated 2 years ago
- PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images☆16Dec 4, 2024Updated last year
- Codebase for SIGNET: Efficient Neural Representations for Light Fields☆15Jul 27, 2023Updated 2 years ago
- Implementation of VIINTER: View Interpolation With Implicit Neural Representations of Images☆12Nov 20, 2022Updated 3 years ago
- Shortest versions of python script for image processing with OpenCV☆13Jun 16, 2022Updated 3 years ago
- Dataset for identifying potential hates (e.g., political, religious, personal, gender abusive, geopolitical, etc.) for under-resourced Be…☆29Apr 14, 2022Updated 4 years ago
- PyTorch block-diagonal ODE CUDA solver, designed for gradient-based optimization☆16Apr 27, 2020Updated 5 years ago
- WorldModel is a MaskGIT model trained on 8x8x8 Minecraft voxel volumes. Beyond generating blocks from scratch, it excels in filling space…☆14Sep 12, 2023Updated 2 years ago