Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"
☆15Jan 25, 2024Updated 2 years ago
Alternatives and similar repositories for GVIL
Users that are interested in GVIL are comparing it to the libraries listed below
Sorting:
- Code for EMNLP 2022 Paper DANLI: Deliberative Agent for Following Natural Language Instructions☆18May 1, 2025Updated 10 months ago
- High Resolution Image Quality (HRIQ) database and model☆13May 22, 2024Updated last year
- ☆16Oct 21, 2024Updated last year
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆21Mar 26, 2025Updated 11 months ago
- This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language M…☆24Apr 27, 2025Updated 10 months ago
- Instant Photorealistic Style Transfer☆29Feb 10, 2026Updated 3 weeks ago
- Official Code for Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency☆25Jun 10, 2025Updated 8 months ago
- This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribu…☆10Jun 14, 2018Updated 7 years ago
- ☆11Mar 11, 2024Updated last year
- ☆37Oct 21, 2022Updated 3 years ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated last year
- TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation (ECCV 2022)☆35Nov 12, 2024Updated last year
- Holistic evaluation of multimodal foundation models☆49Aug 11, 2024Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- Official repository for our ICLR 2021 paper Evaluating the Disentanglement of Deep Generative Models with Manifold Topology☆36Mar 22, 2021Updated 4 years ago
- The sparse Bayesian learning sandbox☆11Jul 4, 2021Updated 4 years ago
- Tensorflow implementation of the paper "Fast Compressive Sensing Using Generative Model with Structed Latent Variables"☆10Apr 7, 2020Updated 5 years ago
- [ACM MM 2025] LMM4Edit: Benchmarking and Evaluating Multimodal Image Editing with LMMs☆15Feb 10, 2026Updated 3 weeks ago
- [Advanced Photonics Research, 2021] Control tightly focused fields via manipulating pupil functions☆10Dec 25, 2024Updated last year
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆25Oct 20, 2025Updated 4 months ago
- [ACM MM 2024 (Oral)] Official PyTorch Implementation of Paper "MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement"☆11Dec 30, 2024Updated last year
- Official repo for StableLLAVA☆95Dec 22, 2023Updated 2 years ago
- [Neurips 24 Spotlight] Training in Pairs + Inference on Single Image with Anchors☆48Feb 20, 2025Updated last year
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆47Jun 3, 2025Updated 9 months ago
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated last year
- ☆12Jan 3, 2021Updated 5 years ago
- ☆11Jan 18, 2024Updated 2 years ago
- [COLM 2024] LITE: Modeling Environmental Ecosystems with Multimodal Large Language Models☆14Jan 4, 2025Updated last year
- Prediction of glycopeptide fragment mass spectra by deep learning☆10Feb 20, 2024Updated 2 years ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- ☆11Jan 3, 2023Updated 3 years ago
- Fastai+PyTorch implementation of sparse model training methods (SET, SNFS, RigL) + customize-your-own.☆10Oct 20, 2022Updated 3 years ago
- What Would Portland Do? Generative agent experience☆13Mar 13, 2024Updated last year
- A simple JS script to register desired course when slots are available, for UM-SJTU JI students.☆12May 9, 2022Updated 3 years ago
- ☆14Dec 27, 2023Updated 2 years ago
- The code of 'The devil is in the labels: Semantic segmentation from sentences'.☆13Nov 13, 2022Updated 3 years ago
- Self-supervised MPFNet for realistic bokeh effect rendering(JVCIR2022)☆14Jul 5, 2022Updated 3 years ago
- This is the home of the source code for Motion Vector Extrapolation (MOVEX).☆13Jan 3, 2022Updated 4 years ago
- ☆11Feb 28, 2024Updated 2 years ago