anvo25/vlms-are-biased

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/anvo25/vlms-are-biased)

anvo25 / vlms-are-biased

Vision Language Models are Biased

☆114

Alternatives and similar repositories for vlms-are-biased

Users that are interested in vlms-are-biased are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DoubtedSteam / MM-GCoT
View on GitHub
The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"
☆22Jul 21, 2025Updated last year
VectorInstitute / pmc-data-extraction
View on GitHub
☆18Jul 13, 2026Updated last week
markendo / downscaling_intelligence
View on GitHub
Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models
☆25Mar 21, 2026Updated 4 months ago
ustc-hyin / HiMAP
View on GitHub
Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference
☆14Jun 7, 2025Updated last year
koaning / datasette-marimo
View on GitHub
Adding Marimo to Datasette
☆21Mar 24, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
allenai / aokvqa
View on GitHub
Official repository for the A-OKVQA dataset
☆117May 8, 2024Updated 2 years ago
alejandro-lozano-dev / open_clip_with_biomedica
View on GitHub
[CVPR 2025] Custom Open CLIP repo to train biomedical CLIP models
☆38Mar 23, 2025Updated last year
EIT-NLP / Layer_Select_Fuse_for_MLLM
View on GitHub
[CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…
☆49Oct 29, 2025Updated 8 months ago
anitarau / SurgBenchKit
View on GitHub
Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"
☆21Jun 2, 2025Updated last year
allenai / signal-and-noise
View on GitHub
Measuring the Signal to Noise Ratio in Language Model Evaluation
☆31Aug 19, 2025Updated 11 months ago
yuecao0119 / MMFuser
View on GitHub
The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …
☆63Nov 5, 2024Updated last year
harpreetsahota204 / CRADIOv4
View on GitHub
Implementing C-RADIOv4 as a Remote Source Zoo Model for FiftyOne
☆17Feb 4, 2026Updated 5 months ago
som-shahlab / med-nota
View on GitHub
☆15Jun 11, 2025Updated last year
s-sahoo / scaling-dllms
View on GitHub
[ICML 2026] Scaling Beyond Masked Diffusion Language Models
☆31Jul 3, 2026Updated 3 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WujiangXu / MemGym
View on GitHub
The code for paper "MemGym: a Long-Horizon Memory Environment for LLM Agents".
☆18Jun 2, 2026Updated last month
andy9705 / SumGD
View on GitHub
[NAACL 2025 Findings] Mitigating Hallucinations in Large Vision-Language Models via Summary-Guided Decoding
☆15Feb 12, 2026Updated 5 months ago
jaehunjung1 / impossible-distillation
View on GitHub
☆18Jul 3, 2024Updated 2 years ago
FreedomIntelligence / TRIM
View on GitHub
We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…
☆22Jan 11, 2026Updated 6 months ago
anguyen8 / face-vit
View on GitHub
☆26Apr 24, 2024Updated 2 years ago
Raphoo / linear-mech-vlms
View on GitHub
Code for "Linear Mechanisms for Spatiotemporal Reasoning in Vision Language Models"
☆15Feb 16, 2026Updated 5 months ago
wutaiqiang / MI
View on GitHub
Official code for paper "Revisiting Model Interpolation for Efficient Reasoning"
☆17Jul 14, 2026Updated last week
gordonhu608 / MQT-LLaVA
View on GitHub
[NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Models
☆126Jul 1, 2024Updated 2 years ago
Boyeep / Operating-System-2nd-Semester
View on GitHub
Operating Systems Semester 2 coursework covering Linux, shell scripting, process management, concurrency, and synchronization.
☆21Jun 11, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
KAIST-Visual-AI-Group / BezierFlow
View on GitHub
[ICLR 2026] Official code for BézierFlow: Learning Bézier Stochastic Interpolant Schedulers for Few-Step Generation
☆21Apr 13, 2026Updated 3 months ago
ZFancy / DivOE
View on GitHub
[NeurIPS 2023] "Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation"
☆11Oct 6, 2023Updated 2 years ago
jun297 / v1
View on GitHub
v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning
☆21Updated this week
PRIME-RL / RL-Compositionality
View on GitHub
FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆68Jan 26, 2026Updated 5 months ago
GeorgeCazenavette / linear-gradient-matching
View on GitHub
☆53Mar 31, 2026Updated 3 months ago
mlfoundations / Gelato
View on GitHub
🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents
☆46Dec 22, 2025Updated 7 months ago
csuhan / Tar
View on GitHub
[NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
☆202Sep 18, 2025Updated 10 months ago
eth-medical-ai-lab / smmile
View on GitHub
[NeurIPS Datasets & Benchmarks 2025] SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning
☆15Dec 2, 2025Updated 7 months ago
princeton-nlp / ELIZA-Transformer
View on GitHub
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆23Feb 9, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
RifleZhang / LLaVA-Reasoner-DPO
View on GitHub
☆116Jan 8, 2025Updated last year
tydpan / OpenPartSeg
View on GitHub
☆17May 26, 2023Updated 3 years ago
minwoosun / biomedica-etl
View on GitHub
[CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
☆107Mar 22, 2025Updated last year
VectorInstitute / mmlearn
View on GitHub
A toolkit for research on multimodal representation learning
☆20Updated this week
BNU-IVC / DroneGait
View on GitHub
Offical respority for Gait Recogniton with Drones: A benchmark (TMM 2023)
☆10Feb 2, 2024Updated 2 years ago
zjwang21 / StrokeNet
View on GitHub
The official code for our EMNLP 2022 long paper [Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation…
☆27Sep 10, 2025Updated 10 months ago
huggingface / wikirace-llms
View on GitHub
☆27May 7, 2025Updated last year