ZichenWen1 / DIJA
(ICLR 2026) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"
★73 · Updated last week
Alternatives and similar repositories for DIJA
Users who are interested in DIJA are comparing it to the repositories listed below.
- [ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R… ★110 · Updated 7 months ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models ★74 · Updated 8 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ★134 · Updated 4 months ago
- ★56 · Updated last year
- Official Repository of LatentSeek ★76 · Updated 8 months ago
- ★72 · Updated 6 months ago
- VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues ★44 · Updated 8 months ago
- Code release for VTW (AAAI 2025 Oral) ★64 · Updated 3 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering ★103 · Updated last year
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs. ★84 · Updated 3 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg… ★183 · Updated 3 months ago
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi… ★69 · Updated last year
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib… ★33 · Updated 6 months ago
- Doodling our way to AGI ★121 · Updated 8 months ago
- GitHub repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025) ★88 · Updated 4 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ★104 · Updated 4 months ago
- [ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning" ★143 · Updated 2 weeks ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ★89 · Updated 11 months ago
- A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs). ★41 · Updated 8 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25] ★180 · Updated 8 months ago
- ★63 · Updated 6 months ago
- [NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning ★96 · Updated 4 months ago
- A Collection of Papers on Diffusion Language Models ★155 · Updated 4 months ago
- [ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models ★48 · Updated 7 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning" ★113 · Updated this week
- Towards Efficient Multimodal Large Language Models: A Survey on Token Compression ★98 · Updated 3 weeks ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI ★250 · Updated 3 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in… ★171 · Updated 4 months ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning". ★84 · Updated 7 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025) ★67 · Updated 9 months ago