Feng-Hong / WINO-DLLMLinks
☆36Updated 4 months ago
Alternatives and similar repositories for WINO-DLLM
Users that are interested in WINO-DLLM are comparing it to the libraries listed below
Sorting:
- A Collection of Papers on Diffusion Language Models☆149Updated 3 months ago
- Code release for VTW (AAAI 2025 Oral)☆65Updated last month
- ☆111Updated 3 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆77Updated 2 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆60Updated 8 months ago
- MokA: Multimodal Low-Rank Adaptation for MLLMs☆58Updated this week
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆95Updated 3 weeks ago
- [TMLR 2025] Efficient Reasoning Models: A Survey☆285Updated last month
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆67Updated 3 months ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".☆76Updated 5 months ago
- Paper List of Inference/Test Time Scaling/Computing☆335Updated 4 months ago
- [NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning☆93Updated 3 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠☆118Updated 6 months ago
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆96Updated 2 months ago
- A paper list of Awesome Latent Space.☆251Updated this week
- Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆73Updated 2 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆103Updated 3 weeks ago
- ☆55Updated last year
- ☆59Updated 5 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆148Updated 6 months ago
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆48Updated 9 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆102Updated 3 months ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆173Updated 2 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆228Updated 2 months ago
- [EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆104Updated last year
- The official implementation of dLLM-Var☆27Updated last month
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆98Updated last year
- Official codebase for the paper Latent Visual Reasoning☆66Updated 2 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆127Updated 3 months ago
- The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"☆125Updated this week