Feng-Hong / WINO-DLLM
☆36 · Updated 4 months ago
Alternatives and similar repositories for WINO-DLLM
Users who are interested in WINO-DLLM are comparing it to the repositories listed below.
- A Collection of Papers on Diffusion Language Models ☆151 · Updated 4 months ago
- Code release for VTW (AAAI 2025 Oral) ☆64 · Updated 2 months ago
- ☆112 · Updated 4 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs. ☆80 · Updated 2 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025) ☆63 · Updated 9 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠 ☆120 · Updated 7 months ago
- [TMLR 2025] Efficient Reasoning Models: A Survey ☆292 · Updated 2 weeks ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg… ☆178 · Updated 2 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI ☆236 · Updated 3 months ago
- The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning" ☆135 · Updated last week
- ☆55 · Updated last year
- Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs" ☆73 · Updated 3 months ago
- Paper List of Inference/Test Time Scaling/Computing ☆339 · Updated 4 months ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning". ☆78 · Updated 6 months ago
- MokA: Multimodal Low-Rank Adaptation for MLLMs ☆66 · Updated 2 weeks ago
- A paper list of Awesome Latent Space. ☆289 · Updated last week
- [NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning ☆95 · Updated 3 months ago
- 🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks. ☆125 · Updated 2 weeks ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025] ☆69 · Updated last month
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆102 · Updated 3 months ago
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont… ☆68 · Updated 3 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought ☆104 · Updated 2 weeks ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key ☆100 · Updated last week
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆261 · Updated last week
- [ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchma… ☆69 · Updated 6 months ago
- ☆307 · Updated last month
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains ☆68 · Updated 5 months ago
- One-shot Entropy Minimization ☆187 · Updated 7 months ago
- [EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In… ☆103 · Updated last year
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More" ☆98 · Updated 3 months ago