Feng-Hong / WINO-DLLMLinks
☆37Updated 5 months ago
Alternatives and similar repositories for WINO-DLLM
Users that are interested in WINO-DLLM are comparing it to the libraries listed below
Sorting:
- A Collection of Papers on Diffusion Language Models☆155Updated 4 months ago
- Code release for VTW (AAAI 2025 Oral)☆64Updated 3 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆84Updated 3 months ago
- ☆113Updated 5 months ago
- Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation☆23Updated 4 months ago
- 🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.☆137Updated 3 weeks ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".☆84Updated 7 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆67Updated 10 months ago
- [TMLR 2025] Efficient Reasoning Models: A Survey☆298Updated last week
- [NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning☆101Updated 4 months ago
- Official codebase for the paper Latent Visual Reasoning☆109Updated 3 months ago
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆73Updated last week
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆70Updated 4 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆104Updated 4 months ago
- [NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models☆70Updated 4 months ago
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "Sp…☆237Updated last month
- Doodling our way to AGI ✏️ 🖼️ 🧠☆121Updated 8 months ago
- [ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"☆143Updated 2 weeks ago
- ☆56Updated last year
- [EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆104Updated last year
- [NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.☆86Updated 4 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆104Updated last month
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]☆73Updated last month
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆251Updated 3 months ago
- ☆318Updated last month
- Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"☆58Updated last month
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆50Updated 11 months ago
- Paper List of Inference/Test Time Scaling/Computing☆344Updated 5 months ago
- A paper list of Awesome Latent Space.☆333Updated this week
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆103Updated 4 months ago