DreamLM / Dream-CoderLinks
☆22Updated this week
Alternatives and similar repositories for Dream-Coder
Users that are interested in Dream-Coder are comparing it to the libraries listed below
Sorting:
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆34Updated 3 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆42Updated this week
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆32Updated 3 weeks ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆40Updated last year
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆99Updated last week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆77Updated 5 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆85Updated 3 weeks ago
- A Collection of Papers on Diffusion Language Models☆91Updated 2 weeks ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆62Updated last week
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)☆64Updated last month
- Official Repository of LatentSeek☆54Updated last month
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆35Updated last month
- ☆15Updated 3 months ago
- ☆37Updated 3 months ago
- ☆91Updated 3 months ago
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆78Updated last month
- ☆15Updated 7 months ago
- ☆46Updated 3 months ago
- [arXiv] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs☆34Updated 2 months ago
- Extending context length of visual language models☆11Updated 7 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆91Updated 2 months ago
- ☆48Updated last month
- A repo for open research on building large reasoning models☆71Updated this week
- Code for Heima☆50Updated 3 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆112Updated 3 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆72Updated 2 months ago
- AnchorAttention: Improved attention for LLMs long-context training☆211Updated 6 months ago
- A collection of papers on discrete diffusion models☆152Updated 3 weeks ago
- ☆320Updated last month
- Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆46Updated last month