DreamLM / DreamOn

Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas

☆41

Alternatives and similar repositories for DreamOn

Users who are interested in DreamOn are comparing it to the repositories listed below.

OpenSparseLLMs / MoM
☆96 Updated 3 months ago
hkust-nlp / Laser
Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
☆52 Updated 2 months ago
xuyige / SoftCoT
ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…
☆38 Updated 2 months ago
hkust-nlp / mstar
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆64 Updated 3 weeks ago
bigai-nlco / LatentSeek
Official Repository of LatentSeek
☆56 Updated 2 months ago
RUCBM / DeepCritic
Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"
☆32 Updated last month
TingchenFu / MathIF
Instruction-following benchmark for large reasoning models
☆36 Updated 2 months ago
zjunlp / LightThinker
LightThinker: Thinking Step-by-Step Compression
☆68 Updated 3 months ago
kiaia / GIRAFFE
Extending context length of visual language models
☆12 Updated 7 months ago
multimodal-art-projection / LatentCoT-Horizon
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
☆176 Updated this week
UCSB-NLP-Chang / ThinkPrune
☆39 Updated 3 months ago
TIGER-AI-Lab / General-Reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains
☆159 Updated 2 months ago
RLHFlow / RAFT
This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re…
☆33 Updated 10 months ago
MileBench / MileBench
This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"
☆36 Updated last year
OpenBMB / RLPR
Extrapolating RLVR to General Domains without Verifiers
☆134 Updated last week
NUS-TRAIL / NoisyRollout
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆84 Updated 2 months ago
ShadeCloak / ADORA
☆46 Updated 4 months ago
HKUNLP / diffusion-of-thoughts
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
☆173 Updated 5 months ago
ruixin31 / Spurious_Rewards
☆323 Updated last week
SihengLi99 / SEALONG
Large Language Models Can Self-Improve in Long-context Reasoning
☆72 Updated 8 months ago
bigai-ai / ICE
【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…
☆44 Updated 4 months ago
RM-R1-UIUC / RM-R1
RM-R1: Unleashing the Reasoning Potential of Reward Models
☆120 Updated last month
test-time-interaction / TTI
☆53 Updated 2 months ago
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"
☆134 Updated 2 months ago
sail-sg / AnytimeReasoner
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆43 Updated 3 weeks ago
ssmisya / PRMBench
[ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.
☆78 Updated 5 months ago
haonan3 / AnchorContext
AnchorAttention: Improved attention for LLMs long-context training
☆212 Updated 6 months ago
thu-ml / Noise-Contrastive-Alignment
Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)
☆55 Updated 9 months ago
OpenSparseLLMs / LLaMA-MoE-v2
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
☆86 Updated 8 months ago
yczhou001 / Awesome-Diffusion-LLM
Paper list, tutorial, and nano code snippets for Diffusion Large Language Models.
☆96 Updated last month