Auraithm / LLADA_pretraining
☆31 Updated 4 months ago
Alternatives and similar repositories for LLADA_pretraining
Users who are interested in LLADA_pretraining are comparing it to the libraries listed below
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models. ☆148 Updated 6 months ago
- A Collection of Papers on Diffusion Language Models ☆149 Updated 3 months ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025] ☆69 Updated 3 weeks ago
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models ☆384 Updated 3 weeks ago
- ☆126 Updated last week
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of… ☆74 Updated 7 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference ☆234 Updated 3 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆89 Updated 10 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models" ☆40 Updated 6 months ago
- The official GitHub repo for the survey paper "A Survey on Diffusion Language Models". ☆628 Updated 3 weeks ago
- ☆55 Updated 7 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" ☆393 Updated 3 weeks ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆261 Updated last week
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints. ☆496 Updated 2 months ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR… ☆76 Updated 7 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca… ☆46 Updated 2 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning. ☆333 Updated 2 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling ☆64 Updated 10 months ago
- Paper List of Inference/Test Time Scaling/Computing ☆339 Updated 4 months ago
- Easy and Efficient dLLM Fine-Tuning ☆194 Updated 3 weeks ago
- A collection of papers on discrete diffusion models ☆167 Updated 6 months ago
- ☆304 Updated 3 weeks ago
- The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning" ☆135 Updated this week
- Official Repository of LatentSeek ☆73 Updated 7 months ago
- 🌐 Permanent Hosting Site: http://ai-paper-finder.info/ 🌐 Hugging Face Hosting: https://huggingface.co/spaces/wenhanacademia/ai-paper-f… ☆259 Updated last week
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper) ☆382 Updated 2 months ago
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT ☆131 Updated 10 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆84 Updated 6 months ago
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt… ☆85 Updated 2 weeks ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models" ☆191 Updated 10 months ago