Auraithm / LLADA_pretrainingLinks
☆26Updated last month
Alternatives and similar repositories for LLADA_pretraining
Users that are interested in LLADA_pretraining are comparing it to the libraries listed below
Sorting:
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆247Updated last week
- The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".☆263Updated last week
- A Collection of Papers on Diffusion Language Models☆126Updated last week
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆145Updated 2 weeks ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆111Updated 2 months ago
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align…☆99Updated this week
- ☆227Updated this week
- DDN: A novel generative model with simple principles and unique properties. (ICLR 2025)☆49Updated last month
- TraceRL - Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆187Updated last week
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆329Updated 2 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆35Updated 2 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆312Updated 2 months ago
- [ICML2025] Official code for "Reinforced Lifelong Editing for Language Models"☆16Updated 7 months ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆62Updated 3 months ago
- A collection of papers on discrete diffusion models☆160Updated 2 months ago
- [Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.☆454Updated last week
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 8 months ago
- [ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer☆289Updated 8 months ago
- [ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R…☆106Updated 2 months ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models"☆48Updated last month
- This is for ACL 2025 Findings Paper: From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalitiesModels☆59Updated 2 weeks ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆46Updated 3 months ago
- The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…☆63Updated 8 months ago
- Paper List of Inference/Test Time Scaling/Computing☆307Updated 3 weeks ago
- [ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization☆98Updated 3 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆86Updated 7 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆178Updated 6 months ago
- Official PyTorch implementation of EMOVA in CVPR 2025 (https://arxiv.org/abs/2409.18042)☆68Updated 6 months ago
- ☆44Updated 3 months ago
- ☆59Updated 2 months ago