Auraithm / LLADA_pretrainingLinks
☆30Updated 2 months ago
Alternatives and similar repositories for LLADA_pretraining
Users that are interested in LLADA_pretraining are comparing it to the libraries listed below
Sorting:
- dLLM: Simple Diffusion Language Modeling☆189Updated this week
- A Collection of Papers on Diffusion Language Models☆137Updated last month
- The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".☆412Updated last week
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆124Updated 4 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆356Updated 3 weeks ago
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆364Updated last month
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align…☆107Updated last month
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆296Updated 2 weeks ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆57Updated 5 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆185Updated last month
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆40Updated 4 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆188Updated last week
- [Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.☆490Updated 2 weeks ago
- [ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R…☆107Updated 4 months ago
- ☆179Updated 5 months ago
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆107Updated this week
- ☆262Updated 3 weeks ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆264Updated last week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆86Updated 8 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆193Updated 3 weeks ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆37Updated this week
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆66Updated 5 months ago
- A collection of papers on discrete diffusion models☆166Updated 4 months ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆27Updated 3 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆229Updated 11 months ago
- ☆51Updated 5 months ago
- EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning [🔥The Exploration of R1 for General Audio-Vi…☆62Updated 5 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆343Updated 4 months ago
- Synth-Empathy: Towards High-Quality Synthetic Empathy Data☆16Updated 8 months ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models"☆59Updated 3 months ago