Auraithm / LLADA_pretraining
☆24 Updated 2 weeks ago
Alternatives and similar repositories for LLADA_pretraining
Users who are interested in LLADA_pretraining are comparing it to the repositories listed below.
- A Collection of Papers on Diffusion Language Models ☆119 Updated this week
- OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Rea… ☆95 Updated 2 months ago
- DDN: A novel generative model with simple principles and unique properties. (ICLR 2025) ☆45 Updated 2 weeks ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference ☆121 Updated last week
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models. ☆108 Updated 2 months ago
- ☆44 Updated last month
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR… ☆59 Updated 3 months ago
- [Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification. ☆392 Updated last week
- This is for ACL 2025 Findings Paper: From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalities ☆52 Updated last month
- [ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R… ☆106 Updated last month
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper) ☆314 Updated last month
- The official GitHub repo for the survey paper "A Survey on Diffusion Language Models". ☆193 Updated last week
- ☆218 Updated 3 weeks ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models" ☆34 Updated 2 months ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib… ☆25 Updated last month
- A collection of papers on discrete diffusion models ☆158 Updated 2 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning ☆222 Updated 9 months ago
- ☆59 Updated last month
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" ☆45 Updated last month
- EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning [🔥The Exploration of R1 for General Audio-Vi… ☆53 Updated 3 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" ☆287 Updated 2 months ago
- A modified LLaVA framework for MOSS2 that makes MOSS2 a multimodal model. ☆13 Updated 11 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI ☆136 Updated last month
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆153 Updated 2 weeks ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of… ☆43 Updated 3 months ago
- ☆57 Updated 3 months ago
- Paper List of Inference/Test Time Scaling/Computing ☆297 Updated this week
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache… ☆143 Updated 3 weeks ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆84 Updated 6 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method ☆180 Updated last year