OpenMOSS / LongLLaDALinks
[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
☆51Updated 2 months ago
Alternatives and similar repositories for LongLLaDA
Users that are interested in LongLLaDA are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆119Updated 3 weeks ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆41Updated 7 months ago
- ☆17Updated 6 months ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆99Updated this week
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆51Updated 6 months ago
- ☆75Updated 7 months ago
- Unofficial Implementation of Selective Attention Transformer☆20Updated last year
- ☆110Updated 4 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆89Updated last year
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆41Updated last year
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"☆55Updated 3 months ago
- ☆47Updated 4 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆57Updated last month
- [NeurIPS '25] Multi-Token Prediction Needs Registers☆26Updated last month
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆31Updated 9 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆65Updated 3 weeks ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆68Updated 4 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last week
- A Sober Look at Language Model Reasoning☆92Updated 2 months ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆57Updated last week
- The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated 5 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆95Updated 9 months ago
- ☆50Updated 11 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆124Updated 10 months ago
- [ICLR 2026] Geometric-Mean Policy Optimization☆99Updated last week
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Updated last month
- ☆59Updated 3 weeks ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆104Updated last week
- Esoteric Language Models☆110Updated 2 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Updated last month