codefuse-ai / rodimusLinks
☆177Updated 9 months ago
Alternatives and similar repositories for rodimus
Users that are interested in rodimus are comparing it to the libraries listed below
Sorting:
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆54Updated 3 weeks ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆46Updated 5 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Updated last month
- RADLADS training code☆36Updated 9 months ago
- ☆71Updated last year
- [ICLR 2026] GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)☆78Updated 2 weeks ago
- A repository for research on medium sized language models.☆77Updated last year
- When Reasoning Meets Its Laws☆35Updated last month
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆191Updated 7 months ago
- ☆41Updated 9 months ago
- Universal Reasoning Model☆122Updated 3 weeks ago
- The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM…☆290Updated 6 months ago
- ☆25Updated 8 months ago
- Memory optimized Mixture of Experts☆73Updated 6 months ago
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆21Updated 11 months ago
- GoldFinch and other hybrid transformer components☆45Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆33Updated last year
- Make reasoning models scalable☆46Updated 8 months ago
- Official Repository of Native Parallel Reasoner☆100Updated this week
- ☆100Updated 6 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆108Updated 8 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 9 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆33Updated 5 months ago
- Esoteric Language Models☆111Updated this week
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆52Updated 2 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 4 months ago
- The code implementation of Symbolic-MoE☆46Updated 5 months ago
- PeRL: Parameter-Efficient Reinforcement Learning☆68Updated 3 weeks ago
- ☆91Updated last year
- The official github repo for "Diffusion Language Models are Super Data Learners".☆221Updated 3 months ago