codefuse-ai / rodimusLinks

☆63

Alternatives and similar repositories for rodimus

Users that are interested in rodimus are comparing it to the libraries listed below

Sorting:

howard-hou / RWKV-X
RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…
☆39Updated last month
nanowell / Q-Sparse-LLM
My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆32Updated 10 months ago
recursal / RADLADS-paper
RADLADS training code
☆24Updated last month
AwesomeSeq / Comba-triton
☆18Updated last week
Infini-AI-Lab / gsm_infinite
☆47Updated 2 weeks ago
menhguin / minp_paper
Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper
☆38Updated 3 months ago
BlinkDL / LinearAttentionArena
Here we will test various linear attention designs.
☆59Updated last year
VITA-Group / WeLore
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…
☆47Updated 2 months ago
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45Updated 11 months ago
kyleliang919 / Online-Subspace-Descent
This repo is based on https://github.com/jiaweizzhao/GaLore
☆28Updated 9 months ago
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated last month
Infini-AI-Lab / Multiverse
☆58Updated this week
RobertCsordas / moeut
☆79Updated 10 months ago
qiuzh20 / gated_attention
The official implementation for Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
☆44Updated last month
inclusionAI / Ring
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.
☆59Updated last week
OpenMOSE / RWKV-Infer
A large-scale RWKV v6, v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to de…
☆38Updated 3 weeks ago
GSYfate / knnlm-limits
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆23Updated last month
hkust-nlp / PreSelect
[ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches
☆49Updated 3 months ago
OpenSparseLLMs / Linearization
☆51Updated 3 months ago
NathanGodey / qfilters
Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)
☆33Updated 3 months ago
Tomorrowdawn / top_nsigma
The official code repo and data hub of top_nsigma sampling strategy for LLMs.
☆26Updated 4 months ago
RWKV-Vibe / RWKV-LM-V7
RWKV-LM-V7(https://github.com/BlinkDL/RWKV-LM) Under Lightning Framework
☆31Updated this week
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆76Updated last year
chengyou-jia / AgentStore
☆38Updated 6 months ago
uservan / speculative_thinking
☆20Updated 3 weeks ago
HKUNLP / critic-rl
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆99Updated last month
OpenSparseLLMs / Linear-MoE
☆104Updated 3 weeks ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆32Updated 3 months ago
OpenMOSS / Lorsa
☆20Updated last week
dangxingyu / rnn-icrag
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Updated last year