bcml-ai / rosa-plusLinks
ROSA+: RWKV's ROSA implementation with fallback statistical predictor
☆20Updated last month
Alternatives and similar repositories for rosa-plus
Users that are interested in rosa-plus are comparing it to the libraries listed below
Sorting:
- RWKV-7: Surpassing GPT☆101Updated last year
- ROSA-Tuning☆53Updated this week
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆45Updated last month
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆249Updated 10 months ago
- ☆64Updated 5 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 7 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆111Updated 7 months ago
- PyTorch implementation of models from the Zamba2 series.☆186Updated 10 months ago
- ☆39Updated 7 months ago
- ☆66Updated 8 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 8 months ago
- [NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory and 2× decrease in latency (Qwen3/2.5, Gemma3, LLaMA3)☆155Updated last week
- Lego for GRPO☆30Updated 6 months ago
- Work in progress.☆75Updated last week
- ☆111Updated 2 weeks ago
- ☆13Updated last year
- Official implementation for Training LLMs with MXFP4☆110Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆202Updated last year
- 👷 Build compute kernels☆190Updated this week
- ☆158Updated 5 months ago
- ☆53Updated last year
- REAP: Router-weighted Expert Activation Pruning for SMoE compression☆129Updated 3 weeks ago
- 3x Faster Inference; Unofficial implementation of EAGLE Speculative Decoding☆80Updated 5 months ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆155Updated last year
- RADLADS training code☆34Updated 6 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Updated last year
- H-Net Dynamic Hierarchical Architecture☆80Updated 2 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆31Updated 2 months ago
- ☆136Updated last year