TorchRWKV / flash-linear-attentionLinks

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

☆29

Alternatives and similar repositories for flash-linear-attention

Users that are interested in flash-linear-attention are comparing it to the libraries listed below

Sorting:

TorchRWKV / rwkv-kit
☆22Updated 5 months ago
OpenMOSE / RWKV-Infer
A large-scale RWKV v6, v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to de…
☆35Updated this week
howard-hou / RWKV-X
RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…
☆37Updated last month
JL-er / WorldRWKV
The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…
☆47Updated 2 weeks ago
yynil / RWKVInside
☆34Updated last month
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated last month
BlinkDL / LinearAttentionArena
Here we will test various linear attention designs.
☆58Updated last year
yynil / RWKVinLLAMA
☆18Updated 5 months ago
LeC-Z / RWKV-nonogram
A 20M RWKV v6 can do nonogram
☆14Updated 7 months ago
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45Updated 10 months ago
JL-er / RWKV-PEFT
☆124Updated this week
00ffcc / chunkRWKV6
continous batching and parallel acceleration for RWKV6
☆24Updated 11 months ago
cryscan / web-rwkv-inspector
☆13Updated 5 months ago
shoumenchougou / Awesome-RWKV-Prompts
用户友好、开箱即用的 RWKV Prompts 示例，适用于所有用户。Awesome RWKV Prompts for general users, more user-friendly, ready-to-use prompt examples.
☆35Updated 4 months ago
Alic-Li / BlackGoose_Rimer
BlackGoose Rimer: RWKV as a Superior Architecture for Large-Scale Time Series Modeling
☆24Updated last month
nanowell / Q-Sparse-LLM
My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆32Updated 9 months ago
Triang-jyed-driung / rwkv7mini
RWKV-7 mini
☆11Updated 2 months ago
yuunnn-w / RWKV_Pytorch
This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…
☆128Updated 10 months ago
OpenMOSE / RWKV-LM-RLHF
Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the…
☆42Updated 2 weeks ago
SmerkyG / RWKV_Explained
RWKV, in easy to read code
☆72Updated 2 months ago
zhiyuan1i / TorchRWKV
RWKV6 in native pytorch and triton:)
☆11Updated 10 months ago
OpenMOSE / RWKV5-LM-LoRA
RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …
☆13Updated last year
Jellyfish042 / RWKV-StateTuning
State tuning tunes the state
☆33Updated 3 months ago
JL-er / MiSS
☆14Updated last week
Yuan-ManX / Titans-PyTorch
PyTorch implementation of Titans.
☆23Updated 4 months ago
BBuf / RWKV-World-HF-Tokenizer
☆34Updated 10 months ago
RWKV / RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…
☆44Updated 2 months ago
fla-org / flash-bidirectional-linear-attention
Triton implement of bi-directional (non-causal) linear attention
☆48Updated 4 months ago
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆54Updated 9 months ago
nikhilvyas / SOAP_MUON
Combining SOAP and MUON
☆16Updated 3 months ago