uniartisan / TorchRWKV

RWKV6 in native PyTorch and Triton :)

☆11

Alternatives and similar repositories for TorchRWKV:

Users who are interested in TorchRWKV are comparing it to the libraries listed below.

TorchRWKV / flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
☆15 Updated this week
OpenMOSE / RWKV-Infer
Large-scale RWKV v6 and v7 inference. Capable of inference by combining multiple states (Pseudo MoE). Easy to deploy on Docker. Supports tr…
☆25 Updated last week
Jellyfish042 / RWKV-StateTuning
State tuning tunes the state
☆29 Updated 10 months ago
glassroom / heinsen_attention
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆24 Updated 7 months ago
Doraemonzzz / hgru2-pytorch
☆24 Updated 4 months ago
AlirezaMorsali / MLP-Attention
☆14 Updated last month
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆52 Updated 5 months ago
nanowell / Q-Sparse-LLM
My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆31 Updated 5 months ago
kazuki-irie / kv-memory-brain
Official Code Repository for the paper "Key-value memory in the brain"
☆20 Updated last week
lucidrains / light-recurrent-unit-pytorch
Implementation of a Light Recurrent Unit in PyTorch
☆48 Updated 3 months ago
BBuf / flash-rwkv
☆31 Updated 8 months ago
BlinkDL / LinearAttentionArena
Here we will test various linear attention designs.
☆58 Updated 9 months ago
RyokoAI / BigKnow2022
BigKnow2022: Bringing Language Models Up to Speed
☆14 Updated last year
TorchRWKV / rwkv-kit
☆17 Updated last month
annosubmission / GRC-Cache
☆16 Updated last year
SprocketLab / sparse_matrix_fine_tuning
Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"
☆17 Updated 3 weeks ago
habanero-lab / APPy
APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…
☆23 Updated last month
smonsays / hypernetwork-attention
Official code for the paper "Attention as a Hypernetwork"
☆23 Updated 7 months ago
BBuf / RWKV-World-HF-Tokenizer
☆33 Updated 6 months ago
SmerkyG / RWKV_Explained
RWKV, in easy-to-read code
☆62 Updated 2 months ago
NX-AI / mlstm_kernels
A library for fast and efficient mLSTM kernels.
☆29 Updated last month
johanwind / wind_rwkv
☆13 Updated last month
sustcsonglin / gated_linear_attention_layer
☆32 Updated last year
tencent-ailab / TriNet
TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.
☆26 Updated last year
RobertCsordas / moe
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆36 Updated last year
Doraemonzzz / Awesome-Triton-Resources
Awesome Triton Resources
☆19 Updated last month
fla-org / flash-bidirectional-linear-attention
Triton implementation of bi-directional (non-causal) linear attention
☆38 Updated 2 weeks ago
TRI-ML / linear_open_lm
A repository for research on medium-sized language models.
☆76 Updated 8 months ago
pabloiyu / mini-language-model
Implementing Mamba SSM into a mini language model and training it on the open domain works of Sherlock Holmes. Also, implementation of pa…
☆9 Updated 10 months ago