codekansas/rwkv

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/codekansas/rwkv)

codekansas / rwkv

RWKV model implementation

☆37

Alternatives and similar repositories for rwkv

Users that are interested in rwkv are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sustcsonglin / flash-linear-rnn
View on GitHub
Implementations of various linear RNN layers using pytorch and triton
☆55Aug 4, 2023Updated 2 years ago
PicoCreator / RWKV-LM-LoRA
View on GitHub
RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆10Nov 3, 2023Updated 2 years ago
Jellyfish042 / uncheatable_eval
View on GitHub
Evaluating LLMs with Dynamic Data
☆117May 9, 2026Updated 2 months ago
bdusell / nondeterministic-stack-rnn
View on GitHub
Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023)
☆20Mar 21, 2024Updated 2 years ago
rycolab / prefix-parsing
View on GitHub
☆14Feb 1, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AndPotap / einsum-search
View on GitHub
☆34Oct 4, 2024Updated last year
deep-spin / understanding-spigot
View on GitHub
Code for the paper "Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning"
☆11May 5, 2021Updated 5 years ago
wetneb / sheetshow
View on GitHub
A renderer for sheet diagrams in bimonoidal categories
☆13Aug 28, 2021Updated 4 years ago
ethansmith2000 / TransformerExperiments
View on GitHub
☆19Dec 4, 2025Updated 7 months ago
davidweichiang / tikz-qtree
View on GitHub
Easy trees in LaTeX and TikZ
☆15Dec 16, 2022Updated 3 years ago
jsm28 / AperiodicMonotilesLean
View on GitHub
Lean formalization of aperiodic monotiles papers (staging repository for material not yet in mathlib)
☆15Jul 7, 2026Updated 2 weeks ago
OpenNLPLab / HGRN
View on GitHub
[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…
☆68Apr 24, 2024Updated 2 years ago
Jellyfish042 / RWKV-15Puzzle
View on GitHub
☆12Dec 14, 2024Updated last year
Prunoideae / web-rwkv-axum
View on GitHub
A highly customizable, full scale web backend for web-rwkv, built on axum with websocket protocol.
☆30Apr 15, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
tinkoff-ai / exact
View on GitHub
The original PyTorch implementation of the "EXACT: How Train Your Accuracy"
☆10Sep 22, 2022Updated 3 years ago
stanojevic / Fast-MST-Algorithm
View on GitHub
Implementation of fast algorithms for Maximum Spanning Tree (MST) parsing that includes fast ArcMax+Reweighting+Tarjan algorithm for sing…
☆16Aug 9, 2023Updated 2 years ago
japerry911 / crypto-data-pipeline
View on GitHub
Data Pipeline that utilizes GCP, Python 3.10, Prefect, and more.
☆10Jan 23, 2023Updated 3 years ago
ucasligang / awesome-Diffusion
View on GitHub
Reading list for research topics in Diffusion models.
☆18Jan 12, 2024Updated 2 years ago
Runjing-Liu120 / RaoBlackwellizedSGD
View on GitHub
A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions
☆22May 5, 2019Updated 7 years ago
iantbutler01 / ditty
View on GitHub
A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.
☆16Jun 10, 2026Updated last month
RWKV / RWKV-cpp-node
View on GitHub
Node.js implementation binding for the RWKV.cpp module
☆22Aug 2, 2023Updated 2 years ago
ssbuild / rwkv_finetuning
View on GitHub
rwkv finetuning
☆37Apr 22, 2024Updated 2 years ago
hirokatsukataoka16 / cvpaper.challenge-summary
View on GitHub
コンピュータビジョン研究コミュニティcvpaper.challengeのサマリ。サーベイ資料や研究成果など。
☆12Jan 20, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
seasonjs / rwkv
View on GitHub
pure go for rwkv
☆18Dec 31, 2023Updated 2 years ago
sustcsonglin / mamba-triton
View on GitHub
☆52Jan 28, 2024Updated 2 years ago
shtechair / CRFAE-Dep-Parser
View on GitHub
☆19Sep 29, 2019Updated 6 years ago
xlhex / dpe
View on GitHub
☆22Oct 26, 2020Updated 5 years ago
ZeldaHuang / rwkv-cpp-server
View on GitHub
Easily deploy your rwkv model
☆19May 5, 2023Updated 3 years ago
kaiokendev / cutoff-len-is-context-len
View on GitHub
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Jun 21, 2023Updated 3 years ago
McGill-NLP / length-generalization
View on GitHub
Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023
☆139Apr 30, 2024Updated 2 years ago
lukasVierling / FaceRWKV
View on GitHub
Course Project for COMP4471 on RWKV
☆17Feb 11, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
metaphorz / MP-v-diffusion-images512x512
View on GitHub
v objective diffusion inference code for PyTorch.
☆15Feb 17, 2022Updated 4 years ago
AXKuhta / rwkv-onnx-dml
View on GitHub
Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…
☆21Mar 16, 2023Updated 3 years ago
BGU-CS-VIL / dpmmpythonStreaming
View on GitHub
Python wrapper for the DPMMSubClusterStreaming.jl Julia package.
☆14Sep 9, 2022Updated 3 years ago
wozeparrot / tinyrwkv
View on GitHub
tinygrad port of the RWKV large language model.
☆44Mar 9, 2025Updated last year
acosharma / elita-transformer
View on GitHub
Official Repository for Efficient Linear-Time Attention Transformers.
☆18Jun 2, 2024Updated 2 years ago
neulab / neural-lpcfg
View on GitHub
The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)
☆33Sep 22, 2025Updated 10 months ago
FilippoC / diffdp
View on GitHub
Differentiable Perturb-and-Parse operator
☆25Mar 7, 2019Updated 7 years ago