RWKV model implementation
☆37Jul 15, 2023Updated 2 years ago
Alternatives and similar repositories for rwkv
Users that are interested in rwkv are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementations of various linear RNN layers using pytorch and triton☆55Aug 4, 2023Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Evaluating LLMs with Dynamic Data☆114May 9, 2026Updated 2 weeks ago
- Recursive Bayesian Networks☆11May 11, 2025Updated last year
- Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023)☆20Mar 21, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Feb 1, 2024Updated 2 years ago
- Code for the paper "Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning"☆11May 5, 2021Updated 5 years ago
- A highly customizable, full scale web backend for web-rwkv, built on axum with websocket protocol.☆28Apr 15, 2024Updated 2 years ago
- Fast modular code to create and train cutting edge LLMs☆67May 16, 2024Updated 2 years ago
- Lean formalization of aperiodic monotiles papers (staging repository for material not yet in mathlib)☆15Apr 18, 2026Updated last month
- ☆19Dec 4, 2025Updated 5 months ago
- X (weighted / probabilistic) Context-Free Grammars☆25Jan 30, 2024Updated 2 years ago
- Easy trees in LaTeX and TikZ☆14Dec 16, 2022Updated 3 years ago
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆69Apr 24, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 3 years ago
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated last year
- Author implementation of the paper "Span-based Semantic Parsing for Compositional Generalization"☆17Aug 29, 2021Updated 4 years ago
- Implementation of fast algorithms for Maximum Spanning Tree (MST) parsing that includes fast ArcMax+Reweighting+Tarjan algorithm for sing…☆16Aug 9, 2023Updated 2 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆139Apr 30, 2024Updated 2 years ago
- Reading list for research topics in Diffusion models.☆18Jan 12, 2024Updated 2 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16May 17, 2026Updated last week
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions☆22May 5, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated last year
- PyTTI Documentation and Tutorials☆37Jul 7, 2023Updated 2 years ago
- rwkv finetuning☆37Apr 22, 2024Updated 2 years ago
- ☆52Jan 28, 2024Updated 2 years ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆25Jun 6, 2024Updated last year
- This project demonstrates the computation process of the RWKV (Receptance Weighted Key Value) model through Excel spreadsheets.☆21Jun 7, 2025Updated 11 months ago
- 基于RWKV模型的角色扮演,实际上是个改的妈都不认识的 RWKV_Role_Playing☆17Aug 17, 2023Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated 2 years ago
- This project is established for real-time training of the RWKV model.☆49May 17, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Example formalization of Game Theoretic concepts in Lean☆28Feb 14, 2025Updated last year
- Implementation of the RWKV language model in pure WebGPU/Rust.☆352Apr 1, 2026Updated last month
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 3 years ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- A go wrapper around the rwkv.cpp library☆20Mar 4, 2024Updated 2 years ago
- Python wrapper for the DPMMSubClusterStreaming.jl Julia package.☆14Sep 9, 2022Updated 3 years ago