This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation is overly complex and lacks extensibility. Let's join the flexible PyTorch ecosystem and open-source it together!
☆133Jul 20, 2024Updated last year
Alternatives and similar repositories for RWKV_Pytorch
Users that are interested in RWKV_Pytorch are comparing it to the libraries listed below
Sorting:
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆48Aug 22, 2025Updated 6 months ago
- ☆13Dec 21, 2024Updated last year
- A Python QQ robot backend based on the Shamrock framework, which is used to connect large language models RWKV to QQ.一个基于Shamrock框架的Pytho…☆23Mar 20, 2024Updated last year
- The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.☆603Feb 22, 2026Updated last week
- ☆13May 11, 2025Updated 9 months ago
- 用户友好、开箱即用的 RWKV Prompts 示例,适用于所有用户。Awesome RWKV Prompts for general users, more user-friendly, ready-to-use prompt examples.☆35Jan 24, 2025Updated last year
- A 20M RWKV v6 can do nonogram☆14Oct 18, 2024Updated last year
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- 更简单的微调,提供便捷脚本,微调说明☆37May 30, 2025Updated 9 months ago
- ☆23Dec 28, 2024Updated last year
- ☆20Aug 1, 2024Updated last year
- continous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆47Oct 21, 2025Updated 4 months ago
- RWKV centralised docs for the community☆31Jan 17, 2026Updated last month
- ☆17Jan 1, 2025Updated last year
- ☆17Aug 1, 2023Updated 2 years ago
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.☆244Jan 13, 2026Updated last month
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15Feb 20, 2026Updated last week
- ☆27Feb 26, 2026Updated last week
- ☆13Feb 20, 2026Updated last week
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 14, 2026Updated 2 weeks ago
- Implementation of the RWKV language model in pure WebGPU/Rust.☆342Jan 10, 2026Updated last month
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Aug 13, 2024Updated last year
- ☆24Sep 25, 2024Updated last year
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …☆14Sep 18, 2025Updated 5 months ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- Evaluating LLMs with Dynamic Data☆111Feb 11, 2026Updated 3 weeks ago
- Inference RWKV with multiple supported backends.☆80Updated this week
- RAG SYSTEM FOR RWKV☆52Dec 4, 2024Updated last year
- BlackGoose Rimer: RWKV as a Superior Architecture for Large-Scale Time Series Modeling☆32Jul 11, 2025Updated 7 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- ☆125Dec 15, 2023Updated 2 years ago
- A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…☆312Jan 31, 2024Updated 2 years ago
- Fast modular code to create and train cutting edge LLMs☆68May 16, 2024Updated last year
- RWKV, in easy to read code☆72Mar 25, 2025Updated 11 months ago
- RWKV-TS: Beyond Traditional Recurrent Neural Network for Time Series Tasks☆122Aug 16, 2024Updated last year
- ☆34Jul 21, 2024Updated last year