This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation is overly complex and lacks extensibility. Let's join the flexible PyTorch ecosystem and open-source it together!
☆133Jul 20, 2024Updated last year
Alternatives and similar repositories for RWKV_Pytorch
Users that are interested in RWKV_Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python QQ robot backend based on the Shamrock framework, which is used to connect large language models RWKV to QQ.一个基于Shamrock框架的Pytho…☆23Mar 20, 2024Updated 2 years ago
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆48Aug 22, 2025Updated 7 months ago
- ☆13Dec 21, 2024Updated last year
- ☆178Jan 13, 2026Updated 2 months ago
- The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.☆606Feb 22, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13May 11, 2025Updated 10 months ago
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- 用户友好、开箱即用的 RWKV Prompts 示例,适用于所有用户。Awesome RWKV Prompts for general users, more user-friendly, ready-to-use prompt examples.☆35Jan 24, 2025Updated last year
- ☆17Aug 1, 2023Updated 2 years ago
- A 20M RWKV v6 can do nonogram☆14Oct 18, 2024Updated last year
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15Feb 20, 2026Updated last month
- 更简单的微调,提供便捷脚本,微调说明☆37May 30, 2025Updated 9 months ago
- ☆23Dec 28, 2024Updated last year
- ☆20Aug 1, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆48Oct 21, 2025Updated 5 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Feb 14, 2026Updated last month
- Implementation of the RWKV language model in pure WebGPU/Rust.☆346Jan 10, 2026Updated 2 months ago
- ☆13Feb 20, 2026Updated last month
- ☆27Feb 26, 2026Updated 3 weeks ago
- continous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.☆244Jan 13, 2026Updated 2 months ago
- Fast modular code to create and train cutting edge LLMs☆68May 16, 2024Updated last year
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Aug 13, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17Jan 1, 2025Updated last year
- RAG SYSTEM FOR RWKV☆53Dec 4, 2024Updated last year
- Evaluating LLMs with Dynamic Data☆113Feb 11, 2026Updated last month
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- BlackGoose Rimer: RWKV as a Superior Architecture for Large-Scale Time Series Modeling☆32Jul 11, 2025Updated 8 months ago
- Inference RWKV with multiple supported backends.☆82Mar 11, 2026Updated 2 weeks ago
- State tuning tunes the state☆35Feb 12, 2025Updated last year
- This is a warehouse of object detection (yolov) datasets for Vtuber and Vup (i.e., virtual streamers), including 18 Vtuber/Vups. The data…☆12Apr 5, 2024Updated last year
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…☆66Mar 18, 2026Updated last week
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆56Updated this week
- This project demonstrates the computation process of the RWKV (Receptance Weighted Key Value) model through Excel spreadsheets.☆20Jun 7, 2025Updated 9 months ago
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆543Feb 18, 2025Updated last year
- Efficient RWKV inference engine. RWKV7 7.2B fp16 decoding 10250 tps @ single 5090.☆95Feb 1, 2026Updated last month
- ☆124Dec 15, 2023Updated 2 years ago
- Mini Model Daemon☆12Nov 9, 2024Updated last year