This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation is overly complex and lacks extensibility. Let's join the flexible PyTorch ecosystem and open-source it together!
☆133Jul 20, 2024Updated last year
Alternatives and similar repositories for RWKV_Pytorch
Users that are interested in RWKV_Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆48Apr 2, 2026Updated last month
- ☆12Dec 21, 2024Updated last year
- ☆176Jan 13, 2026Updated 3 months ago
- The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.☆610Feb 22, 2026Updated 2 months ago
- ☆14May 11, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- 用户友好、开箱即用的 RWKV Prompts 示例,适用于所有用户。Awesome RWKV Prompts for general users, more user-friendly, ready-to-use prompt examples.☆34Apr 13, 2026Updated 3 weeks ago
- ☆17Aug 1, 2023Updated 2 years ago
- A 20M RWKV v6 can do nonogram☆13Oct 18, 2024Updated last year
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15Apr 16, 2026Updated 2 weeks ago
- 更简单的微调,提供便捷脚本,微调说明☆35May 30, 2025Updated 11 months ago
- ☆22Dec 28, 2024Updated last year
- ☆20Aug 1, 2024Updated last year
- RWKV centralised docs for the community☆32Jan 17, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆49Oct 21, 2025Updated 6 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Updated this week
- Implementation of the RWKV language model in pure WebGPU/Rust.☆348Apr 1, 2026Updated last month
- ☆26Feb 26, 2026Updated 2 months ago
- continous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.☆247Jan 13, 2026Updated 3 months ago
- Fast modular code to create and train cutting edge LLMs☆67May 16, 2024Updated last year
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Aug 13, 2024Updated last year
- ☆17Jan 1, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- RAG SYSTEM FOR RWKV☆53Dec 4, 2024Updated last year
- Evaluating LLMs with Dynamic Data☆113Apr 20, 2026Updated 2 weeks ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- BlackGoose Rimer: RWKV as a Superior Architecture for Large-Scale Time Series Modeling☆33Jul 11, 2025Updated 9 months ago
- State tuning tunes the state☆35Feb 12, 2025Updated last year
- This is a warehouse of object detection (yolov) datasets for Vtuber and Vup (i.e., virtual streamers), including 18 Vtuber/Vups. The data…☆12Apr 5, 2024Updated 2 years ago
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated 2 years ago
- Solving puzzles with RWKV locally in your browser.☆12Mar 31, 2026Updated last month
- The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…☆67Mar 18, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆57Mar 31, 2026Updated last month
- This project demonstrates the computation process of the RWKV (Receptance Weighted Key Value) model through Excel spreadsheets.☆22Jun 7, 2025Updated 10 months ago
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆549Feb 18, 2025Updated last year
- Efficient RWKV inference engine. RWKV7 7.2B fp16 decoding 10250 tps @ single 5090.☆103Updated this week
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- A fast RWKV Tokenizer written in Rust☆54Aug 12, 2025Updated 8 months ago
- A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…☆312Jan 31, 2024Updated 2 years ago