yuunnn-w / RWKV_PytorchLinks
This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation is overly complex and lacks extensibility. Let's join the flexible PyTorch ecosystem and open-source it together!
☆130Updated last year
Alternatives and similar repositories for RWKV_Pytorch
Users that are interested in RWKV_Pytorch are comparing it to the libraries listed below
Sorting:
- ☆145Updated 3 weeks ago
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.☆233Updated 3 months ago
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆44Updated 3 weeks ago
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Updated last year
- Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the…☆54Updated 3 weeks ago
- Evaluating LLMs with Dynamic Data☆91Updated last month
- This project is established for real-time training of the RWKV model.☆50Updated last year
- RAG SYSTEM FOR RWKV☆51Updated 9 months ago
- rwkv finetuning☆37Updated last year
- ☆17Updated 8 months ago
- 用户友好、开箱即用的 RWKV Prompts 示例,适用于所有用户。Awesome RWKV Prompts for general users, more user-friendly, ready-to-use prompt examples.☆36Updated 7 months ago
- ☆22Updated 8 months ago
- RWKV-LM-V7(https://github.com/BlinkDL/RWKV-LM) Under Lightning Framework☆45Updated last month
- State tuning tunes the state☆35Updated 7 months ago
- Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models☆328Updated 6 months ago
- The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…☆57Updated last week
- Inference RWKV with multiple supported backends.☆59Updated this week
- RWKV in nanoGPT style☆193Updated last year
- ☆13Updated 8 months ago
- Get down and dirty with FlashAttention2.0 in pytorch, plug in and play no complex CUDA kernels☆108Updated 2 years ago
- This project is to extend RWKV LM's capabilities including sequence classification/embedding/peft/cross encoder/bi encoder/multi modaliti…☆10Updated last year
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆21Updated 2 weeks ago
- continous batching and parallel acceleration for RWKV6☆24Updated last year
- RWKV, in easy to read code☆71Updated 5 months ago
- RWKV-7 mini☆11Updated 5 months ago
- ☆81Updated last year
- ☆10Updated 2 years ago
- ☆125Updated last year
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆44Updated this week
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Updated last year