yuunnn-w / RWKV_Pytorch
This is an inference framework for the RWKV large language model, implemented purely in native PyTorch. The official implementation is overly complex and lacks extensibility. Let's join the flexible PyTorch ecosystem and open-source it together!
☆132 · Updated last year
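For context, below is a minimal sketch of the recurrent inference pattern a pure-PyTorch RWKV runtime follows: one token in, logits plus an updated fixed-size state out. The `DummyRWKV` class, its shapes, and the `generate` helper are illustrative assumptions for this sketch, not the actual RWKV_Pytorch API.

```python
import torch

# Toy stand-in for an RWKV model: names, shapes, and the forward
# signature are illustrative assumptions, not the RWKV_Pytorch API.
VOCAB, DIM = 1000, 64

class DummyRWKV(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.emb = torch.nn.Embedding(VOCAB, DIM)
        self.mix = torch.nn.Linear(DIM * 2, DIM)  # fold previous state into the activation
        self.head = torch.nn.Linear(DIM, VOCAB)

    def forward(self, token, state):
        x = self.emb(token)                                       # (1, DIM)
        h = torch.tanh(self.mix(torch.cat([x, state], dim=-1)))  # next state
        return self.head(h), h                                   # logits, new state

@torch.no_grad()
def generate(model, prompt_ids, n_new):
    state = torch.zeros(1, DIM)
    for t in prompt_ids:  # prefill: feed the prompt one token at a time
        logits, state = model(torch.tensor([t]), state)
    out = list(prompt_ids)
    for _ in range(n_new):  # greedy decoding, constant memory per step
        nxt = int(logits.argmax(dim=-1))
        out.append(nxt)
        logits, state = model(torch.tensor([nxt]), state)
    return out

print(generate(DummyRWKV().eval(), [1, 2, 3], 5))
```

The constant-size state is the point of this pattern: unlike a Transformer's KV cache, per-token memory during decoding stays O(1) regardless of context length.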
Alternatives and similar repositories for RWKV_Pytorch
Users interested in RWKV_Pytorch are comparing it to the libraries listed below.
- ☆157 · Updated 2 weeks ago
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks. ☆237 · Updated 6 months ago
- Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton ☆45 · Updated 3 months ago
- Evaluating LLMs with Dynamic Data ☆99 · Updated 4 months ago
- Reinforcement Learning Toolkit for RWKV (v6, v7, ARWKV): distillation, SFT, RLHF (DPO, ORPO), infinite-context training, aligning. Exploring the… ☆56 · Updated 2 months ago
- RWKV finetuning ☆37 · Updated last year
- ☆13 · Updated 11 months ago
- RAG system for RWKV ☆51 · Updated last year
- This project enables real-time training of the RWKV model. ☆50 · Updated last year
- Awesome RWKV Prompts: user-friendly, ready-to-use prompt examples for general users. ☆35 · Updated 10 months ago
- RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond! ☆147 · Updated last year
- ☆17 · Updated 11 months ago
- Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models ☆338 · Updated 9 months ago
- RWKV-LM-V7 (https://github.com/BlinkDL/RWKV-LM) under the Lightning framework ☆48 · Updated last month
- State tuning tunes the state ☆35 · Updated 9 months ago
- Continuous batching and parallel acceleration for RWKV6 ☆22 · Updated last year
- Low-bit optimizers for PyTorch ☆133 · Updated 2 years ago
- Get down and dirty with FlashAttention 2.0 in PyTorch; plug and play, no complex CUDA kernels ☆112 · Updated 2 years ago
- RWKV in nanoGPT style ☆196 · Updated last year
- A quantization algorithm for LLMs ☆146 · Updated last year
- The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff… ☆60 · Updated last month
- This is a personal reimplementation of Google's Infini-Transformer, utilizing a small 2B model. The project includes both model and train… ☆58 · Updated last year
- ☆62 · Updated last year
- ☆81 · Updated last year
- Mixture-of-Experts (MoE) Language Model ☆192 · Updated last year
- ☆23 · Updated 11 months ago
- A large-scale RWKV v7 (World, PRWKV, Hybrid-RWKV) inference engine, capable of inference by combining multiple states (pseudo-MoE). Easy to deploy… ☆45 · Updated last month
- ☆79 · Updated last year
- Official implementation of TransNormerLLM: A Faster and Better LLM ☆248 · Updated last year
- ☆125 · Updated last year