Jellyfish042 / RWKV-StateTuningLinks

State tuning tunes the state

☆35

Alternatives and similar repositories for RWKV-StateTuning

Users that are interested in RWKV-StateTuning are comparing it to the libraries listed below

Sorting:

Joluck / RWKV-PEFT
☆150Updated this week
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!
☆147Updated last year
OpenMOSE / RWKV-Infer
A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…
☆45Updated last week
BlinkDL / modded-nanogpt-rwkv
RWKV-7: Surpassing GPT
☆98Updated 11 months ago
RWKV-Vibe / RWKV-LM-V7
RWKV-LM-V7(https://github.com/BlinkDL/RWKV-LM) Under Lightning Framework
☆46Updated 3 months ago
SmerkyG / gptcore
Fast modular code to create and train cutting edge LLMs
☆68Updated last year
yynil / RWKVInside
☆38Updated 5 months ago
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated 5 months ago
Joluck / WorldRWKV
The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…
☆58Updated last week
BlinkDL / nanoRWKV
RWKV in nanoGPT style
☆193Updated last year
OpenMOSE / RWKV-LM-RLHF
Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the…
☆54Updated last month
Jellyfish042 / uncheatable_eval
Evaluating LLMs with Dynamic Data
☆96Updated 3 months ago
Triang-jyed-driung / rwkv7mini
RWKV-7 mini
☆11Updated 6 months ago
SmerkyG / RWKV_Explained
RWKV, in easy to read code
☆72Updated 7 months ago
neromous / RWKV-Ouroboros
This project is established for real-time training of the RWKV model.
☆49Updated last year
howard-hou / RWKV-X
RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…
☆50Updated 3 months ago
Jellyfish042 / RWKV_Othello
A specialized RWKV-7 model for Othello(a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. It…
☆42Updated 9 months ago
proger / hippogriff
Griffin MQA + Hawk Linear RNN Hybrid
☆89Updated last year
Jellyfish042 / LongMamba
Some preliminary explorations of Mamba's context scaling.
☆14Updated 10 months ago
Abel2076 / json2binidx_tool
☆81Updated last year
yynil / RWKVinLLAMA
☆17Updated 9 months ago
BBuf / RWKV-World-HF-Tokenizer
☆34Updated last year
Jellyfish042 / Sudoku-RWKV
☆147Updated 11 months ago
recursal / RADLADS-paper
RADLADS training code
☆29Updated 5 months ago
Joluck / MiSS
MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…
☆25Updated last month
yynil / RWKV_LM_EXT
This project is to extend RWKV LM's capabilities including sequence classification/embedding/peft/cross encoder/bi encoder/multi modaliti…
☆10Updated last year
BlinkDL / LinearAttentionArena
Here we will test various linear attention designs.
☆61Updated last year
johanwind / wind_rwkv
☆26Updated 3 months ago
howard-hou / VisualRWKV
VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.
☆237Updated 4 months ago
nanowell / Q-Sparse-LLM
My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆33Updated last year