yynil / RWKV_LM_EXTLinks

This project is to extend RWKV LM's capabilities including sequence classification/embedding/peft/cross encoder/bi encoder/multi modalities, etc.

☆10

Alternatives and similar repositories for RWKV_LM_EXT

Users that are interested in RWKV_LM_EXT are comparing it to the libraries listed below

Sorting:

OpenMOSE / RWKV-Infer
A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…
☆45Updated 3 weeks ago
yynil / RWKVInside
☆39Updated 6 months ago
Joluck / RWKV-PEFT
☆153Updated 2 weeks ago
OpenMOSE / RWKV-LM-RLHF
Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the…
☆55Updated last month
RWKV-Vibe / rwkv-fla
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
☆45Updated 2 months ago
Joluck / WorldRWKV
The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…
☆60Updated 3 weeks ago
RWKV-Vibe / RWKV-LM-V7
RWKV-LM-V7(https://github.com/BlinkDL/RWKV-LM) Under Lightning Framework
☆46Updated 3 weeks ago
howard-hou / RWKV-X
RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…
☆51Updated 4 months ago
Jellyfish042 / RWKV_Othello
A specialized RWKV-7 model for Othello(a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. It…
☆42Updated 9 months ago
Joluck / MiSS
MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…
☆25Updated 3 weeks ago
BlinkDL / modded-nanogpt-rwkv
RWKV-7: Surpassing GPT
☆100Updated last year
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated 6 months ago
yuunnn-w / RWKV_Pytorch
This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…
☆130Updated last year
nanowell / Q-Sparse-LLM
My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆33Updated last year
Triang-jyed-driung / rwkv7mini
RWKV-7 mini
☆11Updated 7 months ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!
☆146Updated last year
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
recursal / RADLADS-paper
RADLADS training code
☆34Updated 6 months ago
cahya-wirawan / rwkv-tokenizer
A fast RWKV Tokenizer written in Rust
☆54Updated 3 months ago
RWKV-Vibe / rwkv-kit
☆22Updated 10 months ago
SmerkyG / RWKV_Explained
RWKV, in easy to read code
☆72Updated 7 months ago
Jellyfish042 / RWKV-StateTuning
State tuning tunes the state
☆35Updated 9 months ago
neromous / RWKV-Ouroboros
This project is established for real-time training of the RWKV model.
☆49Updated last year
chu-tianxiang / QuIP-for-all
QuIP quantization
☆60Updated last year
tridao / flash-attention-wheels
☆57Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45Updated last year
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33Updated last year
kyleliang919 / Super_Muon
☆65Updated 7 months ago
gabrielolympie / moe-pruner
A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size
☆77Updated 2 months ago
SprocketLab / sparse_matrix_fine_tuning
Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"
☆21Updated last month