OpenMOSE/RWKV-LM-RLHF

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenMOSE/RWKV-LM-RLHF)

OpenMOSE / RWKV-LM-RLHF

Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the possibilities for deeper fine-tuning of RWKV.

☆64

Alternatives and similar repositories for RWKV-LM-RLHF

Users that are interested in RWKV-LM-RLHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenMOSE / RWKV-Infer
View on GitHub
A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…
☆51Oct 21, 2025Updated 9 months ago
Joluck / RWKV-PEFT
View on GitHub
☆183Jan 13, 2026Updated 6 months ago
LeoLin4258 / rwkvcn-docs
View on GitHub
Official Chinese documentation for RWKV | RWKV官方中文文档
☆15Jun 10, 2026Updated last month
RWKV-Vibe / RWKV-LM-V7
View on GitHub
RWKV-LM-V7(https://github.com/BlinkDL/RWKV-LM) Under Lightning Framework
☆62May 13, 2026Updated 2 months ago
yynil / RWKVInside
View on GitHub
☆41Apr 30, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zyaaa-ux / ROSA-Tuning
View on GitHub
ROSA-Tuning
☆74Feb 4, 2026Updated 5 months ago
RWKV-Vibe / rwkv_lightning
View on GitHub
RWKV Batch infer backend ⚡Base on albatross https://github.com/BlinkDL/Albatross 🕊️
☆36Updated this week
SmerkyG / GoldFinch-paper
View on GitHub
GoldFinch and other hybrid transformer components
☆16Dec 9, 2025Updated 7 months ago
wjie98 / rosa_soft
View on GitHub
Softened ROSA QKV Operators for Training Next-Generation LLM Models
☆39Jun 26, 2026Updated 3 weeks ago
Joluck / mod-rwkv
View on GitHub
The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…
☆69Mar 18, 2026Updated 4 months ago
shoumenchougou / Awesome-RWKV-Prompts
View on GitHub
用户友好、开箱即用的 RWKV Prompts 示例，适用于所有用户。Awesome RWKV Prompts for general users, more user-friendly, ready-to-use prompt examples.
☆34Apr 13, 2026Updated 3 months ago
Jellyfish042 / RWKV_Othello
View on GitHub
A specialized RWKV-7 model for Othello(a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. It…
☆44Jan 25, 2025Updated last year
Prunoideae / rwkv-contrib
View on GitHub
☆10Aug 18, 2023Updated 2 years ago
AIIRWKV / RWKV-RAG
View on GitHub
RAG SYSTEM FOR RWKV
☆53Dec 4, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ms-KuroNeko / RWKV-Drama
View on GitHub
基于RWKV模型的角色扮演，实际上是个改的妈都不认识的 RWKV_Role_Playing
☆17Aug 17, 2023Updated 2 years ago
RWKV / RWKV-infctx-trainer
View on GitHub
RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!
☆148Aug 13, 2024Updated last year
recursal / minmodmon
View on GitHub
Mini Model Daemon
☆13Nov 9, 2024Updated last year
johanwind / wind_rwkv
View on GitHub
☆27Feb 26, 2026Updated 4 months ago
AGENDD / RWKV-SpeechChat
View on GitHub
RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …
☆29Jan 1, 2025Updated last year
cahya-wirawan / rwkv-tokenizer
View on GitHub
A fast RWKV Tokenizer written in Rust
☆53Aug 12, 2025Updated 11 months ago
howard-hou / RWKV-X
View on GitHub
RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…
☆59Mar 31, 2026Updated 3 months ago
Triang-jyed-driung / RWKV-LM-RLHF-DPO
View on GitHub
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
☆11Mar 1, 2024Updated 2 years ago
MollySophia / rwkv-qualcomm
View on GitHub
Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK
☆94Jun 8, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
recursal / GoldFinch-paper
View on GitHub
GoldFinch and other hybrid transformer components
☆46Jul 20, 2024Updated 2 years ago
RWKV / RWKV-wiki
View on GitHub
RWKV centralised docs for the community
☆35Jan 17, 2026Updated 6 months ago
yynil / RWKVTTS
View on GitHub
This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).
☆101Oct 8, 2025Updated 9 months ago
Joluck / MiSS
View on GitHub
MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…
☆35Mar 9, 2026Updated 4 months ago
ancorasir / DecisionRWKV
View on GitHub
☆20Aug 1, 2024Updated last year
Ourboros-Alignment-Team / RWKV-Development-Tools
View on GitHub
☆14May 11, 2025Updated last year
Ai00-X / ai00_server
View on GitHub
The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.
☆619Jun 9, 2026Updated last month
recursal / RADLADS-paper
View on GitHub
RADLADS training code
☆46May 7, 2025Updated last year
playaswd / rwkv-by-hand-excel
View on GitHub
This project demonstrates the computation process of the RWKV (Receptance Weighted Key Value) model through Excel spreadsheets.
☆21Jun 7, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
leonsama / web-rwkv-realweb
View on GitHub
☆12Feb 20, 2026Updated 5 months ago
Triang-jyed-driung / RWKV-World-Finetune
View on GitHub
Fine-tuning RWKV-World model
☆26Jun 6, 2023Updated 3 years ago
Abel2076 / json2binidx_tool
View on GitHub
☆81May 15, 2024Updated 2 years ago
Jellyfish042 / uncheatable_eval
View on GitHub
Evaluating LLMs with Dynamic Data
☆116May 9, 2026Updated 2 months ago
dymat / rwkv-burn
View on GitHub
A port of the RWKV v7 language model, implemented with the Burn deep learning framework
☆14Jun 9, 2025Updated last year
SmerkyG / gptcore
View on GitHub
Fast modular code to create and train cutting edge LLMs
☆67May 16, 2024Updated 2 years ago
Jellyfish042 / RWKV-StateTuning
View on GitHub
State tuning tunes the state
☆35Feb 12, 2025Updated last year