Triang-jyed-driung / RWKV-LM-RLHF-DPOLinks

Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.

☆11

Alternatives and similar repositories for RWKV-LM-RLHF-DPO

Users that are interested in RWKV-LM-RLHF-DPO are comparing it to the libraries listed below

Sorting:

Joluck / RWKV-PEFT
☆153Updated 3 weeks ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!
☆147Updated last year
Jellyfish042 / uncheatable_eval
Evaluating LLMs with Dynamic Data
☆97Updated 3 months ago
Abel2076 / json2binidx_tool
☆81Updated last year
neromous / RWKV-Ouroboros
This project is established for real-time training of the RWKV model.
☆49Updated last year
OpenMOSE / RWKV-LM-RLHF
Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the…
☆55Updated 2 months ago
BBuf / RWKV-World-HF-Tokenizer
☆34Updated last year
yuunnn-w / RWKV_Pytorch
This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…
☆130Updated last year
shoumenchougou / Awesome-RWKV-Prompts
用户友好、开箱即用的 RWKV Prompts 示例，适用于所有用户。Awesome RWKV Prompts for general users, more user-friendly, ready-to-use prompt examples.
☆36Updated 9 months ago
cryscan / web-rwkv-inspector
☆13Updated 11 months ago
yynil / RWKVinLLAMA
☆17Updated 10 months ago
ssbuild / rwkv_finetuning
rwkv finetuning
☆37Updated last year
RWKV / RWKV-wiki
RWKV centralised docs for the community
☆29Updated 3 months ago
BlinkDL / nanoRWKV
RWKV in nanoGPT style
☆195Updated last year
RWKV-Vibe / rwkv-fla
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
☆45Updated 3 months ago
AGENDD / RWKV-ASR
This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …
☆53Updated 11 months ago
Durham / RWKV-finetune-script
Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset
☆31Updated 2 years ago
OpenMOSE / RWKV-Infer
A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…
☆45Updated last month
ms-KuroNeko / RWKV-Drama
基于RWKV模型的角色扮演，实际上是个改的妈都不认识的 RWKV_Role_Playing
☆17Updated 2 years ago
SmerkyG / gptcore
Fast modular code to create and train cutting edge LLMs
☆68Updated last year
RWKV / rwkv-onnx
A converter and basic tester for rwkv onnx
☆43Updated last year
Prunoideae / rwkv-contrib
☆10Updated 2 years ago
Jellyfish042 / RWKV-StateTuning
State tuning tunes the state
☆35Updated 9 months ago
Triang-jyed-driung / rwkv7mini
RWKV-7 mini
☆11Updated 7 months ago
NolanoOrg / sparse_quant_llms
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
☆41Updated 2 years ago
RyokoAI / BigKnow2022
BigKnow2022: Bringing Language Models Up to Speed
☆16Updated 2 years ago
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆147Updated 2 years ago
cahya-wirawan / rwkv-tokenizer
A fast RWKV Tokenizer written in Rust
☆54Updated 3 months ago
gabrielolympie / moe-pruner
A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size
☆78Updated 2 months ago
yynil / RWKVInside
☆39Updated 6 months ago