Triang-jyed-driung / RWKV-LM-RLHF-DPO
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
☆11Mar 1, 2024Updated last year
Alternatives and similar repositories for RWKV-LM-RLHF-DPO
Users who are interested in RWKV-LM-RLHF-DPO are comparing it to the libraries listed below.
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- ☆13Dec 21, 2024Updated last year
- ☆13Jun 3, 2023Updated 2 years ago
- A Flask-based API for the RWKV_Role_Playing project.☆31Jun 26, 2024Updated last year
- A simple API, written mainly by ChatGPT, that accesses ChatRWKV through simple requests☆15May 19, 2023Updated 2 years ago
- ☆171Jan 13, 2026Updated last month
- ☆17Jan 1, 2025Updated last year
- Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the…☆62Sep 19, 2025Updated 4 months ago
- RWKV models and examples powered by candle.☆24Jan 19, 2026Updated 3 weeks ago
- ☆27Jul 28, 2025Updated 6 months ago
- Continuous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- A fast RWKV Tokenizer written in Rust☆54Aug 12, 2025Updated 6 months ago
- Fine-tuning RWKV-World model☆26Jun 6, 2023Updated 2 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- RWKV centralised docs for the community☆31Jan 17, 2026Updated last month
- Fast modular code to create and train cutting edge LLMs☆68May 16, 2024Updated last year
- ☆34Jul 21, 2024Updated last year
- ☆41Apr 30, 2025Updated 9 months ago
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner☆154Dec 14, 2025Updated 2 months ago
- rwkv finetuning☆37Apr 22, 2024Updated last year
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆48Aug 22, 2025Updated 5 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 5, 2026Updated last week
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Test images with inappropriate labels in the MNIST dataset☆10Mar 3, 2018Updated 7 years ago
- Network Etiquette (Netiquette) -- Written with 2020 technology in mind☆10Nov 19, 2021Updated 4 years ago
- Chatbot that answers frequently asked questions in French, English, and Tunisian using the Rasa NLU framework and RWKV-4-Raven☆13May 19, 2023Updated 2 years ago
- ☆12Nov 6, 2023Updated 2 years ago
- readthedocs.org documentation for Inkplate boards☆10Aug 25, 2025Updated 5 months ago
- Tools for converting .mid files into text for training large language models☆100Dec 13, 2023Updated 2 years ago
- Evaluating LLMs with Dynamic Data☆111Feb 11, 2026Updated last week
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- A Python library for graph coloring☆12Nov 12, 2025Updated 3 months ago
- The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…☆66Dec 15, 2025Updated 2 months ago
- This repo contains the source files of the notes on analysis as taught at ELTE IK.☆16Oct 27, 2017Updated 8 years ago
- The Word Embedding Database API☆11Aug 20, 2019Updated 6 years ago
- Minimal Kréta client written in Python.☆11Oct 7, 2023Updated 2 years ago
- An interactive story app for Android . . .☆15Dec 14, 2014Updated 11 years ago
- Training a BERT model from scratch.☆11Oct 15, 2023Updated 2 years ago
- ☆12May 23, 2024Updated last year