ssbuild/rwkv_finetuning

ssbuild / rwkv_finetuning

rwkv finetuning

☆37

Alternatives and similar repositories for rwkv_finetuning

Users who are interested in rwkv_finetuning are comparing it to the libraries listed below.

neromous / RWKV-Ouroboros
This project is established for real-time training of the RWKV model.
☆48 · May 17, 2024 · Updated 2 years ago
iantbutler01 / rwkv-raven-qlora-4bit-instruct
A finetuning pipeline for instruct tuning Raven 14bn using QLoRA 4-bit and the Ditty finetuning library
☆27 · Jun 5, 2024 · Updated 2 years ago
00ffcc / chunkRWKV6
Continuous batching and parallel acceleration for RWKV6
☆23 · Jun 28, 2024 · Updated 2 years ago
ms-KuroNeko / RWKV-Drama
Role-playing based on the RWKV model; in practice, a fork of RWKV_Role_Playing modified beyond recognition
☆17 · Aug 17, 2023 · Updated 2 years ago
cryscan / web-rwkv-inspector
☆12 · Dec 21, 2024 · Updated last year
Blealtan / RWKV-LM-LoRA
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆412 · Jul 11, 2023 · Updated 3 years ago
LeoLin4258 / Infofusion
A 100% locally run AI web tool for generating WeChat replies using the RWKV runner
☆10 · Oct 29, 2024 · Updated last year
3outeille / GPTQ-for-RWKV
☆13 · Jun 3, 2023 · Updated 3 years ago
Joluck / RWKV-PEFT
☆183 · Jan 13, 2026 · Updated 6 months ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond!
☆148 · Aug 13, 2024 · Updated last year
RWKV / RWKV-cpp-node
Node.js implementation binding for the RWKV.cpp module
☆22 · Aug 2, 2023 · Updated 2 years ago
RWKV / rwkv-onnx
A converter and basic tester for rwkv onnx
☆44 · Jan 29, 2024 · Updated 2 years ago
PicoCreator / RWKV-LM-LoRA
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆10 · Nov 3, 2023 · Updated 2 years ago
shengxia / RWKV_Role_Playing_API
A Flask-based API for the RWKV_Role_Playing project.
☆32 · Jun 26, 2024 · Updated 2 years ago
Triang-jyed-driung / RWKV-LM-RLHF-DPO
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
☆11 · Mar 1, 2024 · Updated 2 years ago
ssbuild / aigc_data
share data, prompt data, pretraining data
☆36 · Nov 30, 2023 · Updated 2 years ago
AXKuhta / rwkv-onnx-dml
Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…
☆21 · Mar 16, 2023 · Updated 3 years ago
deepglint / RWKV-CLIP
[EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner
☆151 · Dec 14, 2025 · Updated 7 months ago
johanwind / wind_rwkv
☆27 · Feb 26, 2026 · Updated 4 months ago
Ai00-X / ai00_server
The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.
☆619 · Jun 9, 2026 · Updated last month
shoumenchougou / Awesome-RWKV-Prompts
Awesome RWKV Prompts: user-friendly, ready-to-use RWKV prompt examples for all users.
☆34 · Apr 13, 2026 · Updated 3 months ago
cryscan / web-rwkv
Implementation of the RWKV language model in pure WebGPU/Rust.
☆357 · Jun 1, 2026 · Updated last month
briansemrau / MIDI-LLM-tokenizer
Tools for converting .mid files into text for training large language models
☆101 · Dec 13, 2023 · Updated 2 years ago
Jellyfish042 / RWKV-15Puzzle
☆12 · Dec 14, 2024 · Updated last year
codekansas / rwkv
RWKV model implementation
☆37 · Jul 15, 2023 · Updated 3 years ago
MoZeWei / moTuner
☆10 · May 12, 2022 · Updated 4 years ago
daquexian / faster-rwkv
☆126 · Dec 15, 2023 · Updated 2 years ago
MetaLearners / Solution-to-CVPR2021-NAS-competition-Track-1
☆12 · Jun 28, 2021 · Updated 5 years ago
howard-hou / VisualRWKV
VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.
☆246 · Jan 13, 2026 · Updated 6 months ago
seasonjs / rwkv
Pure Go implementation of RWKV
☆18 · Dec 31, 2023 · Updated 2 years ago
iantbutler01 / ditty
A library for simplifying training with multi-GPU setups in the HuggingFace / PyTorch ecosystem.
☆16 · Jun 10, 2026 · Updated last month
xdsopl / fft
Mixed-Radix DIT FFT in C++11
☆13 · Aug 27, 2018 · Updated 7 years ago
lukasVierling / FaceRWKV
Course Project for COMP4471 on RWKV
☆17 · Feb 11, 2024 · Updated 2 years ago
SynthiaDL / TrainChatGalRWKV
☆42 · Mar 14, 2024 · Updated 2 years ago
harrisonvanderbyl / rwkvstic
Framework-agnostic Python runtime for RWKV models
☆147 · Aug 24, 2023 · Updated 2 years ago
shengxia / RWKV_Role_Playing
An RWKV-based role-playing web UI built with Gradio
☆249 · Mar 5, 2025 · Updated last year
zhiqicheng / DB-LSTM
Code for paper "DB-LSTM: Densely-Connected Bi-directional LSTM for Human Action Recognition"
☆13 · Jul 1, 2022 · Updated 4 years ago
pjeena / Traffic-Flow-Prediction-in-the-city-of-Paris-using-LSTM
This app forecasts the live traffic for the next 3 hours in the famous streets of Paris. Additionally, it also provides statistics for th…
☆13 · Jul 16, 2024 · Updated 2 years ago
hyperf / http-server
☆10 · Jun 7, 2026 · Updated last month