Durham/RWKV-finetune-script

Script and instructions on how to fine-tune a large RWKV model on your data for the Alpaca dataset

☆31

Alternatives and similar repositories for RWKV-finetune-script

Users who are interested in RWKV-finetune-script are comparing it to the libraries listed below.

RafaRed / RWKV-api
Flask server for RWKV
☆10 · Apr 3, 2023 · Updated 3 years ago
Blealtan / RWKV-LM-LoRA
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆412 · Jul 11, 2023 · Updated 3 years ago
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆147 · Aug 24, 2023 · Updated 2 years ago
Abel2076 / json2binidx_tool
☆81 · May 15, 2024 · Updated 2 years ago
iantbutler01 / rwkv-raven-qlora-4bit-instruct
A finetuning pipeline for instruct tuning Raven 14bn using QLORA 4bit and the Ditty finetuning library
☆27 · Jun 5, 2024 · Updated 2 years ago
seung7361 / RWKV-Pytorch
☆17 · Aug 1, 2023 · Updated 2 years ago
johanwind / wind_rwkv
☆27 · Feb 26, 2026 · Updated 4 months ago
yuk-krhs / StickFigure
☆11 · Feb 15, 2023 · Updated 3 years ago
leia-llm / leia
LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation
☆23 · Apr 24, 2024 · Updated 2 years ago
mrsteyk / RWKV-LM-deepspeed
☆44 · Mar 29, 2023 · Updated 3 years ago
donomii / go-rwkv.cpp
A go wrapper around the rwkv.cpp library
☆20 · Mar 4, 2024 · Updated 2 years ago
cryscan / eloise
A QQ Chatbot based on RWKV (W.I.P.)
☆80 · Nov 16, 2023 · Updated 2 years ago
t4wefan / ChatRWKV-flask-api
A simple API, written mainly by ChatGPT, for accessing ChatRWKV with simple requests
☆15 · May 19, 2023 · Updated 3 years ago
harrisonvanderbyl / rwkv-cpp-accelerated
A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…
☆313 · Jan 31, 2024 · Updated 2 years ago
laksjdjf / pfg
☆20 · Mar 28, 2023 · Updated 3 years ago
RWKV / rwkv.cpp
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
☆1,580 · Mar 23, 2025 · Updated last year
gururise / rwkv_gradio
Gradio UI for RWKV LLM
☆27 · Feb 21, 2023 · Updated 3 years ago
aria1th / sd-webui-deepcache-standalone
☆32 · Jul 20, 2024 · Updated 2 years ago
mrsteyk / rwkvk-rs
☆32 · Mar 30, 2023 · Updated 3 years ago
recursal / minmodmon
Mini Model Daemon
☆13 · Nov 9, 2024 · Updated last year
BlinkDL / WorldModel
Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…
☆40 · Apr 9, 2023 · Updated 3 years ago
RWKV / RWKV-wiki
RWKV centralised docs for the community
☆35 · Jan 17, 2026 · Updated 6 months ago
m13253 / midi-track-merge
Merge multi-track MIDI sequence into a single track for further processing
☆12 · Nov 4, 2020 · Updated 5 years ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond!
☆149 · Aug 13, 2024 · Updated last year
ssbuild / rwkv_finetuning
rwkv finetuning
☆37 · Apr 22, 2024 · Updated 2 years ago
Jellyfish042 / RWKV-15Puzzle
☆12 · Dec 14, 2024 · Updated last year
KanHatakeyama / JapaneseWarcParser
☆16 · Mar 4, 2024 · Updated 2 years ago
jiamingkong / RWKV_chains
Enhancing LangChain prompts to work better with RWKV models
☆34 · May 30, 2023 · Updated 3 years ago
Anwesh43 / StoryView
An Android library to create stories in your app
☆11 · Mar 25, 2017 · Updated 9 years ago
imxcstar / RWKVSharp
Call RWKV v4/v5/v6/v7 (Raven/World/Finch, 1B5-14B) via rwkv.cpp from C# on CPU/GPU (supports INT4, INT8, Float16, Float32)
☆36 · Feb 21, 2025 · Updated last year
shuttie / embed-benchmark
☆16 · Nov 10, 2023 · Updated 2 years ago
Triang-jyed-driung / rwkv7mini
RWKV-7 mini
☆12 · Mar 29, 2025 · Updated last year
shengxia / RWKV_Role_Playing_API
A Flask-based API for the RWKV_Role_Playing project.
☆32 · Jun 26, 2024 · Updated 2 years ago
Triang-jyed-driung / RWKV-LM-RLHF-DPO
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
☆11 · Mar 1, 2024 · Updated 2 years ago
tassiogustavo / flutter_3d
Web application project for displaying and controlling 3D animations
☆15 · Nov 15, 2023 · Updated 2 years ago
SmerkyG / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆16 · Dec 9, 2025 · Updated 7 months ago
RWKV / RWKV-cpp-node
Node.js implementation binding for the RWKV.cpp module
☆22 · Aug 2, 2023 · Updated 2 years ago
nutszebra / SENets
Implementation of SENets by chainer (Squeeze-and-Excitation Networks: https://arxiv.org/abs/1709.01507)
☆15 · Sep 15, 2017 · Updated 8 years ago
StarRing2022 / MiniRWKV-4
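A multimodal image-text dialogue model implementing Blip2RWKV+QFormer. Using the Two-Step Cognitive Psychology Prompt method, a model with only 3B parameters can exhibit human-like causal chains of thought. Benchmarked against image-text dialogue LLMs such as MiniGPT-4 and ImageBind, it strives, with less compute and fewer resources, to…
☆42 · Jul 17, 2023 · Updated 3 years ago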