iantbutler01/rwkv-raven-qlora-4bit-instruct

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iantbutler01/rwkv-raven-qlora-4bit-instruct)

iantbutler01 / rwkv-raven-qlora-4bit-instruct

A finetuning pipeline for instruct tuning Raven 14bn using QLORA 4bit and the Ditty finetuning library

☆27

Alternatives and similar repositories for rwkv-raven-qlora-4bit-instruct

Users that are interested in rwkv-raven-qlora-4bit-instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

iantbutler01 / ditty
View on GitHub
A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.
☆16Jun 10, 2026Updated last month
Triang-jyed-driung / RWKV-World-Finetune
View on GitHub
Fine-tuning RWKV-World model
☆26Jun 6, 2023Updated 3 years ago
3outeille / GPTQ-for-RWKV
View on GitHub
☆13Jun 3, 2023Updated 3 years ago
jiamingkong / rwkv_reward
View on GitHub
Training a reward model for RLHF using RWKV.
☆15Jun 5, 2023Updated 3 years ago
cryscan / web-rwkv-inspector
View on GitHub
☆12Dec 21, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Abel2076 / json2binidx_tool
View on GitHub
☆81May 15, 2024Updated 2 years ago
ms-KuroNeko / RWKV-Drama
View on GitHub
基于RWKV模型的角色扮演，实际上是个改的妈都不认识的 RWKV_Role_Playing
☆17Aug 17, 2023Updated 2 years ago
PicoCreator / RWKV-LM-LoRA
View on GitHub
RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆10Nov 3, 2023Updated 2 years ago
StarRing2022 / MiniRWKV-4
View on GitHub
实现Blip2RWKV+QFormer的多模态图文对话大模型，使用Two-Step Cognitive Psychology Prompt方法，仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4，ImageBind等图文对话大语言模型，力求以更小的算力和资源实…
☆42Jul 17, 2023Updated 3 years ago
Blealtan / RWKV-LM-LoRA
View on GitHub
RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆412Jul 11, 2023Updated 3 years ago
neromous / RWKV-Ouroboros
View on GitHub
This project is established for real-time training of the RWKV model.
☆48May 17, 2024Updated 2 years ago
cryscan / web-rwkv
View on GitHub
Implementation of the RWKV language model in pure WebGPU/Rust.
☆357Jun 1, 2026Updated last month
jiamingkong / RWKV_chains
View on GitHub
Enhancing LangChain prompts to work better with RWKV models
☆34May 30, 2023Updated 3 years ago
Lagrang / art-rs
View on GitHub
The Adaptive Radix Tree for Rust
☆17Jan 18, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
the-crypt-keeper / llm-webapps
View on GitHub
jQuery, React and Streamlit applications written by LLMs
☆16Dec 24, 2023Updated 2 years ago
LeoLin4258 / rwkvcn-docs
View on GitHub
Official Chinese documentation for RWKV | RWKV官方中文文档
☆15Jun 10, 2026Updated last month
mao-test-h / GPUDokaben
View on GitHub
【Unity】ComputeShader + GPUInstancingで大量のドカベンOPのロゴアニメーションを動かしてみた
☆11Mar 5, 2018Updated 8 years ago
Smidgens / unity-context-grapher
View on GitHub
Context menu plugin for Unity3D
☆13May 5, 2019Updated 7 years ago
OpenMOSE / RWKV5-LM-LoRA
View on GitHub
RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …
☆13Mar 24, 2024Updated 2 years ago
Triang-jyed-driung / RWKV-LM-RLHF-DPO
View on GitHub
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
☆11Mar 1, 2024Updated 2 years ago
QuiNovas / sqs-extended-client
View on GitHub
AWS SQS extended client functionality from amazon-sqs-java-extended-client-lib
☆15Nov 17, 2023Updated 2 years ago
mohrezaei / thincollections
View on GitHub
Alternate implementations of vector/map/set for Rust
☆15Apr 27, 2023Updated 3 years ago
patrickt / tactics
View on GitHub
Deep-embedded combinators for strategic rewriting.
☆15Nov 24, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
jeffvella / NativeFasterDictionary
View on GitHub
An experiment to write a faster/native version of SveltoECS's FasterDictionary
☆17Apr 17, 2019Updated 7 years ago
RWKV / RWKV-cpp-node
View on GitHub
Node.js implementation binding for the RWKV.cpp module
☆22Aug 2, 2023Updated 2 years ago
jhabc1314 / jackdou-chinamap
View on GitHub
laravel 中国地图web Api集合
☆13Apr 27, 2023Updated 3 years ago
lijiaqi0612 / UIE-ACL-310
View on GitHub
有一个通用实体关系事件抽取的任务，需要使用到UIE模框架，而且需要将起部署到昇腾310服务器上，因为UIE模型底层使用的是ernie3.0，但是目前paddle官方还不支持ernie3.0模型在昇腾310上部署，所以才有了以下的操作，主要过程是，先试用paddle训练处模型…
☆21Aug 1, 2022Updated 3 years ago
BismuthCloud / faas
View on GitHub
Container orchestrator and platform services
☆12Aug 13, 2024Updated last year
rmihaylov / mpttune
View on GitHub
Tune MPTs
☆84Jun 17, 2023Updated 3 years ago
sp5wwp / OpenACELP
View on GitHub
Free ACELP vocoder
☆16Sep 20, 2024Updated last year
iantbutler01 / dart
View on GitHub
Disk backed concurrent ART implementation, with optional generations.
☆14Nov 8, 2023Updated 2 years ago
orangetwo / BERT-FLAT
View on GitHub
bert-flat 简化版添加了很多注释
☆15Nov 25, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TyounanMOTI / UnityWindowCapture
View on GitHub
Window capture plugin for Unity
☆14Nov 18, 2016Updated 9 years ago
andremussche / SemanticMergeDelphi
View on GitHub
Pascal parser for PlasticSCM SemanticMerge tool
☆15Sep 30, 2013Updated 12 years ago
ArEnSc / Production-RWKV
View on GitHub
This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…
☆64May 14, 2023Updated 3 years ago
sarkahn / unityecstetristest
View on GitHub
Trying to make Tetris with Unity's ECS system
☆14May 19, 2019Updated 7 years ago
czp3009 / crc32-crack
View on GitHub
crc32 cracker for JVM
☆12Jul 8, 2021Updated 5 years ago
NoiRC256 / unity-mmd-utils
View on GitHub
(Legacy) Utility and stagework scripts for MMD-style rendering workflow in Unity.
☆13May 2, 2022Updated 4 years ago
Clydingus / Paraphrase-OPT
View on GitHub
Observe the slow deterioration of my mental sanity in the github commit history
☆12May 31, 2023Updated 3 years ago