Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset
☆31Apr 2, 2023Updated 2 years ago
Alternatives and similar repositories for RWKV-finetune-script
Users that are interested in RWKV-finetune-script are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fine-tuning RWKV-World model☆26Jun 6, 2023Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- Flask server for RWKV☆10Apr 3, 2023Updated 2 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆413Jul 11, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A finetuning pipeline for instruct tuning Raven 14bn using QLORA 4bit and the Ditty finetuning library☆28Jun 5, 2024Updated last year
- ☆81May 15, 2024Updated last year
- ☆17Aug 1, 2023Updated 2 years ago
- A converter and basic tester for rwkv onnx☆43Jan 29, 2024Updated 2 years ago
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…☆94Sep 2, 2023Updated 2 years ago
- ☆27Feb 26, 2026Updated 3 weeks ago
- Easily deploy your rwkv model☆19May 5, 2023Updated 2 years ago
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆23Apr 24, 2024Updated last year
- ☆44Mar 29, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A QQ Chatbot based on RWKV (W.I.P.)☆80Nov 16, 2023Updated 2 years ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 3 years ago
- 一个简单的,由ChatGPT主导编写的api,使用简单的请求访问ChatRWKV☆15May 19, 2023Updated 2 years ago
- Audio Speech Segmentation Tool for RVC☆15May 15, 2023Updated 2 years ago
- A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…☆312Jan 31, 2024Updated 2 years ago
- ☆20Mar 28, 2023Updated 2 years ago
- Go framework for language model-powered applications with composability and chaining. Inspired by LangChain.☆13May 2, 2023Updated 2 years ago
- Penrose tiling generator☆17Jan 5, 2022Updated 4 years ago
- JAX implementations of RWKV☆19Sep 26, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 4-bit quantization of models using GPTQ☆18Mar 6, 2023Updated 3 years ago
- ☆32Jul 20, 2024Updated last year
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- a.k.a autoMBW-V2☆28Mar 26, 2024Updated last year
- ☆32Mar 30, 2023Updated 2 years ago
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- ☆11Oct 11, 2023Updated 2 years ago
- Gradio UI for RWKV LLM☆27Feb 21, 2023Updated 3 years ago
- RWKV centralised docs for the community☆32Jan 17, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Aug 13, 2024Updated last year
- Merge multi-track MIDI sequence into a single track for further processing☆12Nov 4, 2020Updated 5 years ago
- ☆12Dec 14, 2024Updated last year
- rwkv finetuning☆37Apr 22, 2024Updated last year
- test images with not appropriate labels in MNIST dataset☆10Mar 3, 2018Updated 8 years ago
- GoldFinch and other hybrid transformer components☆12Dec 9, 2025Updated 3 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago