RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
☆13Mar 24, 2024Updated 2 years ago
Alternatives and similar repositories for RWKV5-LM-LoRA
Users that are interested in RWKV5-LM-LoRA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Aug 23, 2024Updated last year
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15Feb 20, 2026Updated last month
- ROSA-Tuning☆71Feb 4, 2026Updated last month
- A program that allows you to chat on VRChat using ChatGPT.☆15Mar 22, 2023Updated 3 years ago
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Aug 13, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A list of the top 3 million+ English words in Project Gutenberg, along with their frequency.☆19Oct 26, 2020Updated 5 years ago
- RWKV centralised docs for the community☆32Jan 17, 2026Updated 2 months ago
- High resolution image classifier. An expansion of the ResNet50 architecture to allow for high resolution inputs (448, 896, 1792 sq.px.)☆16Mar 27, 2023Updated 2 years ago
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- ☆81May 15, 2024Updated last year
- Cute layout visualization☆33Jan 18, 2026Updated 2 months ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- ☆13Dec 21, 2024Updated last year
- A 20M RWKV v6 can do nonogram☆14Oct 18, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- ☆15Nov 26, 2023Updated 2 years ago
- ☆179Jan 13, 2026Updated 2 months ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆27Feb 21, 2026Updated last month
- The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…☆66Mar 18, 2026Updated last week
- ☆68Mar 21, 2025Updated last year
- ☆18Sep 29, 2024Updated last year
- Inference RWKV v7 in pure C.☆44Oct 10, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆22Oct 14, 2025Updated 5 months ago
- Explainable vision transformer for automatic visual sleep staging on multimodal PSG signals☆18Dec 23, 2024Updated last year
- ☆46Updated this week
- The code of "Learning Crisp Boundaries Using Deep Refinement Network and Adaptive Weighting Loss"☆12Feb 1, 2021Updated 5 years ago
- A reproduction of Eulerian Video Magnification for Revealing Subtle Changes in the World☆13Jan 23, 2022Updated 4 years ago
- Kakao Mobility MCP Server for directions and transit information☆10Sep 14, 2025Updated 6 months ago
- ☆17Jan 1, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆13Feb 20, 2026Updated last month
- ☆14Apr 10, 2024Updated last year
- A web app to experiment with chained prompts faster.☆16Mar 15, 2023Updated 3 years ago
- A minimal, educational implementation of a agent memory system inspired by mem0☆26Jul 16, 2025Updated 8 months ago
- ☆21Jul 30, 2021Updated 4 years ago
- SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL)☆16Jul 27, 2024Updated last year
- Ultra-high-resolution CO2 thermophysical property calculation program☆15Mar 26, 2024Updated 2 years ago