RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
☆13Mar 24, 2024Updated 2 years ago
Alternatives and similar repositories for RWKV5-LM-LoRA
Users that are interested in RWKV5-LM-LoRA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Aug 23, 2024Updated last year
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15May 20, 2026Updated last week
- A program that allows you to chat on VRChat using ChatGPT.☆15Mar 22, 2023Updated 3 years ago
- ROSA-Tuning☆73Feb 4, 2026Updated 3 months ago
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Aug 13, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A list of the top 3 million+ English words in Project Gutenberg, along with their frequency.☆22Oct 26, 2020Updated 5 years ago
- RWKV centralised docs for the community☆33Jan 17, 2026Updated 4 months ago
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- ☆81May 15, 2024Updated 2 years ago
- High resolution image classifier. An expansion of the ResNet50 architecture to allow for high resolution inputs (448, 896, 1792 sq.px.)☆16Mar 27, 2023Updated 3 years ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- ☆12Dec 21, 2024Updated last year
- A 20M RWKV v6 can do nonogram☆13Oct 18, 2024Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- BERT score for text generation☆12Jan 15, 2025Updated last year
- Cute layout visualization☆39Jan 18, 2026Updated 4 months ago
- ☆14Nov 26, 2023Updated 2 years ago
- ☆179Jan 13, 2026Updated 4 months ago
- The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…☆67Mar 18, 2026Updated 2 months ago
- ☆19Sep 29, 2024Updated last year
- Inference RWKV v7 in pure C.☆44Oct 10, 2025Updated 7 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- ☆69Mar 21, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆37Feb 21, 2026Updated 3 months ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆22Oct 14, 2025Updated 7 months ago
- Explainable vision transformer for automatic visual sleep staging on multimodal PSG signals☆20Dec 23, 2024Updated last year
- The code of "Learning Crisp Boundaries Using Deep Refinement Network and Adaptive Weighting Loss"☆12Feb 1, 2021Updated 5 years ago
- A reproduction of Eulerian Video Magnification for Revealing Subtle Changes in the World☆13Jan 23, 2022Updated 4 years ago
- Kakao Mobility MCP Server for directions and transit information☆11Sep 14, 2025Updated 8 months ago
- ☆47Mar 30, 2026Updated last month
- ☆17Jan 1, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆14Apr 10, 2024Updated 2 years ago
- ☆12Feb 20, 2026Updated 3 months ago
- A web app to experiment with chained prompts faster.☆16Mar 15, 2023Updated 3 years ago
- A minimal, educational implementation of a agent memory system inspired by mem0☆25Jul 16, 2025Updated 10 months ago
- AIペルソナが生活し、記憶を蓄積し、自律的に行動するためのアプリケーション。 An application for an AI persona to live, accumulate memories, and act autonomously.☆45May 12, 2026Updated 2 weeks ago
- ☆22Jul 30, 2021Updated 4 years ago
- Ultra-high-resolution CO2 thermophysical property calculation program☆15Mar 26, 2024Updated 2 years ago