rwkv finetuning
☆37Apr 22, 2024Updated 2 years ago
Alternatives and similar repositories for rwkv_finetuning
Users that are interested in rwkv_finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A finetuning pipeline for instruct tuning Raven 14bn using QLORA 4bit and the Ditty finetuning library☆28Jun 5, 2024Updated last year
- 基于RWKV模型的角色扮演,实际上是个改的妈都不认识的 RWKV_Role_Playing☆17Aug 17, 2023Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆413Jul 11, 2023Updated 2 years ago
- ☆176Jan 13, 2026Updated 3 months ago
- Node.js implementation binding for the RWKV.cpp module☆21Aug 2, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆14May 11, 2025Updated 11 months ago
- 一个基于Flask实现的RWKV_Role_Playing项目的API。☆32Jun 26, 2024Updated last year
- A 100% locally run AI web tool for generating WeChat replies using the RWKV runner☆10Oct 29, 2024Updated last year
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 3 years ago
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner☆152Dec 14, 2025Updated 4 months ago
- share data, prompt data , pretraining data☆36Nov 30, 2023Updated 2 years ago
- A QQ Chatbot based on RWKV (W.I.P.)☆80Nov 16, 2023Updated 2 years ago
- ☆21Oct 29, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.☆610Feb 22, 2026Updated 2 months ago
- 用户友好、开箱即用的 RWKV Prompts 示例,适用于所有用户。Awesome RWKV Prompts for general users, more user-friendly, ready-to-use prompt examples.☆34Apr 13, 2026Updated 3 weeks ago
- Implementation of the RWKV language model in pure WebGPU/Rust.☆348Apr 1, 2026Updated last month
- ☆12Dec 14, 2024Updated last year
- Tools for converting .mid files into text for training large language models☆103Dec 13, 2023Updated 2 years ago
- 🎉 TrustJudge is accepted to ICLR 2026!☆46Sep 27, 2025Updated 7 months ago
- RWKV model implementation☆37Jul 15, 2023Updated 2 years ago
- RWKV centralised docs for the community☆32Jan 17, 2026Updated 3 months ago
- 一个用Apple Metal实现的Llama和通义千问大模型本地推理☆10Apr 26, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- ☆10May 12, 2022Updated 3 years ago
- ☆10Aug 18, 2023Updated 2 years ago
- ☆12Jun 28, 2021Updated 4 years ago
- Easily deploy your rwkv model☆19May 5, 2023Updated 3 years ago
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.☆247Jan 13, 2026Updated 3 months ago
- pure go for rwkv☆18Dec 31, 2023Updated 2 years ago
- ☆18Sep 27, 2022Updated 3 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Apr 27, 2026Updated last week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This project demonstrates the computation process of the RWKV (Receptance Weighted Key Value) model through Excel spreadsheets.☆22Jun 7, 2025Updated 10 months ago
- ☆16Jul 12, 2024Updated last year
- Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset☆31Apr 2, 2023Updated 3 years ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- ☆41Mar 14, 2024Updated 2 years ago
- Contains the code for the paper "Multi-Horizon Short-Term Load Forecasting Using Hybrid of LSTM and Modified Split Convolution"☆11Oct 28, 2023Updated 2 years ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆49Oct 21, 2025Updated 6 months ago