rwkv finetuning
☆37Apr 22, 2024Updated last year
Alternatives and similar repositories for rwkv_finetuning
Users that are interested in rwkv_finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A finetuning pipeline for instruct tuning Raven 14bn using QLORA 4bit and the Ditty finetuning library☆28Jun 5, 2024Updated last year
- ☆13Dec 21, 2024Updated last year
- ☆81May 15, 2024Updated last year
- Node.js implementation binding for the RWKV.cpp module☆21Aug 2, 2023Updated 2 years ago
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Aug 13, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Jun 3, 2023Updated 2 years ago
- A converter and basic tester for rwkv onnx☆43Jan 29, 2024Updated 2 years ago
- ☆13May 11, 2025Updated 10 months ago
- 一个基于Flask实现的RWKV_Role_Playing项目的API。☆31Jun 26, 2024Updated last year
- A 100% locally run AI web tool for generating WeChat replies using the RWKV runner☆10Oct 29, 2024Updated last year
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 3 years ago
- A go wrapper around the rwkv.cpp library☆20Mar 4, 2024Updated 2 years ago
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner☆153Dec 14, 2025Updated 3 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆27Feb 26, 2026Updated last month
- A QQ Chatbot based on RWKV (W.I.P.)☆80Nov 16, 2023Updated 2 years ago
- 一个简单的,由ChatGPT主导编写的api,使用简单的请求访问ChatRWKV☆15May 19, 2023Updated 2 years ago
- The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.☆606Feb 22, 2026Updated last month
- 🎉 TrustJudge is accepted to ICLR 2026!☆38Sep 27, 2025Updated 6 months ago
- Implementation of the RWKV language model in pure WebGPU/Rust.☆346Jan 10, 2026Updated 2 months ago
- Tools for converting .mid files into text for training large language models☆102Dec 13, 2023Updated 2 years ago
- 一个用Apple Metal实现的Llama和通义千问大模型本地推理☆10Apr 26, 2024Updated last year
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆10May 12, 2022Updated 3 years ago
- ☆13Jun 28, 2021Updated 4 years ago
- VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.☆244Jan 13, 2026Updated 2 months ago
- 使用Android cpu 运行 RWKV V4 ONNX☆69Aug 1, 2023Updated 2 years ago
- Easily deploy your rwkv model☆19May 5, 2023Updated 2 years ago
- This project demonstrates the computation process of the RWKV (Receptance Weighted Key Value) model through Excel spreadsheets.☆20Jun 7, 2025Updated 9 months ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Mar 16, 2026Updated last week
- ☆16Jul 12, 2024Updated last year
- Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset☆31Apr 2, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆23May 25, 2023Updated 2 years ago
- ☆41Mar 14, 2024Updated 2 years ago
- Contains the code for the paper "Multi-Horizon Short-Term Load Forecasting Using Hybrid of LSTM and Modified Split Convolution"☆11Oct 28, 2023Updated 2 years ago
- 使用Gradio制作的基于RWKV的角色扮演的webui☆247Mar 5, 2025Updated last year
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆48Oct 21, 2025Updated 5 months ago
- This app forecasts the live traffic for the next 3 hours in the famous streets of Paris. Additionally, it also provides statistics for th…☆13Jul 16, 2024Updated last year
- Code for paper "DB-LSTM: Densely-Connected Bi-directional LSTM for Human Action Recognition"☆13Jul 1, 2022Updated 3 years ago