nkypy / candle-rwkv
RWKV models and examples powered by candle.
☆18Updated last month
Alternatives and similar repositories for candle-rwkv:
Users that are interested in candle-rwkv are comparing it to the libraries listed below
- A highly customizable, full scale web backend for web-rwkv, built on axum with websocket protocol.☆26Updated 11 months ago
- ☆116Updated 3 weeks ago
- ☆32Updated 2 years ago
- State tuning tunes the state☆31Updated last month
- Implementation of the RWKV language model in pure WebGPU/Rust.☆297Updated this week
- RWKV centralised docs for the community☆22Updated last week
- A converter and basic tester for rwkv onnx☆42Updated last year
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Updated 7 months ago
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…☆93Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276☆27Updated 2 months ago
- This project is established for real-time training of the RWKV model.☆49Updated 10 months ago
- ☆19Updated 5 months ago
- Training a reward model for RLHF using RWKV.☆14Updated last year
- A Fish Speech implementation in Rust, with Candle.rs☆75Updated last month
- ☆40Updated last week
- RWKV in nanoGPT style☆189Updated 9 months ago
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆22Updated last week
- Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the…☆37Updated last week
- RWKV, in easy to read code☆71Updated last week
- Fine-tuning RWKV-World model☆25Updated last year
- ☆32Updated last week
- Implementing the BitNet model in Rust☆31Updated 11 months ago
- A large-scale RWKV v6, v7(World, ARWKV, PRWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy o…☆33Updated last week
- Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset☆31Updated 2 years ago
- Sample Python extension using Rust/PyO3/tch to interact with PyTorch☆34Updated last year
- Fast modular code to create and train cutting edge LLMs☆66Updated 10 months ago
- Mini Model Daemon☆11Updated 4 months ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆70Updated 2 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- ☆18Updated 3 months ago