nkypy / candle-rwkvLinks
RWKV models and examples powered by candle.
☆18Updated 3 months ago
Alternatives and similar repositories for candle-rwkv
Users that are interested in candle-rwkv are comparing it to the libraries listed below
Sorting:
- A highly customizable, full scale web backend for web-rwkv, built on axum with websocket protocol.☆26Updated last year
- ☆124Updated last week
- A converter and basic tester for rwkv onnx☆41Updated last year
- State tuning tunes the state☆33Updated 3 months ago
- This project is established for real-time training of the RWKV model.☆49Updated last year
- ☆32Updated 2 years ago
- Reinforcement Learning Toolkit for RWKV.(v6,v7,ARWKV) Distillation,SFT,RLHF(DPO,ORPO), infinite context training, Aligning. Exploring the…☆42Updated 2 weeks ago
- Training a reward model for RLHF using RWKV.☆14Updated 2 years ago
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆147Updated 9 months ago
- ☆34Updated last month
- Inference RWKV with multiple supported backends.☆48Updated this week
- The WorldRWKV project aims to implement training and inference across various modalities using the RWKV7 architecture. By leveraging diff…☆47Updated 2 weeks ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆72Updated 4 months ago
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…☆93Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated last month
- Implementation of the RWKV language model in pure WebGPU/Rust.☆307Updated 2 weeks ago
- RWKV-7 mini☆11Updated 2 months ago
- ☆13Updated 5 months ago
- RWKV-7: Surpassing GPT☆88Updated 6 months ago
- Fine-tuning RWKV-World model☆25Updated 2 years ago
- JAX implementations of RWKV☆19Updated last year
- Course Project for COMP4471 on RWKV☆17Updated last year
- RWKV centralised docs for the community☆26Updated 2 months ago
- ☆18Updated 5 months ago
- ☆140Updated 6 months ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆32Updated 9 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- ☆19Updated 8 months ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆20Updated 2 years ago
- A high-performance constrained decoding engine based on context free grammar in Rust☆52Updated 2 weeks ago