RWKV / rwkv-onnx
A converter and basic tester for rwkv onnx
☆42Updated last year
Alternatives and similar repositories for rwkv-onnx:
Users that are interested in rwkv-onnx are comparing it to the libraries listed below
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆20Updated 2 years ago
- Easily deploy your rwkv model☆18Updated last year
- Inference RWKV with multiple supported backends.☆35Updated this week
- BlinkDL's RWKV-v4 running in the browser☆47Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- Course Project for COMP4471 on RWKV☆17Updated last year
- ☆124Updated last year
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆59Updated this week
- This project is established for real-time training of the RWKV model.☆49Updated 10 months ago
- ☆110Updated last week
- Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset☆31Updated last year
- ☆82Updated 10 months ago
- Enhancing LangChain prompts to work better with RWKV models☆34Updated last year
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…☆40Updated last year
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆41Updated 2 years ago
- A fast RWKV Tokenizer written in Rust☆43Updated this week
- ☆12Updated last year
- Framework agnostic python runtime for RWKV models☆145Updated last year
- ☆18Updated 2 months ago
- A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…☆310Updated last year
- tinygrad port of the RWKV large language model.☆44Updated 2 weeks ago
- RWKV models and examples powered by candle.☆18Updated 3 weeks ago
- A highly customizable, full scale web backend for web-rwkv, built on axum with websocket protocol.☆26Updated 11 months ago
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆20Updated this week
- JAX implementations of RWKV☆19Updated last year
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models☆36Updated last year
- 基于RWKV模型的角色扮演,实际上是个改的妈都不认识的 RWKV_Role_Playing☆16Updated last year
- Gradio UI for RWKV LLM☆29Updated 2 years ago
- RWKV (Receptance Weighted Key Value) is a RNN with Transformer-level performance☆39Updated 2 years ago