AXKuhta / rwkv-onnx-dml

Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or on CPU only [Windows and Linux]. Currently limited to the 430M model because of the 2 GB .onnx file size limit.

☆21

Alternatives and similar repositories for rwkv-onnx-dml

Users who are interested in rwkv-onnx-dml are comparing it to the repositories listed below.

RWKV / rwkv-onnx
A converter and basic tester for rwkv onnx
☆42 Updated last year
josephrocca / rwkv-v4-web
BlinkDL's RWKV-v4 running in the browser
☆46 Updated 2 years ago
ArEnSc / Production-RWKV
This project aims to make RWKV accessible to everyone using a Hugging Face-like interface, while keeping it close to the R and D RWKV bra…
☆64 Updated 2 years ago
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆146 Updated 2 years ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond!
☆147 Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that fine-tuning a RoPE model on sequences longer than those seen in pre-training extends the model's context limit
☆62 Updated 2 years ago
PicoCreator / RWKV-LM-LoRA
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆10 Updated last year
wozeparrot / tinyrwkv
tinygrad port of the RWKV large language model.
☆44 Updated 7 months ago
yynil / RWKVInside
☆38 Updated 5 months ago
NolanoOrg / sparse_quant_llms
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
☆40 Updated 2 years ago
mrsteyk / RWKV-LM-deepspeed
☆42 Updated 2 years ago
Abel2076 / json2binidx_tool
☆81 Updated last year
OpenMOSE / RWKV-Infer
Large-scale RWKV v7 (World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states (Pseudo MoE). Easy to deploy…
☆45 Updated last week
BlinkDL / WorldModel
Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…
☆39 Updated 2 years ago
tensorpro / tpu_rwkv
JAX implementations of RWKV
☆19 Updated 2 years ago
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆56 Updated last year
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14 Updated 2 years ago
geov-ai / geov
The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121 Updated 2 years ago
catid / bitnet_cpu
Experiments with BitNet inference on CPU
☆54 Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45 Updated last year
zarakiquemparte / zaraki-tools
☆26 Updated 2 years ago
neromous / RWKV-Ouroboros
This project is established for real-time training of the RWKV model.
☆49 Updated last year
lachlansneff / sparsellama
☆40 Updated 2 years ago
abetlen / program-constrained-language-model-sampling
☆35 Updated 2 years ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆93 Updated last month
hizkifw / WebChatRWKVstic
ChatGPT-like Web UI for RWKVstic
☆99 Updated 2 years ago
leszekhanusz / diffusion-ui-backend
Backend for the diffusion-ui frontend
☆24 Updated last year
cahya-wirawan / rwkv-tokenizer
A fast RWKV Tokenizer written in Rust
☆53 Updated 2 months ago
ConiferLabsWA / flan-ul2-alpaca
☆33 Updated 2 years ago
SonicCodes / subcloning
Implementation of https://arxiv.org/pdf/2312.09299
☆21 Updated last year