RWKV / RWKV-cpp-node
Node.js implementation binding for the RWKV.cpp module
☆21Updated 2 years ago
Alternatives and similar repositories for RWKV-cpp-node
Users who are interested in RWKV-cpp-node are comparing it to the libraries listed below.
- Training a reward model for RLHF using RWKV.☆15Updated 2 years ago
- BlinkDL's RWKV-v4 running in the browser☆48Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files.☆51Updated last year
- A converter and basic tester for rwkv onnx☆43Updated 2 years ago
- Framework agnostic python runtime for RWKV models☆147Updated 2 years ago
- ☆13Updated 2 years ago
- Easily deploy your rwkv model☆19Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models.☆108Updated last year
- Porting BabyAGI to Oobabooga.☆31Updated 2 years ago
- GPT-2 small trained on phi-like data☆68Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Updated 2 years ago
- A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…☆313Updated 2 years ago
- ☆32Updated 2 years ago
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago
- A collection of prompts for Llama☆102Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆109Updated 2 years ago
- JAX implementations of RWKV☆19Updated 2 years ago
- Merge LLMs that are split into parts☆27Updated 6 months ago
- Train your own small bitnet model☆77Updated last year
- RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond!☆148Updated last year
- ☆13Updated 2 years ago
- ☆81Updated last year
- A finetuning pipeline for instruct-tuning Raven 14bn using QLoRA 4-bit and the Ditty finetuning library☆28Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated 2 years ago
- Making offline AI models accessible to all types of edge devices.☆146Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆32Updated 2 years ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated 2 years ago
- ☆40Updated 9 months ago
- Harnessing the Memory Power of the Camelids☆147Updated 2 years ago