RWKV / RWKV-cpp-node
Node.js bindings for the RWKV.cpp module
☆20 · Updated last year
Related projects:
- BlinkDL's RWKV-v4 running in the browser ☆46 · Updated last year
- ☆13 · Updated last year
- JavaScript bindings for the ggml-js library ☆39 · Updated 9 months ago
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files. ☆42 · Updated last month
- Train your own small bitnet model ☆47 · Updated 3 months ago
- Training a reward model for RLHF using RWKV. ☆14 · Updated last year
- Easily deploy your rwkv model ☆18 · Updated last year
- 📖 Notebooks related to RWKV ☆59 · Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this… ☆20 · Updated last year
- 5X faster 60% less memory QLoRA finetuning ☆21 · Updated 3 months ago
- Enhancing LangChain prompts to work better with RWKV models ☆34 · Updated last year
- ☆26 · Updated last year
- GPT-2 small trained on phi-like data ☆65 · Updated 7 months ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation. ☆68 · Updated last year
- A converter and basic tester for rwkv onnx ☆40 · Updated 7 months ago
- ☆44 · Updated 8 months ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit ☆30 · Updated last year
- Framework-agnostic Python runtime for RWKV models ☆144 · Updated last year
- Making offline AI models accessible to all types of edge devices. ☆119 · Updated 7 months ago
- Instruct-tuning LLaMA on consumer hardware ☆66 · Updated last year
- ☆53 · Updated last year
- Trying to deconstruct RWKV in understandable terms ☆14 · Updated last year
- ☆80 · Updated 4 months ago
- An unsupervised model merging algorithm for Transformers-based language models. ☆96 · Updated 4 months ago
- A small standalone Flask (Python) server for llama.cpp that acts like a KoboldAI API. ☆14 · Updated last year
- Spotlight-like client for Ollama on Windows. ☆24 · Updated 4 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 · Updated last year
- ☆48 · Updated this week
- Inference Llama 2 in one file of pure JavaScript (HTML) ☆28 · Updated 2 months ago
- A finetuning pipeline for instruct tuning Raven 14bn using 4-bit QLoRA and the Ditty finetuning library ☆28 · Updated 3 months ago