harrisonvanderbyl / rwkvstic

Framework-agnostic Python runtime for RWKV models

☆146

Alternatives and similar repositories for rwkvstic

Users who are interested in rwkvstic are comparing it to the libraries listed below.

ArEnSc / Production-RWKV
This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…
☆64Updated 2 years ago
hizkifw / WebChatRWKVstic
ChatGPT-like Web UI for RWKVstic
☆100Updated 2 years ago
harrisonvanderbyl / rwkv-cpp-accelerated
A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…
☆313Updated last year
harrisonvanderbyl / rwkv_chatbot
rwkv_chatbot
☆61Updated 2 years ago
resloved / RWKV-notebooks
📖 — Notebooks related to RWKV
☆58Updated 2 years ago
aspctu / alpaca-lora
Instruct-tuning LLaMA on consumer hardware
☆65Updated 2 years ago
mrsteyk / RWKV-LM-deepspeed
☆42Updated 2 years ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond!
☆146Updated last year
NolanoOrg / cformers
SoTA Transformers with C-backend for fast inference on your CPU.
☆308Updated last year
BlinkDL / RWKV-v2-RNN-Pile
RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.
☆66Updated 3 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆115Updated 2 years ago
zphang / minimal-gpt-neox-20b
☆131Updated 3 years ago
PotatoSpudowski / fastLLaMa
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆411Updated 2 years ago
BlinkDL / WorldModel
Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…
☆39Updated 2 years ago
mayank31398 / GPTQ-for-SantaCoder
4 bits quantization of SantaCoder using GPTQ
☆51Updated 2 years ago
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆106Updated last year
AlpinDale / sparsegpt-for-LLaMA
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
☆71Updated 2 years ago
Rallio67 / language-model-agents
Experiments with generating opensource language model assistants
☆97Updated 2 years ago
johnsmith0031 / alpaca_lora_4bit
☆534Updated last year
lachlansneff / sparsellama
☆40Updated 2 years ago
geov-ai / geov
The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121Updated 2 years ago
0cc4m / GPTQ-for-LLaMa
4 bits quantization of LLMs using GPTQ
☆49Updated 2 years ago
thomasantony / llamacpp-python
Python bindings for llama.cpp
☆198Updated 2 years ago
Abel2076 / json2binidx_tool
☆81Updated last year
skeskinen / llama-lite
Embeddings focused small version of Llama NLP model
☆106Updated 2 years ago
RWKV / rwkv-onnx
A converter and basic tester for rwkv onnx
☆42Updated last year
wozeparrot / tinyrwkv
tinygrad port of the RWKV large language model.
☆44Updated 8 months ago
NolanoOrg / sparse_quant_llms
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
☆40Updated 2 years ago
Durham / RWKV-finetune-script
Script and instructions for fine-tuning a large RWKV model on your data for the Alpaca dataset
☆31Updated 2 years ago