Blealtan / RWKV-LM-LoRA

RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embedding.

☆412

Alternatives and similar repositories for RWKV-LM-LoRA

Users who are interested in RWKV-LM-LoRA are comparing it to the libraries listed below.

Abel2076 / json2binidx_tool
☆81 Updated last year
harrisonvanderbyl / rwkv-cpp-accelerated
A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…
☆313 Updated last year
johnsmith0031 / alpaca_lora_4bit
☆534 Updated last year
hizkifw / WebChatRWKVstic
ChatGPT-like Web UI for RWKVstic
☆99 Updated 2 years ago
resloved / RWKV-notebooks
📖 — Notebooks related to RWKV
☆58 Updated 2 years ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond!
☆147 Updated last year
clcarwin / alpaca-weight
Train LLaMA with LoRA on a single 4090 and merge the LoRA weights to work as Stanford Alpaca.
☆52 Updated 2 years ago
neromous / RWKV-Ouroboros
This project is established for real-time training of the RWKV model.
☆49 Updated last year
zphang / minimal-llama
☆457 Updated 2 years ago
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆146 Updated 2 years ago
cryscan / eloise
A QQ Chatbot based on RWKV (W.I.P.)
☆79 Updated last year
RWKV / rwkv.cpp
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
☆1,550 Updated 7 months ago
thomasantony / llamacpp-python
Python bindings for llama.cpp
☆198 Updated 2 years ago
chu-tianxiang / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆131 Updated last year
Joluck / RWKV-PEFT
☆150 Updated last week
oobabooga / GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
☆130 Updated 2 years ago
sambanova / bloomchat
This repo contains the data preparation, tokenization, training and inference code for BLOOMChat. BLOOMChat is a 176 billion parameter mu…
☆586 Updated 2 years ago
Ai00-X / ai00_server
The all-in-one RWKV runtime box with embed, RAG, AI agents, and more.
☆587 Updated last week
modular-ml / wrapyfi-examples_llama
Inference code for facebook LLaMA models with Wrapyfi support
☆129 Updated 2 years ago
galatolofederico / vanilla-llama
Plain pytorch implementation of LLaMA
☆188 Updated 2 years ago
lxe / cerebras-lora-alpaca
LoRA weights for Cerebras-GPT-2.7b finetuned on Alpaca dataset with shorter prompt
☆63 Updated 2 years ago
jiamingkong / RWKV_chains
Enhancing LangChain prompts to work better with RWKV models
☆34 Updated 2 years ago
soulteary / llama-docker-playground
Quick Start LLaMA models with multiple methods, and fine-tune 7B/65B with One-Click.
☆351 Updated 2 years ago
Durham / RWKV-finetune-script
Script and instructions on how to fine-tune a large RWKV model on your data for the Alpaca dataset
☆31 Updated 2 years ago
tloen / llama-int8
Quantized inference code for LLaMA models
☆1,046 Updated 2 years ago
chrisociepa / allamo
Simple, hackable and fast implementation for training/finetuning medium-sized LLaMA-based models
☆182 Updated last month
iantbutler01 / rwkv-raven-qlora-4bit-instruct
A finetuning pipeline for instruct tuning Raven 14bn using QLORA 4bit and the Ditty finetuning library
☆28 Updated last year
shengxia / RWKV_Role_Playing
A role-playing web UI based on RWKV, built with Gradio
☆244 Updated 7 months ago
KohakuBlueleaf / guanaco-lora
Instruct-tune LLaMA on consumer hardware
☆72 Updated 2 years ago
PotatoSpudowski / fastLLaMa
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆412 Updated 2 years ago