PicoCreator / RWKV-LM-LoRALinks

RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

☆10

Alternatives and similar repositories for RWKV-LM-LoRA

Users that are interested in RWKV-LM-LoRA are comparing it to the libraries listed below

Sorting:

Durham / RWKV-finetune-script
Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset
☆31Updated 2 years ago
Aratako / Task-Vector-Merge-Optimzier
☆14Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Updated 2 years ago
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated 2 years ago
rmihaylov / mpttune
Tune MPTs
☆84Updated 2 years ago
zarakiquemparte / zaraki-tools
☆26Updated 2 years ago
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated 2 years ago
RWKV / RWKV-wiki
RWKV centralised docs for the community
☆29Updated 2 months ago
emrgnt-cmplxty / SmolTrainer
☆19Updated 2 years ago
donaldafeith / Pytorch_Merge
Merge LLM that are split in to parts
☆26Updated 3 months ago
CoffeeVampir3 / ez-trainer
Train Llama Loras Easily
☆30Updated 2 years ago
NolanoOrg / sparse_quant_llms
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
☆40Updated 2 years ago
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆68Updated last year
OpenMOSE / RWKV-Infer
A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…
☆45Updated last week
kyegomez / FastFF
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆15Updated 11 months ago
Glavin001 / Data2AITextbook
🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)
☆25Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
☆73Updated 2 years ago
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated 2 years ago
mzbac / qlora-inference-multi-gpu
☆13Updated 2 years ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!
☆147Updated last year
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆76Updated last year
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
yynil / RWKVInside
☆39Updated 5 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆27Updated 11 months ago
AXKuhta / rwkv-onnx-dml
Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…
☆21Updated 2 years ago
resloved / RWKV-notebooks
📖 — Notebooks related to RWKV
☆58Updated 2 years ago
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14Updated 2 years ago
lachlansneff / sparsellama
☆40Updated 2 years ago