KohakuBlueleaf / guanaco-loraLinks

Instruct-tune LLaMA on consumer hardware

☆73

Alternatives and similar repositories for guanaco-lora

Users that are interested in guanaco-lora are comparing it to the libraries listed below

Sorting:

TehVenomm / LM_Transformers_BlockMerge
Image Diffusion block merging technique applied to transformers based Language Models.
☆55Updated 2 years ago
Abel2076 / json2binidx_tool
☆81Updated last year
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆146Updated 2 years ago
clcarwin / alpaca-weight
Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.
☆52Updated 2 years ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!
☆147Updated last year
hizkifw / WebChatRWKVstic
ChatGPT-like Web UI for RWKVstic
☆99Updated 2 years ago
mayank31398 / GPTQ-for-SantaCoder
4 bits quantization of SantaCoder using GPTQ
☆50Updated 2 years ago
neromous / RWKV-Ouroboros
This project is established for real-time training of the RWKV model.
☆49Updated last year
nitrosocke / diffusers-webui
This is a Gradio WebUI working with the Diffusers format of Stable Diffusion
☆82Updated 2 years ago
oobabooga / GPTQ-for-LLaMa
4 bits quantization of LLaMa using GPTQ
☆130Updated 2 years ago
zsc / llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆109Updated 2 years ago
PygmalionAI / data-toolbox
Our data munging code.
☆33Updated 2 weeks ago
acpopescu / bitsandbytes
8-bit CUDA functions for PyTorch
☆41Updated 2 years ago
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆106Updated last year
resloved / RWKV-notebooks
📖 — Notebooks related to RWKV
☆58Updated 2 years ago
Keith-Hon / bitsandbytes-windows
8-bit CUDA functions for PyTorch in Windows 10
☆68Updated 2 years ago
BBuf / RWKV-World-HF-Tokenizer
☆34Updated last year
Durham / RWKV-finetune-script
Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset
☆31Updated 2 years ago
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆76Updated last year
LAION-AI / riverbed
Tools for content datamining and NLP at scale
☆44Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Updated 2 years ago
mrsteyk / RWKV-LM-deepspeed
☆42Updated 2 years ago
zarakiquemparte / zaraki-tools
☆26Updated 2 years ago
CoffeeVampir3 / ez-trainer
Train Llama Loras Easily
☆30Updated 2 years ago
NolanoOrg / llama-int4-quant
☆26Updated 2 years ago
bupticybee / FastLoRAChat
Instruct-tune LLaMA on consumer hardware with shareGPT data
☆126Updated 2 years ago
waifu-diffusion / network-trainer
☆27Updated 2 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
camenduru / Qwen-VL-Chat-colab
☆80Updated last year
Blealtan / RWKV-LM-LoRA
RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆412Updated 2 years ago