johnsmith0031 / alpaca_lora_4bitLinks

☆534

Alternatives and similar repositories for alpaca_lora_4bit

Users that are interested in alpaca_lora_4bit are comparing it to the libraries listed below

Sorting:

rmihaylov / falcontune
Tune any FALCON in 4-bit
☆464Updated 2 years ago
zphang / minimal-llama
☆457Updated 2 years ago
pointnetwork / point-alpaca
☆403Updated 2 years ago
jondurbin / airoboros
Customizable implementation of the self-instruct paper.
☆1,050Updated last year
melodysdreamj / WizardVicunaLM
LLM that combines the principles of wizardLM and vicunaLM
☆716Updated 2 years ago
PotatoSpudowski / fastLLaMa
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆412Updated 2 years ago
kuleshov-group / llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
☆730Updated last year
chrisociepa / allamo
Simple, hackable and fast implementation for training/finetuning medium-sized LLaMA-based models
☆182Updated last month
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆821Updated 2 years ago
epfml / landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
☆426Updated last year
gururise / AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
☆1,577Updated 2 years ago
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆247Updated last year
NolanoOrg / cformers
SoTA Transformers with C-backend for fast inference on your CPU.
☆308Updated last year
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
declare-lab / flan-alpaca
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…
☆356Updated 2 years ago
NouamaneTazi / bloomz.cpp
C++ implementation for BLOOM
☆806Updated 2 years ago
Blealtan / RWKV-LM-LoRA
RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆412Updated 2 years ago
modular-ml / wrapyfi-examples_llama
Inference code for facebook LLaMA models with Wrapyfi support
☆129Updated 2 years ago
teknium1 / GPTeacher
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,634Updated 2 years ago
mallorbc / Finetune_LLMs
Repo for fine-tuning Casual LLMs
☆456Updated last year
zetavg / LLaMA-LoRA-Tuner
UI tool for fine-tuning and testing your own LoRA models base on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT…
☆475Updated 2 years ago
bigcode-project / starcoder.cpp
C++ implementation for 💫StarCoder
☆455Updated 2 years ago
tloen / llama-int8
Quantized inference code for LLaMA models
☆1,046Updated 2 years ago
lastmile-ai / llama-retrieval-plugin
LLaMa retrieval plugin script using OpenAI's retrieval plugin
☆323Updated 2 years ago
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆207Updated last year
alasdairforsythe / tokenmonster
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
☆604Updated last year
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆629Updated last year
mzbac / qlora-fine-tune
☆166Updated 2 years ago
qwopqwop200 / GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ
☆3,076Updated last year
skeskinen / bert.cpp
ggml implementation of BERT
☆494Updated last year