clcarwin / alpaca-weightLinks

Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.

☆52

Alternatives and similar repositories for alpaca-weight

Users that are interested in alpaca-weight are comparing it to the libraries listed below

Sorting:

zphang / minimal-llama
☆457Updated 2 years ago
johnsmith0031 / alpaca_lora_4bit
☆534Updated last year
epfml / landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
☆426Updated last year
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆207Updated last year
modular-ml / wrapyfi-examples_llama
Inference code for facebook LLaMA models with Wrapyfi support
☆129Updated 2 years ago
Blealtan / RWKV-LM-LoRA
RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆412Updated 2 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
mayank31398 / GPTQ-for-SantaCoder
4 bits quantization of SantaCoder using GPTQ
☆50Updated 2 years ago
aspctu / alpaca-lora
Instruct-tuning LLaMA on consumer hardware
☆65Updated 2 years ago
harrisonvanderbyl / rwkv-cpp-accelerated
A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…
☆313Updated last year
bupticybee / FastLoRAChat
Instruct-tune LLaMA on consumer hardware with shareGPT data
☆126Updated 2 years ago
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆146Updated 2 years ago
thomasantony / llamacpp-python
Python bindings for llama.cpp
☆198Updated 2 years ago
salesforce / CodeGen2
CodeGen2 models for program synthesis
☆269Updated 2 years ago
radi-cho / botbots
A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…
☆164Updated 2 years ago
henrywoo / minichatgpt
minichatgpt - To Train ChatGPT In 5 Minutes
☆169Updated 2 years ago
iwalton3 / mpt-lora-patch
Patch for MPT-7B which allows using and training a LoRA
☆58Updated 2 years ago
jondurbin / bagel
A bagel, with everything.
☆324Updated last year
dzhulgakov / llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
☆368Updated last year
AlpinDale / sparsegpt-for-LLaMA
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
☆70Updated 2 years ago
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆77Updated last year
DachengLi1 / LongChat
Official repository for LongChat and LongEval
☆531Updated last year
PotatoSpudowski / fastLLaMa
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆412Updated 2 years ago
chu-tianxiang / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆131Updated last year
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆108Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
☆73Updated 2 years ago
bigcode-project / starcoder.cpp
C++ implementation for 💫StarCoder
☆455Updated 2 years ago
kaistAI / SelFee
Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"
☆228Updated 2 years ago
hizkifw / WebChatRWKVstic
ChatGPT-like Web UI for RWKVstic
☆99Updated 2 years ago
Abel2076 / json2binidx_tool
☆81Updated last year