zphang/minimal-llama

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zphang/minimal-llama)

zphang / minimal-llama

☆456

Alternatives and similar repositories for minimal-llama

Users that are interested in minimal-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qwopqwop200 / GPTQ-for-LLaMa
View on GitHub
4 bits quantization of LLaMA using GPTQ
☆3,072Jul 13, 2024Updated 2 years ago
lxe / simple-llm-finetuner
View on GitHub
Simple UI for LLM Model Finetuning
☆2,053Dec 21, 2023Updated 2 years ago
lxe / llama-peft-tuner
View on GitHub
Tune LLaMa-7B on Alpaca Dataset using PEFT / LORA Based on @zphang's https://github.com/zphang/minimal-llama scripts.
☆25Mar 15, 2023Updated 3 years ago
johnsmith0031 / alpaca_lora_4bit
View on GitHub
☆534Dec 1, 2023Updated 2 years ago
tloen / alpaca-lora
View on GitHub
Instruct-tune LLaMA on consumer hardware
☆18,913Jul 29, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tloen / llama-int8
View on GitHub
Quantized inference code for LLaMA models
☆1,038Mar 17, 2023Updated 3 years ago
gururise / AlpacaDataCleaned
View on GitHub
Alpaca dataset from Stanford, cleaned and curated
☆1,602Mar 7, 2026Updated 4 months ago
OpenGVLab / LLaMA-Adapter
View on GitHub
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,916Mar 14, 2024Updated 2 years ago
sahil280114 / codealpaca
View on GitHub
☆1,515May 12, 2023Updated 3 years ago
vaguenebula / AlpacaDataReflect
View on GitHub
An experiment to see if chatgpt can improve the output of the stanford alpaca dataset
☆12Mar 29, 2023Updated 3 years ago
kaiokendev / cutoff-len-is-context-len
View on GitHub
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Jun 21, 2023Updated 3 years ago
mbzuai-nlp / LaMini-LM
View on GitHub
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆822May 6, 2023Updated 3 years ago
lachlansneff / sparsellama
View on GitHub
☆40Mar 25, 2023Updated 3 years ago
AetherCortex / Llama-X
View on GitHub
Open Academic Research on Improving LLaMA to SOTA LLM
☆1,605Aug 30, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tatsu-lab / stanford_alpaca
View on GitHub
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,246Jul 17, 2024Updated 2 years ago
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
View on GitHub
Instruction Tuning with GPT-4
☆4,333Jun 11, 2023Updated 3 years ago
PotatoSpudowski / fastLLaMa
View on GitHub
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆413Jun 2, 2023Updated 3 years ago
feizc / MLE-LLaMA
View on GitHub
Multi-language Enhanced LLaMA
☆301Apr 13, 2023Updated 3 years ago
FMInference / FlexLLMGen
View on GitHub
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,364Oct 28, 2024Updated last year
CarperAI / trlx
View on GitHub
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,752Jan 8, 2024Updated 2 years ago
deep-diver / LLM-As-Chatbot
View on GitHub
LLM as a Chatbot Service
☆3,320Nov 20, 2023Updated 2 years ago
Lightning-AI / lit-llama
View on GitHub
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,084Jul 1, 2025Updated last year
henrywoo / pyllama
View on GitHub
LLaMA: Open and Efficient Foundation Language Models
☆2,780Nov 8, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zsc / llama_infer
View on GitHub
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆109Mar 24, 2023Updated 3 years ago
IST-DASLab / gptq
View on GitHub
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
☆2,342Mar 27, 2024Updated 2 years ago
nebuly-ai / optimate
View on GitHub
A collection of libraries to optimise AI model performances
☆8,333Jul 22, 2024Updated 2 years ago
openlm-research / open_llama
View on GitHub
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,533Jul 16, 2023Updated 3 years ago
PygmalionAI / logbooks
View on GitHub
Where we keep our notes about model training runs.
☆16Mar 12, 2023Updated 3 years ago
togethercomputer / OpenDataHub
View on GitHub
☆127Apr 26, 2023Updated 3 years ago
teknium1 / GPTeacher
View on GitHub
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,667Sep 15, 2023Updated 2 years ago
CStanKonrad / long_llama
View on GitHub
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…
☆1,465Nov 7, 2023Updated 2 years ago
henrywoo / chatllama
View on GitHub
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
☆1,200Jan 18, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
AlpinDale / RPTQ-for-LLaMA
View on GitHub
Efficient 3bit/4bit quantization of LLaMA models
☆18May 18, 2023Updated 3 years ago
project-baize / baize-chatbot
View on GitHub
Let ChatGPT teach your own chatbot in hours with a single GPU!
☆3,151Mar 17, 2024Updated 2 years ago
pointnetwork / point-alpaca
View on GitHub
☆402Mar 22, 2023Updated 3 years ago
cat-state / tinypar
View on GitHub
☆20Jul 12, 2023Updated 3 years ago
young-geng / EasyLM
View on GitHub
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆2,514Aug 13, 2024Updated last year
huggingface / peft
View on GitHub
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆21,460Updated this week
bigscience-workshop / xmtf
View on GitHub
Crosslingual Generalization through Multitask Finetuning
☆535Sep 22, 2024Updated last year