tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

☆30,098

Alternatives and similar repositories for stanford_alpaca

Users who are interested in stanford_alpaca often compare it to the libraries listed below.

tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,931 Updated last year
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆38,944 Updated 2 months ago
deepspeedai / DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆39,599 Updated this week
Stability-AI / StableLM
StableLM: Stability AI Language Models
☆15,824 Updated last year
BlinkDL / RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆13,876 Updated last week
hpcaitech / ColossalAI
Making large AI models cheaper, faster and more accessible
☆41,062 Updated last week
Lightning-AI / lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,083 Updated last month
meta-llama / llama
Inference code for Llama models
☆58,594 Updated 6 months ago
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆19,252 Updated this week
antimatter15 / alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM
☆10,223 Updated 2 years ago
microsoft / JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
☆24,249 Updated last week
togethercomputer / OpenChatKit
☆9,012 Updated last year
cocktailpeanut / dalai
The simplest way to run LLaMA on your local machine
☆13,072 Updated last year
FMInference / FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,356 Updated 9 months ago
zai-org / GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
☆7,689 Updated 2 years ago
openlm-research / open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,515 Updated 2 years ago
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,593 Updated last year
Vision-CAIR / MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
☆25,742 Updated 11 months ago
BlinkDL / ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
☆9,505 Updated 3 months ago
OptimalScale / LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
☆8,453 Updated 3 weeks ago
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆21,063 Updated last week
togethercomputer / RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
☆4,783 Updated 8 months ago
yizhongw / self-instruct
Aligning pretrained language models with instruction data generated by themselves.
☆4,441 Updated 2 years ago
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆23,229 Updated 11 months ago
run-llama / llama_index
LlamaIndex is the leading framework for building LLM-powered agents over your data.
☆43,460 Updated this week
microsoft / LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆12,477 Updated 7 months ago
OpenGVLab / LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,888 Updated last year
deepspeedai / DeepSpeedExamples
Example models using DeepSpeed
☆6,606 Updated last week
bigcode-project / starcoder
Home of StarCoder: fine-tuning & inference!
☆7,441 Updated last year
yoheinakajima / babyagi
☆21,697 Updated 9 months ago