Lightning-AI / lit-llamaLinks

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

☆6,077

Alternatives and similar repositories for lit-llama

Users that are interested in lit-llama are comparing it to the libraries listed below

Sorting:

togethercomputer / RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
☆4,829Updated 10 months ago
openlm-research / open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,526Updated 2 years ago
OpenGVLab / LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,906Updated last year
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,710Updated last year
young-geng / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆2,497Updated last year
henrywoo / pyllama
LLaMA: Open and Efficient Foundation Language Models
☆2,801Updated last year
CarperAI / trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,714Updated last year
stochasticai / xTuring
Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-s…
☆2,659Updated last week
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
Instruction Tuning with GPT-4
☆4,332Updated 2 years ago
nlpxucan / WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
☆9,454Updated 4 months ago
yizhongw / self-instruct
Aligning pretrained language models with instruction data generated by themselves.
☆4,501Updated 2 years ago
deep-diver / LLM-As-Chatbot
LLM as a Chatbot Service
☆3,343Updated last year
qwopqwop200 / GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ
☆3,075Updated last year
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,969Updated last year
AutoGPTQ / AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
☆4,970Updated 6 months ago
EleutherAI / pythia
The hub for EleutherAI's work on interpretability and learning dynamics
☆2,639Updated 4 months ago
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆7,659Updated 3 weeks ago
project-baize / baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
☆3,170Updated last year
ray-project / llm-numbers
Numbers every LLM developer should know
☆4,260Updated last year
lxe / simple-llm-finetuner
Simple UI for LLM Model Finetuning
☆2,065Updated last year
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆19,832Updated last week
FMInference / FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,371Updated 11 months ago
microsoft / LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
☆4,151Updated 3 months ago
lamini-ai / lamini
The Official Python Client for Lamini's API
☆2,543Updated 6 months ago
lucidrains / toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
☆2,050Updated last year
gururise / AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
☆1,574Updated 2 years ago
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,580Updated last month
turboderp / exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆2,903Updated 2 years ago
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,070Updated last year
mlfoundations / open_flamingo
An open-source framework for training large multimodal models.
☆4,029Updated last year