randaller/llama-chat

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/randaller/llama-chat)

randaller / llama-chat

Chat with Meta's LLaMA models at home made easy

☆839

Alternatives and similar repositories for llama-chat

Users that are interested in llama-chat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

randaller / llama-cpu
View on GitHub
Inference on CPU code for LLaMA models
☆136Mar 19, 2023Updated 3 years ago
shawwn / llama-dl
View on GitHub
High-speed download of LLaMA, Facebook's 65B parameter GPT model
☆4,119Jun 28, 2023Updated 3 years ago
qwopqwop200 / GPTQ-for-LLaMa
View on GitHub
4 bits quantization of LLaMA using GPTQ
☆3,071Jul 13, 2024Updated 2 years ago
henrywoo / pyllama
View on GitHub
LLaMA: Open and Efficient Foundation Language Models
☆2,780Nov 8, 2023Updated 2 years ago
tloen / alpaca-lora
View on GitHub
Instruct-tune LLaMA on consumer hardware
☆18,909Jul 29, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
shawwn / llama
View on GitHub
Inference code for LLaMA models
☆189Mar 6, 2023Updated 3 years ago
henrywoo / chatllama
View on GitHub
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
☆1,199Jan 18, 2025Updated last year
modular-ml / wrapyfi-examples_llama
View on GitHub
Inference code for facebook LLaMA models with Wrapyfi support
☆128Mar 16, 2023Updated 3 years ago
tloen / llama-int8
View on GitHub
Quantized inference code for LLaMA models
☆1,038Mar 17, 2023Updated 3 years ago
tatsu-lab / stanford_alpaca
View on GitHub
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,252Jul 17, 2024Updated 2 years ago
nebuly-ai / optimate
View on GitHub
A collection of libraries to optimise AI model performances
☆8,332Jul 22, 2024Updated 2 years ago
meta-llama / llama
View on GitHub
Inference code for Llama models
☆59,520Jan 26, 2025Updated last year
jorahn / llama-int8
View on GitHub
Quantized inference code for LLaMA models
☆13Mar 12, 2023Updated 3 years ago
deep-diver / LLM-As-Chatbot
View on GitHub
LLM as a Chatbot Service
☆3,319Nov 20, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zphang / minimal-llama
View on GitHub
☆456Oct 15, 2023Updated 2 years ago
togethercomputer / OpenChatKit
View on GitHub
☆8,983Apr 9, 2024Updated 2 years ago
OpenGVLab / LLaMA-Adapter
View on GitHub
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,917Mar 14, 2024Updated 2 years ago
nlpxucan / WizardLM
View on GitHub
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
☆9,480Jun 7, 2025Updated last year
Lightning-AI / lit-llama
View on GitHub
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,082Jul 1, 2025Updated last year
lm-sys / FastChat
View on GitHub
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,496May 1, 2026Updated 2 months ago
oobabooga / textgen
View on GitHub
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
☆47,474Jun 2, 2026Updated last month
openlm-research / open_llama
View on GitHub
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,531Jul 16, 2023Updated 3 years ago
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
View on GitHub
Instruction Tuning with GPT-4
☆4,332Jun 11, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
antimatter15 / alpaca.cpp
View on GitHub
Locally run an Instruction-Tuned Chat-Style LLM
☆10,128Apr 19, 2023Updated 3 years ago
cocktailpeanut / dalai
View on GitHub
The simplest way to run LLaMA on your local machine
☆12,914Jun 18, 2024Updated 2 years ago
johnsmith0031 / alpaca_lora_4bit
View on GitHub
☆533Dec 1, 2023Updated 2 years ago
databrickslabs / dolly
View on GitHub
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
☆10,807Jun 30, 2023Updated 3 years ago
togethercomputer / RedPajama-Data
View on GitHub
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
☆4,969Jun 3, 2026Updated last month
LAION-AI / Open-Assistant
View on GitHub
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…
☆37,379Aug 17, 2024Updated last year
FMInference / FlexLLMGen
View on GitHub
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,358Oct 28, 2024Updated last year
Stability-AI / StableLM
View on GitHub
StableLM: Stability AI Language Models
☆15,686Apr 8, 2024Updated 2 years ago
feizc / MLE-LLaMA
View on GitHub
Multi-language Enhanced LLaMA
☆301Apr 13, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
devbrones / llama-prompts
View on GitHub
A collection of prompts for Llama
☆101Mar 23, 2023Updated 3 years ago
ggml-org / llama.cpp
View on GitHub
LLM inference in C/C++
☆121,178Updated this week
sahil280114 / codealpaca
View on GitHub
☆1,513May 12, 2023Updated 3 years ago
hpcaitech / ColossalAI
View on GitHub
Making large AI models cheaper, faster and more accessible
☆41,417Jul 13, 2026Updated last week
Beomi / transformers-language-modeling
View on GitHub
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
☆23May 20, 2021Updated 5 years ago
BlinkDL / ChatRWKV
View on GitHub
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
☆9,497Updated this week
lxe / simple-llm-finetuner
View on GitHub
Simple UI for LLM Model Finetuning
☆2,050Dec 21, 2023Updated 2 years ago