zanussbaum / gpt4all.cppLinks

Locally run an Assistant-Tuned Chat-Style LLM

☆496

Alternatives and similar repositories for gpt4all.cpp

Users that are interested in gpt4all.cpp are comparing it to the libraries listed below

Sorting:

melodysdreamj / WizardVicunaLM
LLM that combines the principles of wizardLM and vicunaLM
☆716Updated 2 years ago
bigcode-project / starcoder.cpp
C++ implementation for 💫StarCoder
☆459Updated 2 years ago
keldenl / gpt-llama.cpp
A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI…
☆597Updated 2 years ago
rhohndorf / Auto-Llama-cpp
Uses Auto-GPT with Llama.cpp
☆385Updated last year
pointnetwork / point-alpaca
☆404Updated 2 years ago
PotatoSpudowski / fastLLaMa
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…
☆412Updated 2 years ago
NouamaneTazi / bloomz.cpp
C++ implementation for BLOOM
☆809Updated 2 years ago
nomic-ai / pygpt4all
Official supported Python bindings for llama.cpp + gpt4all
☆1,016Updated 2 years ago
rupeshs / alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM (Android/Linux/Windows/Mac)
☆262Updated 2 years ago
NolanoOrg / cformers
SoTA Transformers with C-backend for fast inference on your CPU.
☆311Updated 2 years ago
randaller / llama-chat
Chat with Meta's LLaMA models at home made easy
☆842Updated 2 years ago
johnsmith0031 / alpaca_lora_4bit
☆535Updated 2 years ago
petals-infra / chat.petals.dev
💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client
☆316Updated last year
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆249Updated 2 years ago
teknium1 / GPTeacher
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,629Updated 2 years ago
ViperX7 / Alpaca-Turbo
Web UI to run alpaca model locally
☆868Updated 2 years ago
RWKV / rwkv.cpp
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
☆1,562Updated 10 months ago
lastmile-ai / llama-retrieval-plugin
LLaMa retrieval plugin script using OpenAI's retrieval plugin
☆323Updated 2 years ago
BillSchumacher / Auto-Vicuna
☆137Updated 2 years ago
randaller / llama-cpu
Inference on CPU code for LLaMA models
☆137Updated 2 years ago
thomasantony / llamacpp-python
Python bindings for llama.cpp
☆198Updated 2 years ago
vicuna-tools / vicuna-installation-guide
The "vicuna-installation-guide" provides step-by-step instructions for installing and configuring Vicuna 13 and 7B
☆282Updated 2 years ago
oobabooga / GPTQ-for-LLaMa
4 bits quantization of LLaMa using GPTQ
☆131Updated 2 years ago
skeskinen / bert.cpp
ggml implementation of BERT
☆498Updated last year
harrisonvanderbyl / rwkv-cpp-accelerated
A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…
☆313Updated 2 years ago
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆109Updated 2 years ago
tloen / llama-int8
Quantized inference code for LLaMA models
☆1,046Updated 2 years ago
nomic-ai / gpt4all-chat
gpt4all-j chat
☆1,271Updated 2 years ago
jankais3r / LLaMA_MPS
Run LLaMA (and Stanford-Alpaca) inference on Apple Silicon GPUs.
☆585Updated 2 years ago
togethercomputer / redpajama.cpp
Extend the original llama.cpp repo to support redpajama model.
☆118Updated last year