rupeshs / alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM (Android/Linux/Windows/Mac)
☆268Updated last year
Related projects ⓘ
Alternatives and complementary repositories for alpaca.cpp
- C++ implementation for BLOOM☆811Updated last year
- fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…☆410Updated last year
- C++ implementation for 💫StarCoder☆445Updated last year
- SoTA Transformers with C-backend for fast inference on your CPU.☆312Updated 11 months ago
- A prompt/context management system☆165Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated last year
- A mobile Implementation of llama.cpp☆292Updated 9 months ago
- ☆406Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 2 months ago
- A discord bot that roleplays!☆146Updated last year
- 💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client☆309Updated 6 months ago
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆324Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆244Updated 9 months ago
- Python bindings for llama.cpp☆199Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆111Updated last year
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆307Updated last year
- A Simple Discord Bot for the Alpaca LLM☆102Updated last year
- Locally run an Assistant-Tuned Chat-Style LLM☆507Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆118Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,420Updated 3 months ago
- An experimental open-source attempt to make GPT-4 fully autonomous.☆100Updated last year
- Run Alpaca LLM in LangChain☆217Updated 10 months ago
- Simple, hackable and fast implementation for training/finetuning medium-sized LLaMA-based models☆152Updated this week
- LLM that combines the principles of wizardLM and vicunaLM☆711Updated last year
- ggml implementation of BERT☆464Updated 8 months ago
- ☆144Updated last year
- Uses Auto-GPT with Llama.cpp☆384Updated 7 months ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated last year