shawwn / llama
Inference code for LLaMA models
☆189Updated last year
Related projects: ⓘ
- ☆257Updated this week
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆109Updated last year
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆326Updated last year
- ☆406Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 2 weeks ago
- Inference code for LLaMA models☆45Updated last year
- ☆533Updated 9 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- Inference code for LLaMA models☆35Updated last year
- Inference code for facebook LLaMA models with Wrapyfi support☆130Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆245Updated 7 months ago
- ☆453Updated 11 months ago
- fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…☆408Updated last year
- Quantized inference code for LLaMA models☆1,052Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆30Updated last year
- Nearly a thousand bash and python scripts I've written over the years.☆118Updated 2 months ago
- SoTA Transformers with C-backend for fast inference on your CPU.☆311Updated 9 months ago
- OpenAI API webserver☆180Updated 2 years ago
- Python bindings for llama.cpp☆199Updated last year
- C++ implementation for 💫StarCoder☆443Updated last year
- Instruct-tune LLaMA on consumer hardware☆363Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆347Updated last year
- A collection of prompts for Llama☆94Updated last year
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…☆165Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated last year
- A prompt/context management system☆163Updated last year
- Simple, hackable and fast implementation for training/finetuning medium-sized LLaMA-based models☆143Updated last week
- LLM-based code completion engine☆172Updated last year
- Supercharge Open-Source AI Models☆348Updated last year