gmorenz / llama
Inference code for LLaMA models
☆35Updated last year
Related projects ⓘ
Alternatives and complementary repositories for llama
- Inference code for LLaMA models☆45Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆111Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 2 months ago
- Experimental sampler to make LLMs more creative☆30Updated last year
- A Qt GUI for large language models☆40Updated last year
- rwkv_chatbot☆62Updated last year
- An OpenAI-like LLaMA inference API☆111Updated last year
- Instruct-tuning LLaMA on consumer hardware☆66Updated last year
- Inference code for facebook LLaMA models with Wrapyfi support☆130Updated last year
- A collection of prompts for Llama☆96Updated last year
- Conversational Language model toolkit for training against human preferences.☆41Updated 7 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated last year
- The code we currently use to fine-tune models.☆109Updated 6 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆35Updated last year
- A simple experiment on letting two local LLM have a conversation about anything!☆92Updated 4 months ago
- Falcon LLM ggml framework with CPU and GPU support☆245Updated 10 months ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime.☆128Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated 7 months ago
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code☆44Updated last year
- A template to run LLaMA in Cog☆63Updated last year
- Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT☆48Updated last year
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆91Updated last year
- Little AI roleplay program☆52Updated last year
- A semi-scalable system to scrape the chatgpt API to make input/output pairs☆38Updated last year
- Embedding models from Jina AI☆56Updated 10 months ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆66Updated last year