gmorenz / llama
Inference code for LLaMA models
☆35 · Updated last year
Alternatives and similar repositories for llama:
Users who are interested in llama are comparing it to the libraries listed below
- Inference code for LLaMA models ☆46 · Updated last year
- GPT-2 small trained on phi-like data ☆65 · Updated 11 months ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit ☆31 · Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts ☆111 · Updated last year
- ☆40 · Updated last year
- Instruct-tuning LLaMA on consumer hardware ☆66 · Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 · Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 · Updated last year
- The code we currently use to fine-tune models. ☆112 · Updated 8 months ago
- Conversational Language model toolkit for training against human preferences. ☆40 · Updated 9 months ago
- 📖 Notebooks related to RWKV ☆59 · Updated last year
- Inference code for facebook LLaMA models with Wrapyfi support ☆130 · Updated last year
- LLM family chart ☆50 · Updated last year
- Model REVOLVER, a human in the loop model mixing system. ☆33 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆77 · Updated 9 months ago
- Extend the original llama.cpp repo to support redpajama model. ☆117 · Updated 4 months ago
- Flexible Python package for managing and extending LLM based agents ☆25 · Updated last year
- ☆74 · Updated last year
- Reimplementation of the task generation part from the Alpaca paper ☆119 · Updated last year
- A Simple Discord Bot for the Alpaca LLM ☆101 · Updated last year
- A collection of prompts for Llama ☆97 · Updated last year
- An OpenAI-like LLaMA inference API ☆113 · Updated last year
- This repo is for handling Question Answering, especially for Multi-hop Question Answering ☆67 · Updated last year
- A super simple web interface to perform blind tests on LLM outputs. ☆27 · Updated 10 months ago
- 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0. ☆56 · Updated 3 years ago
- Experimental sampler to make LLMs more creative ☆30 · Updated last year
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation. ☆71 · Updated last year
- A Qt GUI for large language models ☆40 · Updated last year
- A guidance compatibility layer for llama-cpp-python ☆34 · Updated last year
- QLoRA with Enhanced Multi GPU Support ☆36 · Updated last year