venuatu / llama
Inference code for LLaMA models
☆45Updated last year
Related projects: ⓘ
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆30Updated last year
- Inference code for facebook LLaMA models with Wrapyfi support☆130Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- Inference code for LLaMA models☆35Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆109Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 2 weeks ago
- Inference code for LLaMA models☆189Updated last year
- 4 bits quantization of SantaCoder using GPTQ☆54Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆122Updated last year
- Inference code for LLaMA 2 models☆30Updated 2 months ago
- Conversational Language model toolkit for training against human preferences.☆41Updated 5 months ago
- Framework agnostic python runtime for RWKV models☆144Updated last year
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆50Updated last year
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆68Updated last year
- LLM-based code completion engine☆172Updated last year
- ☆453Updated 11 months ago
- Simple, hackable and fast implementation for training/finetuning medium-sized LLaMA-based models☆143Updated last week
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆66Updated 11 months ago
- ☆40Updated last year
- Instruct-tuning LLaMA on consumer hardware☆66Updated last year
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆63Updated last year
- LLaVA server (llama.cpp).☆173Updated 11 months ago
- A collection of prompts for Llama☆94Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆96Updated 4 months ago
- ☆533Updated 9 months ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated last year
- Train Llama Loras Easily☆29Updated last year
- Instruct-tune LLaMA on consumer hardware☆73Updated last year