cedrickchee / llama
Inference code for LLaMA 2 models
☆31Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for llama
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 2 months ago
- ☆26Updated last year
- ☆33Updated last year
- Instruct-tune LLaMA on consumer hardware☆73Updated last year
- LoRA weights for Cerebras-GPT-2.7b finetuned on Alpaca dataset with shorter prompt☆63Updated last year
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆50Updated last year
- Instruct-tuning LLaMA on consumer hardware☆66Updated last year
- Tune MPTs☆84Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆111Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆100Updated 6 months ago
- Scripts to create your own moe models using mlx☆86Updated 8 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆40Updated last year
- LLM family chart☆51Updated last year
- fine tuning mistral 7B using Huggingface, Weights and Biases, Choline, and Vast AI☆38Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- ☆42Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆122Updated last year
- BlinkDL's RWKV-v4 running in the browser☆46Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆161Updated 10 months ago
- LLaMA implementation for HuggingFace Transformers☆39Updated last year
- Repository featuring fine-tuning code for various LLMs, complemented by occasional explanations, deep dives.☆40Updated 2 months ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated last year
- ☆83Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated 10 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆43Updated 9 months ago
- Experimental sampler to make LLMs more creative☆30Updated last year
- Langport is a language model inference service☆93Updated 2 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆102Updated 3 months ago
- Inference code for facebook LLaMA models with Wrapyfi support☆130Updated last year
- Finetuning BLOOM on a single GPU using gradient-accumulation☆25Updated last year