ypeleg / llama
User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.
☆338 · Updated 2 years ago
Alternatives and similar repositories for llama
Users interested in llama are comparing it to the libraries listed below.
- Tune any FALCON in 4-bit ☆466 · Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆821 · Updated 2 years ago
- ☆457 · Updated last year
- ☆535 · Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as… ☆352 · Updated last year
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA (see the 4-bit + LoRA loading sketch after this list) ☆81 · Updated last year
- Repo for fine-tuning Causal LLMs ☆455 · Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRA ☆628 · Updated last year
- Finetuning Large Language Models on One Consumer GPU in 2 Bits ☆721 · Updated last year
- Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT. ☆472 · Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ☆215 · Updated last year
- Crosslingual Generalization through Multitask Finetuning ☆535 · Updated 8 months ago
- Official code for ReLoRA from the paper "Stack More Layers Differently: High-Rank Training Through Low-Rank Updates" (a plain-PyTorch sketch of the low-rank update idea follows this list) ☆454 · Updated last year
- Plain PyTorch implementation of LLaMA ☆187 · Updated 2 years ago
- A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca) ☆1,118 · Updated last year
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario… ☆165 · Updated 2 years ago
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks. ☆546 · Updated last year
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning" ☆451 · Updated last year
- LOMO: LOw-Memory Optimization ☆984 · Updated 11 months ago
- Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization) ☆463 · Updated 2 years ago
- ☆269 · Updated 2 years ago
- A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick ☆289 · Updated last year
- Alpaca dataset from Stanford, cleaned and curated ☆1,554 · Updated 2 years ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆697 · Updated last year
- Fine-tune Mistral-7B on 3090s, A100s, H100s ☆711 · Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers ☆421 · Updated last year
- A minimal example of aligning language models with RLHF similar to ChatGPT ☆218 · Updated last year
- Chat with Meta's LLaMA models at home made easy ☆835 · Updated 2 years ago
- Due to LLaMA's license restrictions, we try to reimplement BLOOM-LoRA (the much less restrictive BLOOM license is here: https://huggingface.co/spaces/bigs… ☆185 · Updated last year
- An open collection of implementation tips, tricks and resources for training large language models ☆473 · Updated 2 years ago
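Several of the entries above (Platypus, ReLoRA, the QLoRA trainers) revolve around low-rank adapters. As a point of reference, here is a minimal sketch of the LoRA reparameterization in plain PyTorch; the class, rank, and scaling values are illustrative and not taken from any repository listed here.

```python
# Minimal sketch of the low-rank update idea behind LoRA/ReLoRA-style
# fine-tuning. All names and hyperparameters here are illustrative.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wraps a frozen linear layer with a trainable low-rank update:
    y = base(x) + (alpha/r) * x @ A^T @ B^T
    Only A and B receive gradients, so trainable parameters drop from
    d_out*d_in to r*(d_in + d_out)."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False                     # freeze pretrained weights
        d_out, d_in = base.weight.shape
        self.A = nn.Parameter(torch.randn(r, d_in) * 0.01)  # down-projection
        self.B = nn.Parameter(torch.zeros(d_out, r))         # up-projection, zero-init
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

# Quick check: wrap a projection-sized layer and count trainable params.
layer = LoRALinear(nn.Linear(4096, 4096))
out = layer(torch.randn(2, 16, 4096))
print(out.shape, sum(p.numel() for p in layer.parameters() if p.requires_grad))
```

Zero-initializing B means the wrapped layer starts out exactly equal to the frozen base layer, which is why this kind of adapter can be bolted onto a pretrained model without disturbing it at step zero.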
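Most of the 4-bit/QLoRA entries build on the Hugging Face transformers + peft + bitsandbytes stack. The sketch below shows the typical loading pattern under that assumption; the checkpoint name and `target_modules` are placeholders (LLaMA-style naming), not the settings of any specific repository above, and it needs a CUDA GPU with bitsandbytes installed.

```python
# Hedged sketch of 4-bit QLoRA-style loading with transformers + peft +
# bitsandbytes. Model id and LoRA hyperparameters are placeholders.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

model_id = "openlm-research/open_llama_3b"   # placeholder checkpoint

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                       # NF4 weight quantization
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,   # compute in bf16 on quantized weights
)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)

lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],     # assumed LLaMA-style attention names
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()           # only the LoRA adapters train
```

The base weights stay frozen in 4-bit; only the small LoRA adapters are trained and saved, which is what lets the consumer-hardware repos in this list fit 7B-class models on a single GPU.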