lxe / llama-peft-tunerLinks
Tune LLaMa-7B on Alpaca Dataset using PEFT / LORA Based on @zphang's https://github.com/zphang/minimal-llama scripts.
☆25Updated 2 years ago
Alternatives and similar repositories for llama-peft-tuner
Users that are interested in llama-peft-tuner are comparing it to the libraries listed below
Sorting:
- ☆460Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆353Updated 2 years ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆215Updated last year
- User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.☆339Updated 2 years ago
- 🥤🧑🏻🚀Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization…☆232Updated last year
- Repo for fine-tuning Casual LLMs☆457Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆78Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆219Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆172Updated 2 years ago
- Scripts for fine-tuning Llama2 via SFT and DPO.☆203Updated 2 years ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆51Updated 2 years ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆229Updated 2 years ago
- Patch for MPT-7B which allows using and training a LoRA☆58Updated 2 years ago
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆81Updated last year
- ☆96Updated 2 years ago
- ☆180Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- Merge Transformers language models by use of gradient parameters.☆208Updated last year
- ☆98Updated 2 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆94Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆163Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104Updated 3 months ago
- A bagel, with everything.☆324Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers☆423Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆247Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRA☆628Updated last year
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…☆166Updated 2 years ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆556Updated last year
- ☆161Updated last year
- ☆535Updated last year