lxe / llama-peft-tunerLinks
Tune LLaMa-7B on Alpaca Dataset using PEFT / LORA Based on @zphang's https://github.com/zphang/minimal-llama scripts.
☆25Updated 2 years ago
Alternatives and similar repositories for llama-peft-tuner
Users that are interested in llama-peft-tuner are comparing it to the libraries listed below
Sorting:
- ☆96Updated 2 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆171Updated 2 years ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆51Updated 2 years ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆213Updated last year
- An experimental implementation of the retrieval-enhanced language model☆74Updated 2 years ago
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆81Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- Scripts for fine-tuning Llama2 via SFT and DPO.☆201Updated last year
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 9 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆103Updated last month
- QLoRA: Efficient Finetuning of Quantized LLMs☆78Updated last year
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Wrapper to easily generate the chat template for Llama2☆65Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆43Updated 2 years ago
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel☆24Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆206Updated 10 months ago
- ☆95Updated last year
- ☆105Updated 2 years ago
- ☆149Updated 4 years ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Multipack distributed sampler for fast padding-free training of LLMs☆191Updated 10 months ago
- inference code for mixtral-8x7b-32kseqlen☆100Updated last year
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆119Updated 2 years ago
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆58Updated 2 years ago
- ☆458Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆162Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆65Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago