lxe / llama-peft-tuner
Tune LLaMA-7B on the Alpaca dataset using PEFT / LoRA, based on @zphang's https://github.com/zphang/minimal-llama scripts.
☆25 · Updated 2 years ago
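The PEFT / LoRA technique named above freezes the pretrained weights and learns only a low-rank additive update. A minimal NumPy sketch of the core idea (illustrative only; the repository itself applies Hugging Face `peft` to LLaMA-7B, and all sizes and names below are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(0)

d_out, d_in, r = 16, 32, 4            # hypothetical layer sizes and LoRA rank
alpha = 8                             # LoRA scaling hyperparameter

W = rng.normal(size=(d_out, d_in))    # frozen pretrained weight (not trained)
A = rng.normal(size=(r, d_in)) * 0.01 # trainable down-projection
B = np.zeros((d_out, r))              # trainable up-projection, zero-init so the update starts at 0

def lora_forward(x):
    # y = W x + (alpha / r) * B A x  -- only A and B would receive gradients in real training
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=d_in)
# With B zero-initialized, the adapted layer initially matches the frozen layer exactly.
assert np.allclose(lora_forward(x), W @ x)
```

Because `B` starts at zero, training begins from the unmodified base model, and only the `r * (d_in + d_out)` adapter parameters are updated rather than the full `d_out * d_in` weight matrix, which is what makes the approach parameter-efficient.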
Alternatives and similar repositories for llama-peft-tuner
Users interested in llama-peft-tuner are comparing it to the libraries listed below.
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA ☆81 · Updated last year
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following ☆79 · Updated 8 months ago
- LLaMA Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers ☆51 · Updated 2 years ago
- ☆97 · Updated last year
- An experimental implementation of the retrieval-enhanced language model ☆74 · Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆78 · Updated last year
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation" ☆226 · Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆121 · Updated last year
- ☆179 · Updated 2 years ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆43 · Updated last year
- Exploring finetuning public checkpoints on filtered 8K sequences on Pile ☆115 · Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆103 · Updated 9 months ago
- Reverse Instructions to generate instruction tuning data with corpus examples ☆211 · Updated last year
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel ☆24 · Updated 2 years ago
- ☆51 · Updated last year
- Prompt tuning toolkit for GPT-2 and GPT-Neo ☆88 · Updated 3 years ago
- Multipack distributed sampler for fast padding-free training of LLMs ☆189 · Updated 9 months ago
- ☆67 · Updated last year
- Patch for MPT-7B which allows using and training a LoRA ☆58 · Updated last year
- Instruct-tuning LLaMA on consumer hardware ☆66 · Updated 2 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF) ☆173 · Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 · Updated last year
- A Google Colab notebook for fine-tuning Alpaca LoRA (within 3 hours on a 40GB A100 GPU) ☆38 · Updated 2 years ago
- All available datasets for Instruction Tuning of Large Language Models ☆250 · Updated last year
- The corresponding code from our paper "REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t… ☆70 · Updated last year
- ☆106 · Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆219 · Updated last year
- 🥤🧑🏻🚀 Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization… ☆230 · Updated last year
- Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression ☆66 · Updated 2 years ago
- Inference code for facebook LLaMA models with Wrapyfi support ☆130 · Updated 2 years ago