thisserand / alpaca-lora-finetune-languageLinks
☆122Updated last year
Alternatives and similar repositories for alpaca-lora-finetune-language
Users that are interested in alpaca-lora-finetune-language are comparing it to the libraries listed below
Sorting:
- Patch for MPT-7B which allows using and training a LoRA☆58Updated 2 years ago
- Tune any FALCON in 4-bit☆467Updated last year
- ☆167Updated 2 years ago
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆81Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆78Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆185Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated 2 years ago
- Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a 40GB A100 GPU)☆38Updated 2 years ago
- Finetune BLOOM☆40Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Repo for fine-tuning Casual LLMs☆456Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆162Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆103Updated last month
- Local LLM ReAct Agent with Guidance☆158Updated 2 years ago
- llama-4bit-colab☆64Updated 2 years ago
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago
- Finetune any model on HF in less than 30 seconds☆57Updated 2 months ago
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆352Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆171Updated 2 years ago
- The Next Generation Multi-Modality Superintelligence☆71Updated 9 months ago
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated 2 years ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago
- ☆64Updated 2 years ago
- ☆276Updated 2 years ago
- Large Language Model (LLM) Inference API and Chatbot☆126Updated last year
- Merge Transformers language models by use of gradient parameters.☆206Updated 10 months ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆302Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆168Updated last year
- Multi-Domain Expert Learning☆67Updated last year