thisserand / alpaca-lora-finetune-language
☆121 Updated last year
Related projects
Alternatives and complementary repositories for alpaca-lora-finetune-language
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆145 Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆77 Updated 7 months ago
- Patch for MPT-7B which allows using and training a LoRA ☆58 Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ☆183 Updated last year
- Repo for fine-tuning Causal LLMs ☆449 Updated 7 months ago
- Reimplementation of the task generation part from the Alpaca paper ☆119 Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆64 Updated last month
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA ☆80 Updated 11 months ago
- Tune any FALCON in 4-bit ☆468 Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub ☆155 Updated last year
- Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a 40GB A100 GPU) ☆38 Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆161 Updated 10 months ago
- Harnessing the Memory Power of the Camelids ☆145 Updated last year
- Small finetuned LLMs for a diverse set of useful tasks ☆123 Updated last year
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning ☆295 Updated 3 weeks ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 Updated last year
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ ☆97 Updated last year
- Chat with your data privately using MPT-30b ☆182 Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆112 Updated last year
- Local LLM ReAct Agent with Guidance ☆155 Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as… ☆348 Updated last year
- Experiments with generating opensource language model assistants ☆97 Updated last year
- oobabooga text-generation-webui implementation of wafflecomposite's langchain-ask-pdf-local ☆67 Updated last year
- Exploring finetuning public checkpoints on filtered 8K sequences on Pile ☆115 Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆101 Updated 3 months ago
- Merge Transformers language models by use of gradient parameters. ☆201 Updated 3 months ago
- PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation. ☆114 Updated last year