QLoRA: Efficient Finetuning of Quantized LLMs
☆79Apr 10, 2024Updated last year
Alternatives and similar repositories for qlora
Users that are interested in qlora are comparing it to the libraries listed below
Sorting:
- A bagel, with everything.☆326Apr 11, 2024Updated last year
- Customizable implementation of the self-instruct paper.☆1,049Mar 7, 2024Updated 2 years ago
- ☆27Aug 30, 2023Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆12Jun 25, 2024Updated last year
- ☆74Sep 5, 2023Updated 2 years ago
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Code for fine-tuning Platypus fam LLMs using LoRA☆629Feb 4, 2024Updated 2 years ago
- LLMs as Collaboratively Edited Knowledge Bases☆46Feb 8, 2026Updated 3 weeks ago
- a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas…☆18Mar 14, 2025Updated 11 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆42Jun 1, 2023Updated 2 years ago
- Analyzing LLM Alignment via Token distribution shift☆17Jan 26, 2024Updated 2 years ago
- Measuring and Controlling Persona Drift in Language Model Dialogs☆21Feb 26, 2024Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- Training LLMs with QLoRA + FSDP☆1,538Nov 9, 2024Updated last year
- ☆78Dec 26, 2023Updated 2 years ago
- The implement of paper:"Large Language Model Enhanced Collaborative Filtering" accepted by CIKM 2024☆22Jul 28, 2024Updated last year
- Official Repository for Efficient Linear-Time Attention Transformers.☆18Jun 2, 2024Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆180May 2, 2024Updated last year
- Experiments on speculative sampling with Llama models☆128Jun 8, 2023Updated 2 years ago
- FuseAI Project☆590Jan 25, 2025Updated last year
- Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inferen…☆19Sep 5, 2023Updated 2 years ago
- Cookbooks showcasing various applications of Cleanlab☆22Jan 20, 2026Updated last month
- ☆21Oct 6, 2023Updated 2 years ago
- ☆53Jul 20, 2025Updated 7 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- ☆46Jun 11, 2025Updated 8 months ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆409May 17, 2024Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Feb 9, 2024Updated 2 years ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆214Mar 5, 2024Updated 2 years ago
- YaRN: Efficient Context Window Extension of Large Language Models☆1,676Apr 17, 2024Updated last year
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆473Apr 21, 2024Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆145Sep 10, 2023Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)☆145Sep 20, 2024Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode☆145Aug 7, 2025Updated 7 months ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 5 months ago