jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆79 · Apr 10, 2024 · Updated last year
Alternatives and similar repositories for qlora
Users who are interested in qlora are comparing it to the libraries listed below.
- A bagel, with everything. ☆326 · Apr 11, 2024 · Updated last year
- Customizable implementation of the self-instruct paper. ☆1,050 · Mar 7, 2024 · Updated last year
- ☆27 · Aug 30, 2023 · Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support ☆37 · Aug 8, 2023 · Updated 2 years ago
- code for training and using chess embeddings models ☆13 · Jun 9, 2024 · Updated last year
- ☆74 · Sep 5, 2023 · Updated 2 years ago
- Train Llama Loras Easily ☆31 · Aug 3, 2023 · Updated 2 years ago
- Code for fine-tuning Platypus fam LLMs using LoRA ☆629 · Feb 4, 2024 · Updated 2 years ago
- LLMs as Collaboratively Edited Knowledge Bases ☆46 · Feb 8, 2026 · Updated last week
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 · Feb 22, 2024 · Updated last year
- a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas… ☆18 · Mar 14, 2025 · Updated 11 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model ☆42 · Jun 1, 2023 · Updated 2 years ago
- ☆40 · Mar 25, 2023 · Updated 2 years ago
- Measuring and Controlling Persona Drift in Language Model Dialogs ☆21 · Feb 26, 2024 · Updated last year
- Merge Transformers language models by use of gradient parameters. ☆214 · Aug 8, 2024 · Updated last year
- Training LLMs with QLoRA + FSDP ☆1,539 · Nov 9, 2024 · Updated last year
- Official Repository for Efficient Linear-Time Attention Transformers. ☆18 · Jun 2, 2024 · Updated last year
- ☆78 · Dec 26, 2023 · Updated 2 years ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆81 · Jan 18, 2024 · Updated 2 years ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆180 · May 2, 2024 · Updated last year
- Experiments on speculative sampling with Llama models ☆128 · Jun 8, 2023 · Updated 2 years ago
- FuseAI Project ☆587 · Jan 25, 2025 · Updated last year
- Cookbooks showcasing various applications of Cleanlab ☆22 · Jan 20, 2026 · Updated 3 weeks ago
- ☆21 · Oct 6, 2023 · Updated 2 years ago
- Inference code for LLaMA models ☆21 · Apr 3, 2025 · Updated 10 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆48 · Jan 17, 2024 · Updated 2 years ago
- ☆46 · Jun 11, 2025 · Updated 8 months ago
- ☆52 · Jul 20, 2025 · Updated 6 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 · Jun 21, 2023 · Updated 2 years ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning ☆409 · May 17, 2024 · Updated last year
- Evaluating the faithfulness of long-context language models ☆30 · Oct 21, 2024 · Updated last year
- ☆130 · Oct 1, 2024 · Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 · Feb 9, 2024 · Updated 2 years ago
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,669 · Apr 17, 2024 · Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples ☆215 · Mar 5, 2024 · Updated last year
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates ☆473 · Apr 21, 2024 · Updated last year
- Image Diffusion block merging technique applied to transformers based Language Models. ☆56 · May 8, 2023 · Updated 2 years ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer ☆1,630 · Sep 15, 2023 · Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆144 · Sep 10, 2023 · Updated 2 years ago