QLoRA: Efficient Finetuning of Quantized LLMs
☆79Apr 10, 2024Updated 2 years ago
Alternatives and similar repositories for qlora
Users who are interested in qlora are comparing it to the libraries listed below.
- A bagel, with everything.☆326Apr 11, 2024Updated 2 years ago
- Customizable implementation of the self-instruct paper.☆1,050Mar 7, 2024Updated 2 years ago
- ☆74Sep 5, 2023Updated 2 years ago
- ☆28Aug 30, 2023Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆43Jun 1, 2023Updated 2 years ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- Code for fine-tuning Platypus fam LLMs using LoRA☆628Feb 4, 2024Updated 2 years ago
- ☆16Feb 21, 2026Updated last month
- Demonstration that finetuning a RoPE model on longer sequences than it was pre-trained on extends the model's context limit☆63Jun 21, 2023Updated 2 years ago
- ☆167Jun 1, 2023Updated 2 years ago
- Training PRO extension for oobabooga WebUI - recent dev version☆52Aug 7, 2025Updated 8 months ago
- ☆54Jun 11, 2023Updated 2 years ago
- A Python library for efficient and flexible cycle-consistency training of transformer models via iterative back-translation. Memory and co…☆11Jan 13, 2025Updated last year
- Experiments related to LLMs for Vietnamese (inspired by the Physics of LLMs series)☆11Oct 21, 2024Updated last year
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,636Sep 15, 2023Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- YaRN: Efficient Context Window Extension of Large Language Models☆1,690Apr 17, 2024Updated last year
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Inference code for LLaMA models☆21Apr 3, 2025Updated last year
- ☆131Oct 1, 2024Updated last year
- extension for text WebUI☆20Aug 7, 2025Updated 8 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)☆145Sep 20, 2024Updated last year
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 6 months ago
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- ☆78Dec 26, 2023Updated 2 years ago
- Training LLMs with QLoRA + FSDP☆1,538Nov 9, 2024Updated last year
- FuseAI Project☆592Jan 25, 2025Updated last year
- reimagine the implementation of C-3PO droid voice synthesizer and multilingual translation and communication capabilities with the latest…☆12Mar 6, 2024Updated 2 years ago
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- Experiments on speculative sampling with Llama models☆128Jun 8, 2023Updated 2 years ago
- ☆40Mar 25, 2023Updated 3 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,865Jun 10, 2024Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,915Sep 30, 2023Updated 2 years ago
- Exploring finetuning public checkpoints on filtered 8K sequences from the Pile☆116Mar 22, 2023Updated 3 years ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition☆668Jul 22, 2024Updated last year
- spaCy entry points for Curated Transformers☆32Mar 27, 2026Updated 2 weeks ago