Fine-tuning LLMs using QLoRA
☆268Jun 8, 2024Updated last year
Alternatives and similar repositories for llm_qlora
Users that are interested in llm_qlora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Oct 24, 2023Updated 2 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Mar 16, 2026Updated last month
- ☆167Jun 1, 2023Updated 2 years ago
- A pytest plugin to organize and track algorithm visualizations☆18Dec 1, 2024Updated last year
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,885Jan 28, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Customizable implementation of the self-instruct paper.☆1,050Mar 7, 2024Updated 2 years ago
- ☆13May 25, 2023Updated 2 years ago
- A bagel, with everything.☆326Apr 11, 2024Updated 2 years ago
- Kon is a minimal coding agent (and also a highly opinionated one)☆262Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,870Jun 10, 2024Updated last year
- A prompt/context management system☆168May 8, 2023Updated 2 years ago
- PDF Extraction Toolkit (wraps and trains LayoutLM)☆10Oct 8, 2021Updated 4 years ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated 11 months ago
- SAIL: Search Augmented Instruction Learning☆159Jul 22, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- Go ahead and axolotl questions☆11,688Updated this week
- ☆40Jun 3, 2025Updated 10 months ago
- ☆52Feb 8, 2026Updated 2 months ago
- Democratizing access to LLMs for the open-source community. Let's advance AI, together.☆29Sep 2, 2023Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆241May 26, 2024Updated last year
- LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration.☆56Jan 5, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- pip install poai☆14Mar 2, 2026Updated last month
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆105Apr 13, 2026Updated last week
- A macOS version of the oobabooga gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA…☆24Mar 7, 2026Updated last month
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Jul 4, 2022Updated 3 years ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆21Oct 8, 2024Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,915Sep 30, 2023Updated 2 years ago
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆81Dec 15, 2023Updated 2 years ago
- ☆120Dec 18, 2024Updated last year
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆181Jan 31, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Training LLMs with QLoRA + FSDP☆1,541Nov 9, 2024Updated last year
- A general purpose library for training any type of GPT model.☆11Jun 13, 2023Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Jul 10, 2024Updated last year
- A simple updated colab doc that will allow you to run the Ooba Booga Text-Generation-Webui for free with just a few lines of codes.☆25Sep 30, 2024Updated last year
- ☆16Mar 12, 2024Updated 2 years ago
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆91Jun 19, 2023Updated 2 years ago
- Tune MPTs☆84Jun 17, 2023Updated 2 years ago