Fine-tuning LLMs using QLoRA
☆268Jun 8, 2024Updated last year
Alternatives and similar repositories for llm_qlora
Users that are interested in llm_qlora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Mar 16, 2026Updated 2 weeks ago
- ☆167Jun 1, 2023Updated 2 years ago
- A pytest plugin to organize and track algorithm visualizations☆18Dec 1, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,883Jan 28, 2024Updated 2 years ago
- Customizable implementation of the self-instruct paper.☆1,052Mar 7, 2024Updated 2 years ago
- ☆13May 25, 2023Updated 2 years ago
- Kon is a minimal coding agent (and also a highly opinionated one)☆189Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,861Jun 10, 2024Updated last year
- A prompt/context management system☆168May 8, 2023Updated 2 years ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated 10 months ago
- SAIL: Search Augmented Instruction Learning☆159Jul 22, 2025Updated 8 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- Go ahead and axolotl questions☆11,508Updated this week
- ☆40Jun 3, 2025Updated 9 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆241May 26, 2024Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration.☆58Jan 5, 2025Updated last year
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆105Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A macOS version of the oobabooga gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA…☆24Mar 7, 2026Updated 3 weeks ago
- ☆15Mar 12, 2024Updated 2 years ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆21Oct 8, 2024Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,912Sep 30, 2023Updated 2 years ago
- ☆119Dec 18, 2024Updated last year
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆81Dec 15, 2023Updated 2 years ago
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆180Jan 31, 2024Updated 2 years ago
- Training LLMs with QLoRA + FSDP☆1,539Nov 9, 2024Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆52Jul 10, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A simple MNIST data reader in pure Java☆21Sep 28, 2016Updated 9 years ago
- A local version of WebSim.AI, the prompt to webpage engine. Infinite possibilities to cure your boredom. ( I made it so you dont have to…☆58Jul 19, 2024Updated last year
- A vllm proxy server to add security and multi model management for vllm servers☆12May 30, 2024Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆91Jun 19, 2023Updated 2 years ago
- Simple debug menu for Unity☆17Sep 30, 2023Updated 2 years ago
- Tune MPTs☆84Jun 17, 2023Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data☆509Oct 19, 2023Updated 2 years ago