Fine-tuning LLMs using QLoRA
☆269Jun 8, 2024Updated last year
Alternatives and similar repositories for llm_qlora
Users that are interested in llm_qlora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Apr 27, 2026Updated last week
- ☆167Jun 1, 2023Updated 2 years ago
- A pytest plugin to organize and track algorithm visualizations☆18Dec 1, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,886Jan 28, 2024Updated 2 years ago
- A workbench application to test out different prompts on a variety of AI models to see how they perform☆16Feb 9, 2025Updated last year
- Customizable implementation of the self-instruct paper.☆1,053Mar 7, 2024Updated 2 years ago
- ☆13May 25, 2023Updated 2 years ago
- A bagel, with everything.☆326Apr 11, 2024Updated 2 years ago
- Kon is a minimal coding agent (and also a highly opinionated one)☆293Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,901Jun 10, 2024Updated last year
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated last year
- A prompt/context management system☆168May 8, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- SAIL: Search Augmented Instruction Learning☆160Jul 22, 2025Updated 9 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- Go ahead and axolotl questions☆11,842May 1, 2026Updated last week
- ☆42Jun 3, 2025Updated 11 months ago
- Democratizing access to LLMs for the open-source community. Let's advance AI, together.☆29Sep 2, 2023Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- pip install poai☆14Mar 2, 2026Updated 2 months ago
- LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration.☆58Jan 5, 2025Updated last year
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆105Updated this week
- A macOS version of the oobabooga gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA…☆24Mar 7, 2026Updated 2 months ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆21Oct 8, 2024Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Jul 4, 2022Updated 3 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,915Sep 30, 2023Updated 2 years ago
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆81Dec 15, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆122Dec 18, 2024Updated last year
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆181Jan 31, 2024Updated 2 years ago
- Training LLMs with QLoRA + FSDP☆1,542Nov 9, 2024Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆52Jul 10, 2024Updated last year
- A simple updated colab doc that will allow you to run the Ooba Booga Text-Generation-Webui for free with just a few lines of codes.☆25Sep 30, 2024Updated last year
- A local version of WebSim.AI, the prompt to webpage engine. Infinite possibilities to cure your boredom. ( I made it so you dont have to…☆58Jul 19, 2024Updated last year
- ☆16Mar 12, 2024Updated 2 years ago