A repo for finetuning MPT using LoRA. It is currently configured to work with the Alpaca dataset from Stanford but can easily be adapted to use another.
☆18Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for LoRA-MPT
Users that are interested in LoRA-MPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tune MPTs☆84Jun 17, 2023Updated 2 years ago
- Tensor library for machine learning☆17Jul 13, 2023Updated 2 years ago
- ☆16May 8, 2023Updated 3 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆105May 6, 2026Updated 3 weeks ago
- data prep utilities for LLMs, using LLMs☆16Nov 7, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Oct 3, 2024Updated last year
- Paper Implementation of Self-Rewarding Language Models☆13Feb 1, 2024Updated 2 years ago
- Model REVOLVER, a human in the loop model mixing system.☆33Aug 2, 2023Updated 2 years ago
- Hill Space is All You Need☆17Jul 11, 2025Updated 10 months ago
- Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.☆16Apr 22, 2023Updated 3 years ago
- ☆35Apr 8, 2023Updated 3 years ago
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆26Jul 22, 2025Updated 10 months ago
- Study materials about "Deep Learning for Molecular Applications".☆15Aug 5, 2019Updated 6 years ago
- Reproduction study of Grassmann Flows for sequence modeling (arXiv 2512.19428). Shows 22.6% gap vs claimed 10-15%, includes CUDA kernels …☆30Dec 26, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Patch for MPT-7B which allows using and training a LoRA☆58May 20, 2023Updated 3 years ago
- ☆13Apr 17, 2024Updated 2 years ago
- Example Fabulous app that uses MSAL to authenticate a user on Azure Active Directory☆11Dec 8, 2022Updated 3 years ago
- Variational Inference by Policy Search☆13Apr 24, 2019Updated 7 years ago
- ☆10Nov 22, 2022Updated 3 years ago
- ☆14May 25, 2023Updated 3 years ago
- LLMON (pronounced limón) is a structured data format optimized for large language models☆33Jul 17, 2023Updated 2 years ago
- Machine translation with tinygrad☆19Apr 7, 2024Updated 2 years ago
- NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning☆28Jul 28, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability☆41Feb 23, 2026Updated 3 months ago
- ☆18Mar 12, 2019Updated 7 years ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆25Oct 13, 2025Updated 7 months ago
- ☆43Aug 2, 2025Updated 9 months ago
- Minimal MBrace setup to test out the waters, with as few dependencies as possible☆11Mar 18, 2018Updated 8 years ago
- Scaffold application to get started with Fabulous☆18Jun 23, 2022Updated 3 years ago
- ☆28Apr 14, 2024Updated 2 years ago
- ☆37May 31, 2023Updated 2 years ago
- Retrieval Augmented Generation (RAG) Application to offer Q&A like experience on a long format text using OpenAI and ElasticSearch as Vec…☆23Mar 27, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆17Mar 28, 2025Updated last year
- Low Code for react native☆12Jun 23, 2020Updated 5 years ago
- BitNet a4.8 Implementation in one file of pytorch☆21Jan 13, 2025Updated last year
- Port of Facebook's LLaMA model in C/C++☆11Apr 10, 2026Updated last month
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jun 1, 2023Updated 2 years ago
- Black Box Variational Inference for Bayesian logistic regression☆18Apr 1, 2017Updated 9 years ago
- ☆13Jun 7, 2023Updated 2 years ago