A repo for finetuning MPT using LoRA. It is currently configured to work with the Stanford Alpaca dataset but can easily be adapted to use another dataset.
☆18 · Jun 12, 2023 · Updated 2 years ago
Alternatives and similar repositories for LoRA-MPT
Users who are interested in LoRA-MPT are comparing it to the libraries listed below.
- Tune MPTs ☆84 · Jun 17, 2023 · Updated 2 years ago
- Tensor library for machine learning ☆17 · Jul 13, 2023 · Updated 2 years ago
- ☆16 · May 8, 2023 · Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆104 · May 20, 2025 · Updated 10 months ago
- data prep utilities for LLMs, using LLMs ☆16 · Nov 7, 2023 · Updated 2 years ago
- Code for the EMNLP 2020 long paper "Lifelong Language Knowledge Distillation" (https://arxiv.org/abs/2010.02123) ☆12 · Jul 13, 2021 · Updated 4 years ago
- Paper Implementation of Self-Rewarding Language Models ☆13 · Feb 1, 2024 · Updated 2 years ago
- Generate a multi-signature Bitcoin address in python ☆13 · Nov 6, 2022 · Updated 3 years ago
- Hill Space is All You Need ☆17 · Jul 11, 2025 · Updated 8 months ago
- Model REVOLVER, a human in the loop model mixing system. ☆33 · Aug 2, 2023 · Updated 2 years ago
- Bivariate Shapley is a Shapley-based method of identifying directional feature interactions and feature redundancy ☆20 · May 19, 2025 · Updated 10 months ago
- Patch for MPT-7B which allows using and training a LoRA ☆58 · May 20, 2023 · Updated 2 years ago
- Example Fabulous app that uses MSAL to authenticate a user on Azure Active Directory ☆11 · Dec 8, 2022 · Updated 3 years ago
- ☆10 · Nov 22, 2022 · Updated 3 years ago
- Variational Inference by Policy Search ☆13 · Apr 24, 2019 · Updated 6 years ago
- Machine translation with tinygrad ☆19 · Apr 7, 2024 · Updated last year
- NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning ☆27 · Jul 28, 2024 · Updated last year
- Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability ☆39 · Feb 23, 2026 · Updated last month
- ☆13 · May 25, 2023 · Updated 2 years ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex… ☆25 · Oct 13, 2025 · Updated 5 months ago
- Minimal MBrace setup to test out the waters, with as few dependencies as possible ☆11 · Mar 18, 2018 · Updated 8 years ago
- Scaffold application to get started with Fabulous ☆18 · Jun 23, 2022 · Updated 3 years ago
- ☆28 · Apr 14, 2024 · Updated last year
- ☆37 · May 31, 2023 · Updated 2 years ago
- BitNet a4.8 Implementation in one file of pytorch ☆21 · Jan 13, 2025 · Updated last year
- A version of the default Fabulous app template running on Linux and macOS ☆12 · Dec 5, 2020 · Updated 5 years ago
- Low Code for react native ☆12 · Jun 23, 2020 · Updated 5 years ago
- Port of Facebook's LLaMA model in C/C++ ☆11 · Oct 25, 2025 · Updated 5 months ago
- ☆10 · Dec 11, 2021 · Updated 4 years ago
- Just some dumb code to get TicTacToe working in F# and Xamarin Forms ☆13 · Apr 13, 2018 · Updated 7 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jun 1, 2023 · Updated 2 years ago
- Black Box Variational Inference for Bayesian logistic regression ☆18 · Apr 1, 2017 · Updated 8 years ago
- ☆13 · Jun 7, 2023 · Updated 2 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆34 · Aug 14, 2024 · Updated last year
- Simply extract JSON values from HTTP requests without defining intermediate types or using model binding ☆20 · Jun 3, 2021 · Updated 4 years ago
- A curated list of Bolero samples, community projects, blogs and tutorials ☆12 · Jul 8, 2020 · Updated 5 years ago
- RPi SD Card Image for IoT LoRa Range ☆11 · Mar 5, 2020 · Updated 6 years ago
- ☆16 · Jul 20, 2023 · Updated 2 years ago
- MBrace runtime implementation targeting Amazon Web Services ☆13 · Oct 19, 2017 · Updated 8 years ago