A repo for finetuning MPT using LoRA. It is currently configured to work with the Alpaca dataset from Stanford but can easily be adapted to use another.
☆18Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for LoRA-MPT
Users that are interested in LoRA-MPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tune MPTs☆84Jun 17, 2023Updated 2 years ago
- Tensor library for machine learning☆17Jul 13, 2023Updated 2 years ago
- ☆16May 8, 2023Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆105Updated this week
- Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123☆12Jul 13, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Generate a multi-signature Bitcoin address in python☆13Nov 6, 2022Updated 3 years ago
- ☆11Sep 16, 2023Updated 2 years ago
- Model REVOLVER, a human in the loop model mixing system.☆33Aug 2, 2023Updated 2 years ago
- Bivariate Shapley is a Shapley-based method of identifying directional feature interactions and feature redundancy☆20May 19, 2025Updated 10 months ago
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆26Jul 22, 2025Updated 8 months ago
- Study materials about "Deep Learning for Molecular Applications".☆15Aug 5, 2019Updated 6 years ago
- ☆13Apr 17, 2024Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58May 20, 2023Updated 2 years ago
- Example Fabulous app that uses MSAL to authenticate a user on Azure Active Directory☆11Dec 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Nov 22, 2022Updated 3 years ago
- ☆13May 25, 2023Updated 2 years ago
- Machine translation with tinygrad☆19Apr 7, 2024Updated 2 years ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆25Oct 13, 2025Updated 6 months ago
- ☆18Mar 12, 2019Updated 7 years ago
- [EMNLP2022] Source code for Neural Machine Translation with Contrastive Translation Memories☆12Feb 15, 2023Updated 3 years ago
- Minimal MBrace setup to test out the waters, with as few dependencies as possible☆11Mar 18, 2018Updated 8 years ago
- Scaffold application to get started with Fabulous☆18Jun 23, 2022Updated 3 years ago
- ☆37May 31, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Santa tracker app build using Fabulous☆15Dec 5, 2020Updated 5 years ago
- ☆17Mar 28, 2025Updated last year
- Low Code for react native☆12Jun 23, 2020Updated 5 years ago
- BitNet a4.8 Implementation in one file of pytorch☆21Jan 13, 2025Updated last year
- ☆10Jun 8, 2024Updated last year
- Port of Facebook's LLaMA model in C/C++☆11Updated this week
- Just some dumb code to get TicTacToe working in F# and Xamarin Forms☆13Apr 13, 2018Updated 8 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jun 1, 2023Updated 2 years ago
- ☆13Jun 7, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository contains implementation of DHNE : Network Representation Learning Method for Dynamic Heterogeneous Network.☆10May 11, 2019Updated 6 years ago
- Simply extract JSON values from HTTP requests without defining intermediate types or using model binding☆20Jun 3, 2021Updated 4 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆34Aug 14, 2024Updated last year
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)☆21Jul 26, 2023Updated 2 years ago
- A curated list of Bolero samples, community projects, blogs and tutorials☆12Jul 8, 2020Updated 5 years ago
- Minecraft: Authentic Adventure, a Minecraft 1.2.5 mod with beta-esque features.☆62Oct 3, 2024Updated last year
- RPi SD Card Image for IoT LoRa Range☆11Mar 5, 2020Updated 6 years ago