A repo for finetuning MPT using LoRA. It is currently configured to work with the Alpaca dataset from Stanford but can easily be adapted to use another.
☆18 · Jun 12, 2023 · Updated 2 years ago
Alternatives and similar repositories for LoRA-MPT
Users that are interested in LoRA-MPT are comparing it to the libraries listed below.
- Tune MPTs ☆84 · Jun 17, 2023 · Updated 2 years ago
- ☆16 · May 8, 2023 · Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆105 · Updated this week
- data prep utilities for LLMs, using LLMs ☆16 · Nov 7, 2023 · Updated 2 years ago
- ☆19 · Jun 7, 2023 · Updated 2 years ago
- Paper Implementation of Self-Rewarding Language Models ☆13 · Feb 1, 2024 · Updated 2 years ago
- ☆11 · Sep 16, 2023 · Updated 2 years ago
- Model REVOLVER, a human in the loop model mixing system. ☆33 · Aug 2, 2023 · Updated 2 years ago
- Hill Space is All You Need ☆17 · Jul 11, 2025 · Updated 9 months ago
- ☆35 · Apr 8, 2023 · Updated 3 years ago
- Bivariate Shapley is a Shapley-based method of identifying directional feature interactions and feature redundancy ☆20 · May 19, 2025 · Updated 11 months ago
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time ☆26 · Jul 22, 2025 · Updated 9 months ago
- Study materials about "Deep Learning for Molecular Applications". ☆15 · Aug 5, 2019 · Updated 6 years ago
- ☆13 · Apr 17, 2024 · Updated 2 years ago
- Patch for MPT-7B which allows using and training a LoRA ☆58 · May 20, 2023 · Updated 2 years ago
- Example Fabulous app that uses MSAL to authenticate a user on Azure Active Directory ☆11 · Dec 8, 2022 · Updated 3 years ago
- LLMON (pronounced limón) is a structured data format optimized for large language models ☆33 · Jul 17, 2023 · Updated 2 years ago
- ☆13 · May 25, 2023 · Updated 2 years ago
- Machine translation with tinygrad ☆19 · Apr 7, 2024 · Updated 2 years ago
- Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability ☆40 · Feb 23, 2026 · Updated 2 months ago
- ☆18 · Mar 12, 2019 · Updated 7 years ago
- [EMNLP2022] Source code for Neural Machine Translation with Contrastive Translation Memories ☆12 · Feb 15, 2023 · Updated 3 years ago
- Minimal MBrace setup to test out the waters, with as few dependencies as possible ☆11 · Mar 18, 2018 · Updated 8 years ago
- Scaffold application to get started with Fabulous ☆18 · Jun 23, 2022 · Updated 3 years ago
- ☆28 · Apr 14, 2024 · Updated 2 years ago
- Retrieval Augmented Generation (RAG) Application to offer Q&A like experience on a long format text using OpenAI and ElasticSearch as Vec… ☆23 · Mar 27, 2024 · Updated 2 years ago
- A Santa tracker app built using Fabulous ☆15 · Dec 5, 2020 · Updated 5 years ago
- Low Code for react native ☆12 · Jun 23, 2020 · Updated 5 years ago
- Prisoner's Dilemma research environment. ☆17 · Jul 9, 2021 · Updated 4 years ago
- BitNet a4.8 Implementation in one file of pytorch ☆21 · Jan 13, 2025 · Updated last year
- ☆10 · Jun 8, 2024 · Updated last year
- Port of Facebook's LLaMA model in C/C++ ☆11 · Apr 10, 2026 · Updated 3 weeks ago
- ☆10 · Dec 11, 2021 · Updated 4 years ago
- Just some dumb code to get TicTacToe working in F# and Xamarin Forms ☆13 · Apr 13, 2018 · Updated 8 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jun 1, 2023 · Updated 2 years ago
- ☆13 · Jun 7, 2023 · Updated 2 years ago
- This repository contains implementation of DHNE : Network Representation Learning Method for Dynamic Heterogeneous Network. ☆10 · May 11, 2019 · Updated 6 years ago
- Simply extract JSON values from HTTP requests without defining intermediate types or using model binding ☆20 · Jun 3, 2021 · Updated 4 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆35 · Aug 14, 2024 · Updated last year