iwalton3/mpt-lora-patch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iwalton3/mpt-lora-patch)

iwalton3 / mpt-lora-patch

Patch for MPT-7B which allows using and training a LoRA

☆57

Alternatives and similar repositories for mpt-lora-patch

Users that are interested in mpt-lora-patch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

practical-dreamer / vicuna_to_alpacan
View on GitHub
Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer
☆12Jun 21, 2023Updated 3 years ago
leehanchung / SMILE-factory
View on GitHub
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆106Updated this week
ConiferLabsWA / flan-ul2-dolly
View on GitHub
☆34Apr 23, 2023Updated 3 years ago
vihangd / alpaca-qlora
View on GitHub
Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA
☆80Dec 15, 2023Updated 2 years ago
pikalover6 / openassistant.cpp
View on GitHub
☆15May 8, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
rhulha / lora
View on GitHub
Train Large Language Models (LLM) using LoRA
☆26May 22, 2023Updated 3 years ago
soochan-lee / RoT
View on GitHub
Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…
☆45Jun 13, 2023Updated 3 years ago
ConiferLabsWA / flan-ul2-alpaca
View on GitHub
☆33Apr 23, 2023Updated 3 years ago
mikeybellissimo / LoRA-MPT
View on GitHub
A repo for finetuning MPT using LoRA. It is currently configured to work with the Alpaca dataset from Stanford but can easily be adapted …
☆18Jun 12, 2023Updated 3 years ago
Agora-Lab-AI / The-Distiller
View on GitHub
Generate High Quality textual or multi-modal datasets with Agents
☆18Jun 7, 2023Updated 3 years ago
SkunkworksAI / CodeFusion
View on GitHub
☆14Oct 31, 2023Updated 2 years ago
andrewgcodes / vec2vec
View on GitHub
☆17Jun 20, 2023Updated 3 years ago
cadovid / nlp4rare
View on GitHub
This repository contains the data and code created under the project NLP4Rare-cm-uc3m.
☆10Sep 14, 2021Updated 4 years ago
dynamiccreator / lora_scripts
View on GitHub
This repo helps to transform text into a better form for lora training
☆12Apr 9, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
bupticybee / FastLoRAChat
View on GitHub
Instruct-tune LLaMA on consumer hardware with shareGPT data
☆124Apr 20, 2023Updated 3 years ago
biological-alignment-benchmarks / Manipulative-Expression-Recognition
View on GitHub
MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. …
☆15Jan 16, 2026Updated 6 months ago
eugenepentland / landmark-attention-qlora
View on GitHub
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Jun 16, 2023Updated 3 years ago
sradc / pretraining-BERT
View on GitHub
Pre-train BERT from scratch, with HuggingFace. Accompanies the blog post: sidsite.com/posts/bert-from-scratch
☆45May 20, 2025Updated last year
voidism / L2KD
View on GitHub
Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123
☆12Jul 13, 2021Updated 5 years ago
byterocket / TSOwnable-Huff
View on GitHub
A Two-Step Transfer Ownable contract implemented in Huff.
☆16Aug 24, 2022Updated 3 years ago
CarperAI / autocrit
View on GitHub
A repository for transformer critique learning and generation
☆88Dec 7, 2023Updated 2 years ago
sebischair / Exploring-NLP-Research
View on GitHub
Repository of the RANLP 2023 paper "Exploring the Landscape of Natural Language Processing Research".
☆13Oct 20, 2024Updated last year
sleepingcat4 / TinyStories
View on GitHub
code to train a gpt-2 model to train it on tiny stories dataset according to the TinyStories paper
☆40Nov 24, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NO-ob / simpleLlama
View on GitHub
A Simple webserver for generating text with exllamav2
☆14Dec 18, 2023Updated 2 years ago
BlairStanek / gpt-statutes
View on GitHub
Probe how GPT-n performs on statutory reasoning
☆10Sep 17, 2024Updated last year
teddylee777 / react-voice-agent
View on GitHub
☆12Oct 3, 2024Updated last year
linydub / azureml-greenai-txtsum
View on GitHub
Samples for fine-tuning HuggingFace models with AzureML
☆10Oct 14, 2021Updated 4 years ago
fabprezja / keras-gpt-copilot
View on GitHub
Integrate an LLM copilot within your Keras model development workflow
☆28Sep 23, 2023Updated 2 years ago
ApeAcademy / ERC721
View on GitHub
Fresh and Ape-y NFT Template
☆16May 10, 2024Updated 2 years ago
j0sephsasson / fine-tune-LLMs
View on GitHub
A no-code application that enables companies to create intelligent digital assistants.
☆13Oct 9, 2023Updated 2 years ago
tmptrash / irma
View on GitHub
Digital organisms ecology system experiment
☆16May 29, 2020Updated 6 years ago
llm-jp / llm-jp-model-playground
View on GitHub
Interactive application to verify multiple LLMs
☆14Feb 20, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
alasdairforsythe / capcode
View on GitHub
Lossless normalization of uppercase characters: Go, C++ & JavaScript
☆11Jul 7, 2026Updated 2 weeks ago
xaiguy / chippy
View on GitHub
☆13Feb 26, 2023Updated 3 years ago
lando22 / GPT-3T
View on GitHub
Building language models to predict more than one token ahead to enable further ahead predictions.
☆12May 22, 2025Updated last year
BenetManzanaresSalor / LifeStepByStep
View on GitHub
A simple life simulation game developed in Unity, where the player can watch a group of entities move, grow, search, do pathfinding, eat,…
☆13Dec 2, 2020Updated 5 years ago
UKPLab / emnlp2017-claim-identification
View on GitHub
Source code repository for our EMNLP paper on cross-domain claim identification
☆14Oct 24, 2018Updated 7 years ago
Gryphe / BlockMerge_Gradient
View on GitHub
Merge Transformers language models by use of gradient parameters.
☆215Aug 8, 2024Updated last year
nik-dim / sequel
View on GitHub
A Continual Learning Library in PyTorch and JAX
☆14Apr 18, 2023Updated 3 years ago