rmihaylov / mpttuneLinks

Tune MPTs

☆84

Alternatives and similar repositories for mpttune

Users that are interested in mpttune are comparing it to the libraries listed below

Sorting:

jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆76Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆93Updated last month
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆103Updated 5 months ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆177Updated last year
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated 2 years ago
geov-ai / geov
The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121Updated 2 years ago
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆169Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated 2 years ago
euclaise / supertrainer2000
☆50Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆231Updated 11 months ago
qwopqwop200 / gptqlora
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
☆102Updated 2 years ago
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆115Updated 2 years ago
Rallio67 / language-model-agents
Experiments with generating opensource language model assistants
☆97Updated 2 years ago
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆146Updated 2 years ago
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆66Updated last year
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆208Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated 2 years ago
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆99Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
☆73Updated 2 years ago
jondurbin / bagel
A bagel, with everything.
☆324Updated last year
QuixiAI / grokadamw
☆136Updated last year
hamelsmu / llama-inference
experiments with inference on llama
☆103Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Updated 2 years ago
abacaj / train-with-fsdp
☆94Updated 2 years ago
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆118Updated 2 years ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆118Updated last year
imoneoi / multipack
Multipack distributed sampler for fast padding-free training of LLMs
☆201Updated last year
uukuguy / multi_loras
Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…
☆158Updated last year
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆102Updated last year