abacaj / fine-tune-mistralLinks

Fine-tune mistral-7B on 3090s, a100s, h100s

☆717

Alternatives and similar repositories for fine-tune-mistral

Users that are interested in fine-tune-mistral are comparing it to the libraries listed below

Sorting:

VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆502Updated last year
rmihaylov / falcontune
Tune any FALCON in 4-bit
☆465Updated last year
SkunkworksAI / hydra-moe
☆416Updated last year
jondurbin / airoboros
Customizable implementation of the self-instruct paper.
☆1,048Updated last year
alasdairforsythe / tokenmonster
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
☆595Updated last year
tomaarsen / attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆702Updated last year
mistralai / megablocks-public
☆864Updated last year
modal-labs / llm-finetuning
Guide for fine-tuning Llama/Mistral/CodeLlama models and more
☆613Updated 3 months ago
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆415Updated last year
jondurbin / bagel
A bagel, with everything.
☆323Updated last year
kuleshov-group / llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
☆727Updated last year
PiotrNawrot / nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,006Updated 11 months ago
pbelcak / UltraFastBERT
The repository for the code of the UltraFastBERT paper
☆516Updated last year
apoorvumang / prompt-lookup-decoding
☆556Updated 11 months ago
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628Updated last year
abertsch72 / unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
☆1,061Updated last year
sabetAI / BLoRA
batched loras
☆344Updated last year
mistralai-sf24 / hackathon
☆447Updated last year
tysam-code / hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…
☆349Updated last year
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆649Updated last year
FastEval / FastEval
Fast & more realistic evaluation of chat language models. Includes leaderboard.
☆187Updated last year
dzhulgakov / llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
☆371Updated last year
philschmid / easyllm
☆461Updated last year
vgel / repeng
A library for making RepE control vectors
☆622Updated 6 months ago
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232Updated 9 months ago
mistralai / mistral-common
Official inference library for pre-processing of Mistral models
☆772Updated this week
carlini / yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
☆1,024Updated 3 months ago
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,553Updated last year
abacusai / Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆591Updated last year