Locutusque / TinyMistral-train-evalLinks

The training notebooks that were similar to the original script used to train TinyMistral.

☆22

Alternatives and similar repositories for TinyMistral-train-eval

Users that are interested in TinyMistral-train-eval are comparing it to the libraries listed below

Sorting:

thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆178Updated last year
QuixiAI / grokadamw
☆135Updated last year
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆107Updated last year
pranavjad / tinyllama-bitnet
Train your own small bitnet model
☆75Updated 11 months ago
jadechip / nanoXLSTM
The simplest, fastest repository for training/finetuning medium-sized xLSTMs.
☆41Updated last year
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆68Updated last year
arcee-ai / PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
☆247Updated last year
fairydreaming / farel-bench
Testing LLM reasoning abilities with family relationship quizzes.
☆63Updated 8 months ago
tdrussell / qlora-pipe
A pipeline parallel training script for LLMs.
☆158Updated 5 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆27Updated 11 months ago
QuixiAI / spectrum
☆136Updated last month
cg123 / bitnet
Modeling code for a BitNet b1.58 Llama-style model.
☆25Updated last year
serp-ai / unsloth
5X faster 60% less memory QLoRA finetuning
☆21Updated last year
jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆42Updated last month
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆93Updated 5 months ago
uukuguy / multi_loras
Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…
☆158Updated last year
QuixiAI / kraken
☆67Updated last year
mkurman / grpo-llm-evaluator
Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…
☆46Updated 5 months ago
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆201Updated last year
Digitous / ModelREVOLVER
Model REVOLVER, a human in the loop model mixing system.
☆33Updated 2 years ago
nyunAI / PruneGPT
☆51Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated 2 weeks ago
LexiestLeszek / namegen
Self-contained, minimalistic implementation of a language model that generates coherent and normal sounding names. It uses an input datas…
☆51Updated last year
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆208Updated last year
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆160Updated 2 years ago
keeeeenw / MicroLlama
Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget
☆161Updated last month
Contextualist / lone-arena
Self-hosted LLM chatbot arena, with yourself as the only judge
☆41Updated last year
jukofyork / control-vectors
Genertaes control vectors for use with llama.cpp in GGUF format.
☆32Updated 6 months ago
lukasVierling / FaceRWKV
Course Project for COMP4471 on RWKV
☆17Updated last year