usamec / lowmem_finetuningLinks

Low memory full parameter finetuning of LLMs

☆54

Alternatives and similar repositories for lowmem_finetuning

Users that are interested in lowmem_finetuning are comparing it to the libraries listed below

Sorting:

VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆73Updated 7 months ago
RiddleHe / llm-interp
A collection of lightweight interpretability scripts to understand how LLMs think
☆68Updated last week
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108Updated 9 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 10 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59Updated last month
apoorvkh / academic-pretraining
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆148Updated 2 months ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 11 months ago
YuchenJin / llm.c
LLM training in simple, raw C/CUDA
☆15Updated last year
kmohan321 / Research_Papers
☆46Updated 8 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆85Updated 7 months ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆84Updated 3 months ago
kurakurai / Luth
Luth is a state-of-the-art series of fine-tuned LLMs for French
☆40Updated last month
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆149Updated last year
alexzhang13 / rlm
Super basic implementation (gist-like) of RLMs with REPL environments.
☆278Updated last month
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆201Updated 6 months ago
AnswerDotAI / fastkmeans
☆86Updated 5 months ago
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆99Updated 4 months ago
areu01or00 / Tensor-Slayer
Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…
☆26Updated 6 months ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆46Updated 10 months ago
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆304Updated last month
vithursant / nanoGPT_mlx
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
☆116Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆21Updated 6 months ago
facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆112Updated last month
strangeloopcanon / LLMRank
PageRank for LLMs
☆51Updated 2 months ago
attentionmech / smolbox
smolbox of recipies
☆28Updated 7 months ago
geronimi73 / phi2-finetune
☆86Updated last year
brendanhogan / picoDeepResearch
☆68Updated 6 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆52Updated 9 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆45Updated 8 months ago