euclaise/SlimTrainer

Full finetuning of large language models without large memory requirements

☆92

Alternatives and similar repositories for SlimTrainer

Users that are interested in SlimTrainer are comparing it to the libraries listed below.

jquesnelle / ctranslate2-rs
Rust bindings for CTranslate2
☆14 · Jun 21, 2023 · Updated 3 years ago
OpenLMLab / LOMO
LOMO: LOw-Memory Optimization
☆994 · Jul 2, 2024 · Updated 2 years ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82 · Sep 10, 2023 · Updated 2 years ago
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆240 · May 26, 2024 · Updated 2 years ago
SkunkworksAI / hydra-moe
☆416 · Nov 2, 2023 · Updated 2 years ago
GreenBitAI / low_bit_llama
Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs
☆110 · Jan 11, 2024 · Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
☆74 · Sep 5, 2023 · Updated 2 years ago
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆38 · Aug 8, 2023 · Updated 2 years ago
johnsmith0031 / alpaca_lora_4bit
☆533 · Dec 1, 2023 · Updated 2 years ago
davidbrochart / ipyhtmx
Build modern UIs in Jupyter with Python
☆12 · Dec 28, 2022 · Updated 3 years ago
simonw / scrape-huggingface-models
☆10 · Apr 21, 2024 · Updated 2 years ago
CarperAI / decontamination
This repository contains code for cleaning your training data of benchmark data to help combat data snooping.
☆28 · Apr 21, 2023 · Updated 3 years ago
VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆508 · Oct 19, 2023 · Updated 2 years ago
CarperAI / treasure_trove
☆21 · Aug 27, 2023 · Updated 2 years ago
Alignment-Lab-AI / datagen
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆32 · Sep 22, 2024 · Updated last year
CoffeeVampir3 / ez-trainer
Train Llama Loras Easily
☆30 · Aug 3, 2023 · Updated 2 years ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆181 · May 2, 2024 · Updated 2 years ago
abacaj / openhermes-function-calling
☆133 · Nov 24, 2023 · Updated 2 years ago
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆735 · Oct 11, 2023 · Updated 2 years ago
abacaj / train-with-fsdp
☆93 · Oct 5, 2023 · Updated 2 years ago
drisspg / transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
☆80 · Jul 16, 2026 · Updated last week
reactorsh / ambrosia
clean up your LLM datasets
☆113 · May 30, 2023 · Updated 3 years ago
yuhuixu1993 / qa-lora
Official PyTorch implementation of QA-LoRA
☆147 · Mar 13, 2024 · Updated 2 years ago
teknium1 / transformers-gptq-quant
☆46 · Oct 13, 2023 · Updated 2 years ago
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27 · Jul 12, 2023 · Updated 3 years ago
EleutherAI / magiCARP
One stop shop for all things carp
☆58 · Sep 9, 2022 · Updated 3 years ago
ChrisHayduk / QLoRA-for-MLM
QLoRA for Masked Language Modeling
☆23 · Sep 11, 2023 · Updated 2 years ago
JustlyAI / lmss_entity_extractor
Tool to apply Legal Matter Specification Standard (LMSS) to documents
☆12 · Aug 15, 2024 · Updated last year
xhan77 / in-context-alignment
In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning
☆34 · Aug 9, 2023 · Updated 2 years ago
Vahe1994 / SpQR
☆554 · Feb 8, 2026 · Updated 5 months ago
SebastianBodza / EnsembleForecasting
Using multiple LLMs for ensemble Forecasting
☆16 · Jan 17, 2024 · Updated 2 years ago
jondurbin / bagel
A bagel, with everything.
☆326 · Apr 11, 2024 · Updated 2 years ago
explosion / spacy-vectors-builder
🌸 Train floret vectors
☆18 · May 4, 2023 · Updated 3 years ago
taylorai / galactic
data cleaning and curation for unstructured text
☆329 · Aug 6, 2024 · Updated last year
hu-po / TubeGPT
YouTube Assistant
☆12 · May 15, 2023 · Updated 3 years ago
turboderp / exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆2,933 · Sep 30, 2023 · Updated 2 years ago
tysam-code / hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…
☆359 · Jul 29, 2024 · Updated last year
rian-dolphin / fasthtml-chat
A chat implementation for FastHTML
☆12 · Sep 14, 2025 · Updated 10 months ago
shadowkiller33 / Contrast-Instruction
☆19 · Oct 2, 2023 · Updated 2 years ago