Full finetuning of large language models without large memory requirements
☆94Sep 22, 2025Updated 5 months ago
Alternatives and similar repositories for SlimTrainer
Users that are interested in SlimTrainer are comparing it to the libraries listed below
Sorting:
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 2 years ago
- LOMO: LOw-Memory Optimization☆988Jul 2, 2024Updated last year
- Generate textbook-quality synthetic LLM pretraining data☆509Oct 19, 2023Updated 2 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆73Updated this week
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆27Apr 21, 2023Updated 2 years ago
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- ☆415Nov 2, 2023Updated 2 years ago
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆110Jan 11, 2024Updated 2 years ago
- ☆48Aug 29, 2024Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Aug 9, 2023Updated 2 years ago
- Official PyTorch implementation of QA-LoRA☆145Mar 13, 2024Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- ☆535Dec 1, 2023Updated 2 years ago
- clean up your LLM datasets☆113May 30, 2023Updated 2 years ago
- Low-Rank adapter extraction for fine-tuned transformers models☆180May 2, 2024Updated last year
- ☆553Feb 8, 2026Updated 3 weeks ago
- QLoRA for Masked Language Modeling☆23Sep 11, 2023Updated 2 years ago
- Fast AI Practical Deep Learning for Coders experiments in Stable Diffusion☆24Nov 10, 2022Updated 3 years ago
- Fine-tune mistral-7B on 3090s, a100s, h100s☆724Oct 11, 2023Updated 2 years ago
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- Infr is an autonomous, open-source platform for data collection, storage, & retrieval that you can self-host.☆45Nov 3, 2023Updated 2 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated last year
- ☆94Oct 5, 2023Updated 2 years ago
- Simplex Random Feature attention, in PyTorch☆76Oct 10, 2023Updated 2 years ago
- ☆45Oct 13, 2023Updated 2 years ago
- A chat implementation for FastHTML☆11Sep 14, 2025Updated 5 months ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- ☆135Nov 24, 2023Updated 2 years ago
- Sequence models in Numpy☆25Oct 9, 2020Updated 5 years ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆355Jul 29, 2024Updated last year
- DSPy-powered email optimization for startup founders: drop in your 3 best emails, get optimized outreach for new leads☆39Sep 14, 2025Updated 5 months ago
- ☆76Jan 24, 2024Updated 2 years ago
- Build modern UIs in Jupyter with Python☆12Dec 28, 2022Updated 3 years ago
- ☆10Apr 21, 2024Updated last year
- ☆11Aug 26, 2021Updated 4 years ago