Full finetuning of large language models without large memory requirements
☆93Sep 22, 2025Updated 9 months ago
Alternatives and similar repositories for SlimTrainer
Users that are interested in SlimTrainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 3 years ago
- LOMO: LOw-Memory Optimization☆994Jul 2, 2024Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated 2 years ago
- ☆416Nov 2, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A place to store reusable transformer components of my own creation or found on the interwebs☆80May 30, 2026Updated last month
- ☆47Aug 29, 2024Updated last year
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆110Jan 11, 2024Updated 2 years ago
- ☆21Aug 27, 2023Updated 2 years ago
- ☆535Dec 1, 2023Updated 2 years ago
- ☆74Sep 5, 2023Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆28Apr 21, 2023Updated 3 years ago
- Build modern UIs in Jupyter with Python☆12Dec 28, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Generate textbook-quality synthetic LLM pretraining data☆508Oct 19, 2023Updated 2 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- ☆10Apr 21, 2024Updated 2 years ago
- Fine-tune mistral-7B on 3090s, a100s, h100s☆732Oct 11, 2023Updated 2 years ago
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- ☆133Nov 24, 2023Updated 2 years ago
- clean up your LLM datasets☆113May 30, 2023Updated 3 years ago
- Low-Rank adapter extraction for fine-tuned transformers models☆181May 2, 2024Updated 2 years ago
- ☆45Oct 13, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆94Oct 5, 2023Updated 2 years ago
- Official PyTorch implementation of QA-LoRA☆147Mar 13, 2024Updated 2 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- One stop shop for all things carp☆58Sep 9, 2022Updated 3 years ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Aug 9, 2023Updated 2 years ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆23Jun 15, 2025Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- ☆553Feb 8, 2026Updated 4 months ago
- 🌸 Train floret vectors☆18May 4, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆23Jun 20, 2026Updated last week
- A bagel, with everything.☆326Apr 11, 2024Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- data cleaning and curation for unstructured text☆330Aug 6, 2024Updated last year
- Use LLMs to clean your gmail inbox☆22Dec 23, 2023Updated 2 years ago
- Simplex Random Feature attention, in PyTorch☆76Oct 10, 2023Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,929Sep 30, 2023Updated 2 years ago