OpenLMLab/LOMO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenLMLab/LOMO)

OpenLMLab / LOMO

LOMO: LOw-Memory Optimization

☆994

Alternatives and similar repositories for LOMO

Users that are interested in LOMO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenMOSS / CoLLiE
View on GitHub
Collaborative Training of Large Language Models in an Efficient Way
☆420Aug 28, 2024Updated last year
artidoro / qlora
View on GitHub
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,959Jun 10, 2024Updated 2 years ago
CStanKonrad / long_llama
View on GitHub
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…
☆1,466Nov 7, 2023Updated 2 years ago
princeton-nlp / MeZO
View on GitHub
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
☆1,168Jan 11, 2024Updated 2 years ago
JIA-Lab-research / LongLoRA
View on GitHub
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
☆2,689Aug 14, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
euclaise / SlimTrainer
View on GitHub
Full finetuning of large language models without large memory requirements
☆92Sep 22, 2025Updated 9 months ago
OpenGVLab / LLaMA-Adapter
View on GitHub
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,916Mar 14, 2024Updated 2 years ago
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
View on GitHub
Instruction Tuning with GPT-4
☆4,331Jun 11, 2023Updated 3 years ago
OptimalScale / LMFlow
View on GitHub
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
☆8,485May 22, 2026Updated last month
nlpxucan / WizardLM
View on GitHub
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
☆9,479Jun 7, 2025Updated last year
PhoebusSi / Alpaca-CoT
View on GitHub
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…
☆2,791Dec 12, 2023Updated 2 years ago
hiyouga / FastEdit
View on GitHub
🩹Editing large language models within 10 seconds⚡
☆1,368Aug 13, 2023Updated 2 years ago
bitsandbytes-foundation / bitsandbytes
View on GitHub
Accessible large language models via k-bit quantization for PyTorch.
☆8,333Updated this week
FasterDecoding / Medusa
View on GitHub
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
☆2,757Jun 25, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
locuslab / wanda
View on GitHub
A simple and effective LLM pruning approach.
☆868Aug 9, 2024Updated last year
GanjinZero / RRHF
View on GitHub
[NIPS2023] RRHF & Wombat
☆806Sep 22, 2023Updated 2 years ago
baichuan-inc / Baichuan-7B
View on GitHub
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
☆5,652Jul 18, 2024Updated 2 years ago
openlm-research / open_llama
View on GitHub
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,530Jul 16, 2023Updated 3 years ago
jiaweizzhao / GaLore
View on GitHub
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
☆1,700Oct 28, 2024Updated last year
yizhongw / self-instruct
View on GitHub
Aligning pretrained language models with instruction data generated by themselves.
☆4,606Mar 27, 2023Updated 3 years ago
SqueezeAILab / SqueezeLLM
View on GitHub
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
☆722Aug 13, 2024Updated last year
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆18,878Updated this week
mit-han-lab / llm-awq
View on GitHub
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
☆3,591Jul 17, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
FranxYao / chain-of-thought-hub
View on GitHub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,777Aug 4, 2024Updated last year
FlagAI-Open / FlagAI
View on GitHub
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
☆3,868Jul 13, 2026Updated last week
FreedomIntelligence / LLMZoo
View on GitHub
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
☆2,941Nov 26, 2023Updated 2 years ago
salesforce / CodeTF
View on GitHub
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
☆1,479May 1, 2025Updated last year
qwopqwop200 / GPTQ-for-LLaMa
View on GitHub
4 bits quantization of LLaMA using GPTQ
☆3,071Jul 13, 2024Updated 2 years ago
huggingface / peft
View on GitHub
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆21,415Updated this week
mosaicml / llm-foundry
View on GitHub
LLM training code for Databricks foundation models
☆4,429Mar 25, 2026Updated 3 months ago
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆24,489Updated this week
mit-han-lab / streaming-llm
View on GitHub
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,248Jul 11, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
tloen / alpaca-lora
View on GitHub
Instruct-tune LLaMA on consumer hardware
☆18,910Jul 29, 2024Updated last year
AetherCortex / Llama-X
View on GitHub
Open Academic Research on Improving LLaMA to SOTA LLM
☆1,605Aug 30, 2023Updated 2 years ago
LianjiaTech / BELLE
View on GitHub
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
☆8,273Oct 16, 2024Updated last year
TigerResearch / TigerBot
View on GitHub
TigerBot: A multi-language multi-task LLM
☆2,259Dec 28, 2024Updated last year
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,243Jun 17, 2026Updated last month
OpenLMLab / LEval
View on GitHub
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
☆406Jul 9, 2024Updated 2 years ago
deepspeedai / DeepSpeedExamples
View on GitHub
Example models using DeepSpeed
☆6,831Updated this week