microsoft/torchscale

Foundation Architecture for (M)LLMs

☆3,134

Alternatives and similar repositories for torchscale

Users who are interested in torchscale are comparing it to the libraries listed below.

microsoft / unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆22,173 · Jan 23, 2026 · Updated 6 months ago
facebookresearch / xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
☆10,531 · Jul 15, 2026 · Updated 2 weeks ago
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆24,568 · Updated this week
microsoft / LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
☆4,450 · Updated this week
Jamie-Stirling / RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
☆1,210 · Oct 22, 2023 · Updated 2 years ago
NVIDIA / FasterTransformer
Transformer related optimization, including BERT, GPT
☆6,445 · Mar 27, 2024 · Updated 2 years ago
deepspeedai / DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆42,830 · Updated this week
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆8,369 · Updated this week
facebookresearch / fairscale
PyTorch extensions for high performance and large scale training.
☆3,411 · Apr 26, 2025 · Updated last year
NVIDIA / Megatron-LM
Ongoing research training transformer models at scale
☆17,252 · Updated this week
huggingface / trl
Train transformer language models with reinforcement learning.
☆18,953 · Updated this week
mosaicml / composer
Supercharge Your Model Training
☆5,491 · Apr 29, 2026 · Updated 3 months ago
facebookincubator / AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…
☆4,726 · Jul 14, 2026 · Updated 2 weeks ago
huggingface / accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…
☆9,800 · Updated this week
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆21,460 · Updated this week
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,253 · Jul 11, 2024 · Updated 2 years ago
facebookresearch / metaseq
Repo for external large-scale work
☆6,550 · Apr 27, 2024 · Updated 2 years ago
facebookresearch / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,249 · Sep 30, 2025 · Updated 9 months ago
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,975 · Jun 10, 2024 · Updated 2 years ago
BlinkDL / RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆14,643 · Jul 23, 2026 · Updated last week
salesforce / LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
☆11,257 · Jun 2, 2026 · Updated last month
CarperAI / trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,752 · Jan 8, 2024 · Updated 2 years ago
FMInference / FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,364 · Oct 28, 2024 · Updated last year
OpenGVLab / LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,915 · Mar 14, 2024 · Updated 2 years ago
syncdoth / RetNet
Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…
☆227 · Mar 12, 2024 · Updated 2 years ago
mosaicml / llm-foundry
LLM training code for Databricks foundation models
☆4,432 · Mar 25, 2026 · Updated 4 months ago
arogozhnikov / einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
☆9,564 · Jul 5, 2026 · Updated 3 weeks ago
mlfoundations / open_flamingo
An open-source framework for training large multimodal models.
☆4,118 · Aug 31, 2024 · Updated last year
NVIDIA / TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H…
☆3,455 · Updated this week
hpcaitech / ColossalAI
Making large AI models cheaper, faster and more accessible
☆41,422 · Jul 13, 2026 · Updated 2 weeks ago
mlfoundations / open_clip
An open source implementation of CLIP.
☆14,032 · Jul 17, 2026 · Updated last week
state-spaces / mamba
Mamba SSM architecture
☆18,682 · Jul 22, 2026 · Updated last week
triton-lang / triton
Development repository for the Triton language and compiler
☆19,812 · Updated this week
huggingface / optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…
☆3,453 · Updated this week
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,884 · Mar 21, 2026 · Updated 4 months ago
facebookresearch / multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
☆1,728 · Updated this week
ELS-RD / kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…
☆1,586 · Jan 28, 2026 · Updated 6 months ago
Lightning-AI / pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
☆31,258 · Updated this week
tatsu-lab / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,243 · Jul 17, 2024 · Updated 2 years ago