meta-pytorch / torchforgeLinks

PyTorch-native post-training at scale

☆479

Alternatives and similar repositories for torchforge

Users that are interested in torchforge are comparing it to the libraries listed below

Sorting:

changjonathanc / flex-nano-vllm
FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.
☆301Updated this week
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆298Updated last week
NVIDIA-NeMo / RL
Scalable toolkit for efficient model reinforcement
☆1,009Updated this week
huggingface / kernels
Load compute kernels from the Hub
☆316Updated last week
MoonshotAI / checkpoint-engine
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆804Updated this week
microsoft / dion
Dion optimizer algorithm
☆379Updated last week
thinking-machines-lab / batch_invariant_ops
☆877Updated this week
ServiceNow / PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆270Updated last week
huggingface / picotron_tutorial
☆225Updated 2 weeks ago
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆353Updated 10 months ago
meta-pytorch / torchft
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
☆441Updated 2 weeks ago
foundation-model-stack / fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…
☆270Updated this week
allenai / OLMo-core
PyTorch building blocks for the OLMo ecosystem
☆314Updated this week
facebookresearch / LayerSkip
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
☆344Updated 6 months ago
PrimeIntellect-ai / prime-rl
Async RL Training at Scale
☆734Updated last week
tilde-research / MoMoE-impl
Memory optimized Mixture of Experts
☆69Updated 3 months ago
apple / ml-recurrent-drafter
☆218Updated 9 months ago
snowflakedb / ArcticTraining
ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)
☆241Updated this week
facebookresearch / PhysicsLM4
Physics of Language Models, Part 4
☆255Updated 3 months ago
apple / ml-cross-entropy
☆545Updated last month
NVIDIA / ngpt
Normalized Transformer (nGPT)
☆192Updated 11 months ago
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆207Updated 7 months ago
thinking-machines-lab / tinker
Training API
☆202Updated 3 weeks ago
meta-pytorch / monarch
PyTorch Single Controller
☆869Updated this week
facebookresearch / spdl
Scalable and Performant Data Loading
☆331Updated this week
NVIDIA / kvpress
LLM KV cache compression made easy
☆674Updated this week
NVIDIA-NeMo / Automodel
Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
☆147Updated this week
antimatter15 / reverse-engineering-gemma-3n
Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model
☆250Updated 5 months ago
ScalingIntelligence / KernelBench
KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA (+ more DSLs)
☆642Updated this week
marin-community / marin
Open-source framework for the research and development of foundation models.
☆574Updated last week