thinking-machines-lab / batch_invariant_ops
⭐423 · Updated this week
Alternatives and similar repositories for batch_invariant_ops
Users interested in batch_invariant_ops are comparing it to the libraries listed below.
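For context on what is being compared: batch_invariant_ops targets the batch-size-dependent nondeterminism of standard GPU kernels, where the same input row can produce slightly different results depending on how many rows are computed alongside it. A minimal PyTorch sketch of that effect is below; it assumes a CUDA device, and the `set_batch_invariant_mode` usage in the trailing comment is taken from the repo's README as an assumption rather than tested here.

```python
import torch

# Reproduce the batch-size nondeterminism that batch_invariant_ops addresses:
# row 0 of a matmul can differ depending on whether it is computed inside a
# large batch or alone, because the kernel's reduction order changes with
# the batch shape. Requires a CUDA device.
torch.manual_seed(0)
device = "cuda"
a = torch.randn(2048, 2048, device=device)
b = torch.randn(2048, 2048, device=device)

row_in_batch = (a @ b)[:1]  # row 0 computed as part of a 2048-row batch
row_alone = a[:1] @ b       # row 0 computed as a batch of one

# On typical GPU kernels these differ at the floating-point level:
print("max abs difference:", (row_in_batch - row_alone).abs().max().item())

# Assumption (per the repo README): the library exposes a context manager
# that swaps in batch-invariant kernels, under which the results match:
# from batch_invariant_ops import set_batch_invariant_mode
# with set_batch_invariant_mode():
#     assert torch.equal((a @ b)[:1], a[:1] @ b)
```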
- 🔥 A minimal training framework for scaling FLA models · ⭐236 · Updated 3 weeks ago
- Efficient Triton implementation of Native Sparse Attention. · ⭐215 · Updated 3 months ago
- Flash-Muon: An Efficient Implementation of Muon Optimizer · ⭐181 · Updated 2 months ago
- ⭐84 · Updated 6 months ago
- ⭐242 · Updated 3 months ago
- Physics of Language Models, Part 4 · ⭐242 · Updated last month
- Triton-based implementation of Sparse Mixture of Experts. · ⭐238 · Updated 2 weeks ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. · ⭐140 · Updated this week
- ByteCheckpoint: A Unified Checkpointing Library for LFMs · ⭐243 · Updated 2 months ago
- Implementation of FP8/INT8 Rollout for RL training without performance drop. · ⭐187 · Updated last week
- An efficient implementation of the NSA (Native Sparse Attention) kernel · ⭐114 · Updated 2 months ago
- Fast and memory-efficient exact attention · ⭐69 · Updated 6 months ago
- ⭐124 · Updated 3 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines · ⭐516 · Updated this week
- Async pipelined version of Verl · ⭐116 · Updated 5 months ago
- Load compute kernels from the Hub · ⭐271 · Updated this week
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters · ⭐129 · Updated 9 months ago
- Some preliminary explorations of Mamba's context scaling. · ⭐217 · Updated last year
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff" · ⭐240 · Updated 3 months ago
- Normalized Transformer (nGPT) · ⭐188 · Updated 9 months ago
- ⭐141 · Updated 6 months ago
- Official repository for DistFlashAttn: Distributed Memory-efficient Attention for Long-context LLMs Training · ⭐216 · Updated last year
- The evaluation framework for training-free sparse attention in LLMs · ⭐91 · Updated 2 months ago
- Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash… · ⭐265 · Updated last month
- Understand and test language model architectures on synthetic tasks. · ⭐224 · Updated last month
- Code for the paper [ICLR 2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference · ⭐140 · Updated 3 months ago
- Triton implementation of FlashAttention2 that adds Custom Masks. · ⭐134 · Updated last year
- [ICML 2024] CLLMs: Consistency Large Language Models · ⭐402 · Updated 9 months ago
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of… · ⭐145 · Updated last year
- ring-attention experiments · ⭐150 · Updated 10 months ago