huggingface / picotron_tutorialLinks

☆206

Alternatives and similar repositories for picotron_tutorial

Users that are interested in picotron_tutorial are comparing it to the libraries listed below

Sorting:

wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆163Updated 4 months ago
foundation-model-stack / fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…
☆258Updated last week
LambdaLabsML / distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
☆460Updated 5 months ago
NVIDIA-NeMo / RL
Scalable toolkit for efficient model reinforcement
☆558Updated this week
huggingface / kernels
Load compute kernels from the Hub
☆220Updated this week
McGill-NLP / nano-aha-moment
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
☆506Updated 3 weeks ago
allenai / OLMo-core
PyTorch building blocks for the OLMo ecosystem
☆269Updated this week
MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆188Updated 2 months ago
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆342Updated 7 months ago
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆149Updated last month
facebookresearch / LayerSkip
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
☆323Updated 3 months ago
marin-community / marin
☆347Updated this week
NVIDIA / NeMo-Skills
A project to improve skills of large language models
☆501Updated this week
gpu-mode / ring-attention
ring-attention experiments
☆146Updated 9 months ago
PrimeIntellect-ai / prime-rl
Decentralized RL Training at Scale
☆400Updated this week
NVIDIA / ngpt
Normalized Transformer (nGPT)
☆185Updated 8 months ago
snowflakedb / ArcticTraining
ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)
☆190Updated this week
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆418Updated last week
AnswerDotAI / cold-compress
Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…
☆138Updated 11 months ago
ServiceNow / PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆134Updated this week
gpu-mode / profiling-cuda-in-torch
☆162Updated last year
melisa-writer / short-transformers
Prune transformer layers
☆69Updated last year
pytorch-labs / monarch
PyTorch Single Controller
☆341Updated this week
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆221Updated 3 weeks ago
hkproj / triton-flash-attention
☆182Updated 7 months ago
lucidrains / speculative-decoding
Explorations into some recent techniques surrounding speculative decoding
☆275Updated 7 months ago
pytorch / torchft
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
☆372Updated this week
shangshang-wang / Tina
Tina: Tiny Reasoning Models via LoRA
☆272Updated 2 months ago
mengxiayu / LLMSuperWeight
Code for studying the super weight in LLM
☆114Updated 8 months ago
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256Updated last year