huggingface / gpu-fryer
Where GPUs get cooked 👩‍🍳🔥
★266 Updated last week
Alternatives and similar repositories for gpu-fryer
Users interested in gpu-fryer are comparing it to the libraries listed below.
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo) ★377 Updated this week
- PyTorch Single Controller ★345 Updated this week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ★188 Updated 2 months ago
- Load compute kernels from the Hub ★220 Updated last week
- Inference server benchmarking tool ★87 Updated 3 months ago
- ★182 Updated this week
- Scalable and Performant Data Loading ★291 Updated this week
- 👷 Build compute kernels ★87 Updated last week
- ★216 Updated 6 months ago
- A tool to configure, launch and manage your machine learning experiments. ★176 Updated this week
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs ★445 Updated last week
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash… ★258 Updated 2 weeks ago
- ★88 Updated last year
- Google TPU optimizations for transformers models ★117 Updated 6 months ago
- Best practices & guides on how to write distributed PyTorch training code ★463 Updated 5 months ago
- ★162 Updated last year
- ★208 Updated 5 months ago
- ★232 Updated this week
- Cray-LM unified training and inference stack. ★22 Updated 6 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ★283 Updated this week
- ★186 Updated this week
- A lightweight, local-first, and free experiment tracking Python library built on top of 🤗 Datasets and Spaces. ★570 Updated this week
- Decentralized RL Training at Scale ★403 Updated this week
- Simple MPI implementation for prototyping or learning ★275 Updated this week
- Write a fast kernel and run it on Discord. See how you compare against the best! ★48 Updated last week
- ★213 Updated last month
- TorchFix - a linter for PyTorch-using code with autofix support ★145 Updated 6 months ago
- Transform datasets at scale. Optimize datasets for fast AI model training. ★520 Updated this week
- ScalarLM - a unified training and inference stack ★52 Updated last week
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7) ★66 Updated 4 months ago