Where GPUs get cooked 👩🍳🔥
☆370Jan 21, 2026Updated last month
Alternatives and similar repositories for gpu-fryer
Users that are interested in gpu-fryer are comparing it to the libraries listed below
Sorting:
- Multi-GPU CUDA stress test☆2,108Nov 4, 2025Updated 4 months ago
- A lattice QCD library.☆16Feb 10, 2026Updated 3 weeks ago
- Minimalistic large language model 3D-parallelism training☆2,579Feb 19, 2026Updated 2 weeks ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,099Aug 26, 2025Updated 6 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,314Feb 20, 2026Updated 2 weeks ago
- AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models.☆38Dec 2, 2025Updated 3 months ago
- Efficient Triton Kernels for LLM Training☆6,189Updated this week
- A Datacenter Scale Distributed Inference Serving Framework☆6,154Updated this week
- Inference server benchmarking tool☆145Oct 2, 2025Updated 5 months ago
- Recipes to scale inference-time compute of open models☆1,129May 22, 2025Updated 9 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆143Sep 12, 2025Updated 5 months ago
- ☆49Sep 26, 2025Updated 5 months ago
- ☆27Feb 9, 2026Updated 3 weeks ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆91Jan 9, 2026Updated last month
- Collection of small examples for running on ALCF resources☆21Feb 24, 2026Updated last week
- Scalable toolkit for efficient model reinforcement☆1,372Updated this week
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆481Updated this week
- Tile primitives for speedy kernels☆3,202Feb 24, 2026Updated last week
- ☆15Oct 24, 2023Updated 2 years ago
- CUDA checkpoint and restore utility☆424Sep 15, 2025Updated 5 months ago
- Speed up model training by fixing data loading.☆577Updated this week
- [NeurIPS 2025] Official Pytorch Implementation of "The Curse of Depth in Large Language Models" by Wenfang Sun, Xinyuan Song, Pengxiang L…☆67Jan 2, 2026Updated 2 months ago
- Cray-LM unified training and inference stack.☆22Jan 30, 2025Updated last year
- A high-performance distributed file system designed to address the challenges of AI training and inference workloads.☆9,730Feb 25, 2026Updated last week
- Build compute kernels and load them from the Hub.☆452Feb 28, 2026Updated last week
- MoE training for Me and You and maybe other people☆364Feb 7, 2026Updated last month
- VaniDL is an tool for analyzing I/O patterns and behavior with Deep Learning Applications.☆10Jul 8, 2022Updated 3 years ago
- Pragmatic approach to parsing import profiles for CI's☆12Jul 1, 2024Updated last year
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Jan 30, 2026Updated last month
- ☆11Jan 28, 2026Updated last month
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16May 17, 2023Updated 2 years ago
- Liquid Argon Computer Vision☆12Dec 4, 2025Updated 3 months ago
- ☆12Jul 9, 2021Updated 4 years ago
- FlexiTokens☆18Dec 27, 2025Updated 2 months ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Simple MPI implementation for prototyping or learning☆304Aug 6, 2025Updated 7 months ago
- benchmarking some transformer deployments☆26Dec 15, 2025Updated 2 months ago
- Save, load, host, and share AI model checkpoints without slowing down training. Host on Lightning AI or your own cloud with enterprise-gr…☆42Updated this week