Where GPUs get cooked 👩🍳🔥
☆389Apr 8, 2026Updated last month
Alternatives and similar repositories for gpu-fryer
Users that are interested in gpu-fryer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimalistic large language model 3D-parallelism training☆2,690Apr 7, 2026Updated last month
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,182Aug 26, 2025Updated 8 months ago
- ☆18Dec 2, 2024Updated last year
- Efficient Triton Kernels for LLM Training☆6,350May 11, 2026Updated last week
- Save, load, host, and share AI model checkpoints without slowing down training. Host on Lightning AI or your own cloud with enterprise-gr…☆41May 4, 2026Updated 2 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Multi-GPU CUDA stress test☆2,193Nov 4, 2025Updated 6 months ago
- AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models.☆39Dec 2, 2025Updated 5 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,415May 7, 2026Updated last week
- FM-Leaderboard-er allows you to create leaderboard to find the best LLM/prompt for your own business use case based on your data, task, p…☆19Oct 31, 2024Updated last year
- Cray-LM unified training and inference stack.☆22Jan 30, 2025Updated last year
- A Datacenter Scale Distributed Inference Serving Framework☆6,791Updated this week
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning☆13May 7, 2023Updated 3 years ago
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆419Updated this week
- Parseit - Parseit is command line tool to parse data using EBNF or ABNF using the excellent Instaparse library, and serializing the resul…☆16Dec 5, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Search your favorite websites and chat with them, on your desktop🌐☆29Jan 26, 2025Updated last year
- Implementation for robust ViT and scaled attention☆21Apr 4, 2025Updated last year
- Inference server benchmarking tool☆158Apr 24, 2026Updated 3 weeks ago
- Recipes to scale inference-time compute of open models☆1,132Apr 2, 2026Updated last month
- Build compute kernels and load them from the Hub.☆638Updated this week
- This repository contains the results and code for the MLPerf™ Training v4.0 benchmark.☆12Jun 11, 2024Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆153Sep 12, 2025Updated 8 months ago
- Tile primitives for speedy kernels☆3,360May 11, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆28May 11, 2026Updated last week
- MoE training for Me and You and maybe other people☆386Mar 15, 2026Updated 2 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆93Apr 15, 2026Updated last month
- ☆54Sep 26, 2025Updated 7 months ago
- Simple MPI implementation for prototyping or learning☆312Aug 6, 2025Updated 9 months ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆500Apr 3, 2026Updated last month
- A high-performance distributed file system designed to address the challenges of AI training and inference workloads.☆9,886May 7, 2026Updated last week
- An automatic differentiation system for dense and sparse problems☆13Jan 16, 2025Updated last year
- CUDA checkpoint and restore utility☆450Sep 15, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The Core Registry of Container Blueprints for the Autamus Build System☆15Mar 14, 2023Updated 3 years ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated last month
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆42Apr 4, 2025Updated last year
- A peer to peer machine intelligence benchmark☆31Mar 24, 2023Updated 3 years ago
- Speed up model training by fixing data loading.☆594Updated this week
- ☆17May 7, 2026Updated last week
- Everything about the SmolLM and SmolVLM family of models☆3,777Apr 2, 2026Updated last month