Toolkit for launching and observing MaxText training on Slurm-managed GPU clusters
☆24Mar 17, 2026Updated this week
Alternatives and similar repositories for maxtext-slurm
Users that are interested in maxtext-slurm are comparing it to the libraries listed below
Sorting:
- Scale-out system monitoring☆21Updated this week
- ☆64Updated this week
- ☆82Updated this week
- Automatically exported from code.google.com/p/libfixmath☆11Jan 25, 2016Updated 10 years ago
- Open Source Repository for Team UPennalizers☆69Nov 30, 2016Updated 9 years ago
- High performance NCCL plugin for Bagua.☆15Sep 15, 2021Updated 4 years ago
- Using Genetic Algorithms to aid Machine Learning☆20Feb 20, 2018Updated 8 years ago
- A TensorFlow Extension: GPU performance tools for TensorFlow.☆26Jul 27, 2023Updated 2 years ago
- A toolchain file and examples using cmake for iOS development (this is a fork of a similar project found on code.google.com)☆26Nov 15, 2017Updated 8 years ago
- ☆23Apr 25, 2023Updated 2 years ago
- A script to call taxi automatically.☆19Aug 3, 2017Updated 8 years ago
- OpenCL implementation of a NN and CNN☆22Jun 27, 2018Updated 7 years ago
- Separate from hardware and used to learn some NCCL mechanisms☆25Apr 19, 2024Updated last year
- auto Iterative pruning based on caffe☆23Oct 17, 2017Updated 8 years ago
- A simple tool to profile performance of multiple combinations of GEMM of cuBLAS☆25Feb 9, 2021Updated 5 years ago
- For my own use BVLC/caffe helper tools☆24Nov 5, 2015Updated 10 years ago
- 💥 Cython bindings for MurmurHash2☆45Nov 14, 2025Updated 4 months ago
- Collection of header-only utilities for C++☆36Dec 4, 2023Updated 2 years ago
- Official repository of Alibaba erdma drivers☆36Jul 23, 2025Updated 7 months ago
- Ristretto: Caffe-based approximation of convolutional neural networks.☆30Sep 25, 2018Updated 7 years ago
- ☆29Mar 20, 2017Updated 9 years ago
- XNOR-Net, CUDNN5 supported version of XNOR-Net-caffe: https://github.com/loswensiana/BWN-XNOR-caffe☆31Apr 1, 2018Updated 7 years ago
- python开发的Web爬虫☆30Jan 27, 2016Updated 10 years ago
- saving memory by recomputing for keras☆37Apr 30, 2020Updated 5 years ago
- Automatically exported from code.google.com/p/math-neon☆40Apr 20, 2015Updated 10 years ago
- Simple pruning example using Caffe☆33Oct 19, 2017Updated 8 years ago
- OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.☆33Apr 13, 2023Updated 2 years ago
- A high-performance inference system for large language models, designed for production environments.☆491Dec 19, 2025Updated 3 months ago
- ☆52Jun 14, 2024Updated last year
- distributed-embeddings is a library for building large embedding based models in Tensorflow 2.☆47Oct 17, 2023Updated 2 years ago
- ☆79Jan 5, 2025Updated last year
- Caffe with NNPACK integration☆59Mar 24, 2016Updated 9 years ago
- PyTorch distributed training acceleration framework☆54Aug 13, 2025Updated 7 months ago
- Pipeline Parallelism Emulation and Visualization☆80Jan 8, 2026Updated 2 months ago
- gossip: Efficient Communication Primitives for Multi-GPU Systems☆62Jul 1, 2022Updated 3 years ago
- TensorFlow util for building memory usage timeline from LOG_MEMORY messages☆65Dec 7, 2017Updated 8 years ago
- 📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).☆70Apr 26, 2025Updated 10 months ago
- Merge Batch Norm caffe☆64Jul 25, 2018Updated 7 years ago
- Embedded and Mobile Deployment☆73May 2, 2018Updated 7 years ago