GoogleCloudPlatform / slurm-gcp
☆24Updated last week
Related projects: ⓘ
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆188Updated this week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆69Updated this week
- ☆37Updated 2 weeks ago
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆23Updated last month
- Computing the greatest common divisor with transformers, source code for the paper https//arxiv.org/abs/2308.15594☆11Updated 5 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. Now, with kittens!☆45Updated last month
- ☆18Updated this week
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆40Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆94Updated this week
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click.☆62Updated 7 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆194Updated this week
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆41Updated 3 months ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated 3 months ago
- Transformer GPU VRAM estimator☆35Updated 5 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆26Updated last year
- ☆21Updated 5 months ago
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search☆10Updated last year
- Singularity Image Format (SIF) reference implementation.☆17Updated 2 weeks ago
- ☆13Updated 3 years ago
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click.☆50Updated 3 weeks ago
- Real-time visualisation☆14Updated 2 months ago
- A file utility for accessing both local and remote files through a unified interface.☆36Updated last month
- First token cutoff sampling inference example☆28Updated 8 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆29Updated 3 weeks ago
- PyTorch centric eager mode debugger☆43Updated 2 months ago
- ☆11Updated 2 months ago
- GPU Environment Management for Visual Studio Code☆35Updated last year
- Google TPU optimizations for transformers models☆62Updated this week
- ☆117Updated this week
- C API for MLX☆68Updated last week