☆15Mar 15, 2021Updated 4 years ago
Alternatives and similar repositories for aws-parallelcluster-megatron
Users that are interested in aws-parallelcluster-megatron are comparing it to the libraries listed below
Sorting:
- This repository contains sample code to help you extend your LSF cluster to the cloud. It provides fully functional examples of how to s…☆16Dec 17, 2025Updated 2 months ago
- ☆15Mar 6, 2020Updated 5 years ago
- Infrastructure template and Jupyter notebooks for running RoseTTAFold on AWS Batch.☆22Feb 24, 2022Updated 4 years ago
- The open source version of the AWS ParallelCluster User Guide.☆25Jun 16, 2023Updated 2 years ago
- The tool facilitates debugging convergence issues and testing new algorithms and recipes for training LLMs using Nvidia libraries such as…☆18Sep 17, 2025Updated 5 months ago
- Manage AWS ParallelCluster through an easy to use web interface☆67Mar 13, 2023Updated 2 years ago
- Create an Amazon EKS cluster and run a distributed training example☆29Aug 19, 2024Updated last year
- 1-Click Cluster Deployment with AWS ParallelCluster☆29Sep 23, 2022Updated 3 years ago
- Deep learning for gravitational potentials, based on well-mixed tracers in phase space.☆11Dec 29, 2025Updated 2 months ago
- Wantedlyのインターン情報や新卒採用についてのインフォメーションです☆11Apr 5, 2022Updated 3 years ago
- ☆41Aug 27, 2024Updated last year
- ☆10Jun 8, 2022Updated 3 years ago
- Memory Benchmark for OpenCL-supported Intel FPGAs☆11Dec 25, 2023Updated 2 years ago
- A continuous integration (CI) system for 📓 Jupyter notebooks, built using 🧠 Amazon SageMaker.☆11Aug 5, 2025Updated 6 months ago
- [INACTIVE] A real-time, collaborative, HTML5 drawing widget powered by KineticJS / FabricJS and inspired by Literally Canvas.☆10Feb 9, 2014Updated 12 years ago
- Transformer from scratch with einsum method☆11Jul 8, 2021Updated 4 years ago
- Sagemaker Studio Docker UI Extension☆11Apr 17, 2024Updated last year
- Fine Tune Multimodal LLM "Idefics 2" using QLoRA.☆11Apr 20, 2024Updated last year
- ☆16Jul 15, 2019Updated 6 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- OpenAI ROS☆12Mar 7, 2019Updated 6 years ago
- CNN model to predict lung cancer based on CT scan images☆11Aug 5, 2020Updated 5 years ago
- A tiny python2.7 script which converts LaTex projects into arxiv-format. Suggestions are welcome.☆10Mar 20, 2016Updated 9 years ago
- In this repository, we will present techniques to detect covariate drift, and demonstrate how to incorporate your own custom drift detect…☆13May 26, 2021Updated 4 years ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Repository for CoSAI workstream 3, AI Risk Governance☆21Feb 18, 2026Updated last week
- Import Haskell modules in Python as if they were native modules☆12Mar 30, 2021Updated 4 years ago
- ☆11Feb 9, 2024Updated 2 years ago
- ☆11Sep 10, 2023Updated 2 years ago
- An implementation of the AlphaZero algorithm by Google Deepmind. Research paper here: https://arxiv.org/abs/1911.08265☆12Oct 10, 2024Updated last year
- Manages CloudWatch Alarms for EC2 Instances in ASGs☆12Feb 17, 2022Updated 4 years ago
- A custom Docker image containing the theia-ide for Java development☆15Nov 30, 2021Updated 4 years ago
- All notebook for FastAI learning purposes.☆15Jun 11, 2019Updated 6 years ago
- Extremely simple MoE implementation, mostly based off Switch Transformer☆13Feb 26, 2024Updated 2 years ago
- ☆11Jun 29, 2021Updated 4 years ago
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- recipe for training fully-featured self supervised image jepa models☆12Jun 4, 2025Updated 8 months ago
- Tool to dump all GPS traces collected by/for the OpenStreetMap project.☆25Mar 6, 2019Updated 6 years ago
- GPT-J 6B inference on TensorRT with INT-8 precision☆11Apr 5, 2023Updated 2 years ago