☆15Mar 15, 2021Updated 5 years ago
Alternatives and similar repositories for aws-parallelcluster-megatron
Users that are interested in aws-parallelcluster-megatron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains sample code to help you extend your LSF cluster to the cloud. It provides fully functional examples of how to s…☆16Dec 17, 2025Updated 6 months ago
- The open source version of the AWS ParallelCluster User Guide.☆25Jun 16, 2023Updated 3 years ago
- The open source version of the NICE Desktop Cloud Visualization User Guide. You can provide feedback by submitting issues in this repo, o…☆10Jun 15, 2023Updated 3 years ago
- This guidance shows how to deploy a secure, high-performance Cryo-EM analysis environment on AWS ParallelCluster Service, enabling resear…☆10Nov 14, 2025Updated 7 months ago
- 1-Click Cluster Deployment with AWS ParallelCluster☆30Sep 23, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click.☆67Feb 20, 2024Updated 2 years ago
- Monitoring Dashboard for AWS ParallelCluster AWS ParallelCluster & AWS PCS + Amazon RES virtual desktops☆41Jun 22, 2026Updated last week
- Store intermediate results of iterative methods.☆16May 21, 2024Updated 2 years ago
- The tool facilitates debugging convergence issues and testing new algorithms and recipes for training LLMs using Nvidia libraries such as…☆20Sep 17, 2025Updated 9 months ago
- Pickle decompiler plugin for Radare2☆18Aug 6, 2023Updated 2 years ago
- ☆17Mar 11, 2024Updated 2 years ago
- ☆13Aug 6, 2019Updated 6 years ago
- Create an Amazon EKS cluster and run a distributed training example☆29Aug 19, 2024Updated last year
- GPT-jax based on the official huggingface library☆13Jun 22, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Unified high-performance Python client for object and file stores.☆76Updated this week
- ☆31May 4, 2020Updated 6 years ago
- ☆11Sep 10, 2023Updated 2 years ago
- ☆11May 16, 2019Updated 7 years ago
- Spearmint uses Gaussian Processes to automatically optimize hyper parameter. This is a fork of Spearmint for the deep learning community.…☆11Nov 30, 2016Updated 9 years ago
- ☆16Apr 28, 2025Updated last year
- ☆10Jun 8, 2022Updated 4 years ago
- SageMaker Experiments and DVC☆18Aug 22, 2022Updated 3 years ago
- Amazon EKS cluster consumption made easier☆33Oct 18, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Canidadate for the Kaggle 2017 Data Science Bowl - Automatic detection of lung cancer from CT scans☆10Apr 7, 2017Updated 9 years ago
- Support ThingShadow and MQTT protocol with Certificate Store by clientId (thingName)☆11Aug 24, 2020Updated 5 years ago
- Develop mobile application using AWS Cloud9 - React Native with Amplify and AppSync and Mobile hub☆14Aug 13, 2018Updated 7 years ago
- A continuous integration (CI) system for 📓 Jupyter notebooks, built using 🧠 Amazon SageMaker.☆11Aug 5, 2025Updated 10 months ago
- recipe for training fully-featured self supervised image jepa models☆14Jun 4, 2025Updated last year
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- CNN model to predict lung cancer based on CT scan images☆11Aug 5, 2020Updated 5 years ago
- Scheduling app to search for free block fo time on your Google Calendar (using Flask).☆16Jun 24, 2013Updated 13 years ago
- This repository contains notebooks showing how to perform mixed precision training in tf.keras 2.0☆12Dec 15, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- In this repository, we will present techniques to detect covariate drift, and demonstrate how to incorporate your own custom drift detect…☆12May 26, 2021Updated 5 years ago
- Fine Tune Multimodal LLM "Idefics 2" using QLoRA.☆11Apr 20, 2024Updated 2 years ago
- ☆19Oct 8, 2024Updated last year
- AIBOM Workshop RSA 2024☆15May 20, 2024Updated 2 years ago
- Memory Benchmark for OpenCL-supported Intel FPGAs☆12Dec 25, 2023Updated 2 years ago
- ☆16Sep 22, 2024Updated last year
- Automatically split the chest x-ray into two views☆20Aug 24, 2023Updated 2 years ago