Deadline-based hyperparameter tuning on RayTune.
☆32Jan 16, 2020Updated 6 years ago
Alternatives and similar repositories for hypersched
Users that are interested in hypersched are comparing it to the libraries listed below.
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Jan 5, 2023Updated 3 years ago
- A Generic Resource-Aware Hyperparameter Tuning Execution Engine☆15Jan 8, 2022Updated 4 years ago
- Tiresias is a GPU cluster manager for distributed deep learning training.☆167May 7, 2020Updated 6 years ago
- ☆37Updated this week
- Distributed ML Optimizer☆35Jul 28, 2021Updated 4 years ago
- An experimental distributed execution engine☆23Jul 23, 2020Updated 5 years ago
- Code for reproducing experiments performed for Accordion☆13Jun 11, 2021Updated 5 years ago
- [ICLR 2021] CompOFA: Compound Once-For-All Networks For Faster Multi-Platform Deployment☆25Jan 5, 2023Updated 3 years ago
- Fluent dataset operations, compatible with your favorite libraries☆11Sep 4, 2025Updated 9 months ago
- ☆27Aug 31, 2023Updated 2 years ago
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion☆32May 15, 2024Updated 2 years ago
- A collection of reproducible inference engine benchmarks☆38Apr 22, 2025Updated last year
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆12Apr 1, 2020Updated 6 years ago
- A summary of the procedures for non-Shanghai-registered university graduates to stay in Shanghai☆17Jan 24, 2018Updated 8 years ago
- Simulated large clusters for Kubernetes scheduler validation.☆15Jan 3, 2023Updated 3 years ago
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- A fast & easy way to train ML models in your cloud, directly from your laptop.☆14Mar 28, 2022Updated 4 years ago
- ☆47Dec 16, 2022Updated 3 years ago
- A Deep Learning Cluster Scheduler☆36Jan 11, 2021Updated 5 years ago
- Metis: Learning to Schedule Long-Running Applications in Shared Container Clusters at Scale☆19May 27, 2020Updated 6 years ago
- Training wheels, side rails, and helicopter parent for your Deep Learning projects in PyTorch☆24Sep 23, 2023Updated 2 years ago
- CaDiCaL + neural glue variable predictions☆10Oct 21, 2020Updated 5 years ago
- Building a Deep Learning Powered Emoji Slackbot!☆15Jul 23, 2020Updated 5 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- GPU-scheduler-for-deep-learning☆212Nov 5, 2020Updated 5 years ago
- Code for "Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020☆138Jul 25, 2024Updated last year
- the hadoop plugin for chdfs☆15Feb 27, 2026Updated 3 months ago
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- Sublime Text 3 plugin which executes given commands on file save☆16Oct 17, 2018Updated 7 years ago
- ☆31Jul 18, 2019Updated 6 years ago
- GPU analyzer for Kubernetes GPU clusters☆16Apr 11, 2020Updated 6 years ago
- Privacy Budget Orchestration in Machine Learning Workloads (OSDI '21)☆26Oct 20, 2023Updated 2 years ago
- ☆28May 2, 2023Updated 3 years ago
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.☆159Nov 26, 2025Updated 6 months ago
- Code for "Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP", which appeared at SOSP 2021☆28Dec 15, 2021Updated 4 years ago
- Release doc/tutorial/wheels for poseidon-tf☆10Jan 18, 2018Updated 8 years ago
- ☆58Jan 25, 2021Updated 5 years ago
- ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training☆199Dec 22, 2022Updated 3 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Apr 15, 2022Updated 4 years ago