geoffxy/habitat

🔮 Execution time predictions for deep neural network training iterations across different GPUs.

☆63

Alternatives and similar repositories for habitat

Users that are interested in habitat are comparing it to the libraries listed below.

S-Lab-System-Group / HeliosArtifact
HeliosArtifact
☆22 · Sep 27, 2022 · Updated 3 years ago
joapolarbear / dpro
Analysis for the traces from byteprofile
☆32 · Nov 21, 2023 · Updated 2 years ago
stanford-futuredata / gavel
Code for "Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆139 · Jul 25, 2024 · Updated 2 years ago
S-Lab-System-Group / Hydro
Surrogate-based Hyperparameter Tuning System
☆30 · Jun 29, 2023 · Updated 3 years ago
microsoft / elasticflow-traces
Integrated Training Platform (ITP) traces used in ElasticFlow paper.
☆31 · Dec 23, 2022 · Updated 3 years ago
stanford-futuredata / POP
Code for "Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP", which appeared at SOSP 2021
☆28 · Dec 15, 2021 · Updated 4 years ago
petuum / adaptdl
Resource-adaptive cluster scheduler for deep learning training.
☆459 · Mar 5, 2023 · Updated 3 years ago
alibaba / GPU-scheduler-for-deep-learning
GPU-scheduler-for-deep-learning
☆213 · Nov 5, 2020 · Updated 5 years ago
msr-fiddle / philly-traces
☆198 · Aug 31, 2019 · Updated 6 years ago
CDECatapult / ml-performance-prediction
Code that accompanies the paper "Predicting the Computational Cost of Deep Learning Models"
☆21 · Dec 14, 2018 · Updated 7 years ago
CDECatapult / mlpredict
Python package to predict deep learning execution time
☆13 · Jul 26, 2022 · Updated 4 years ago
AliyunContainerService / et-operator
Kubernetes Operator for AI and Bigdata Elastic Training
☆91 · Jan 10, 2025 · Updated last year
feifeibear / PyTorchMemTracer
Depict GPU memory footprint during DNN training of PyTorch
☆11 · Nov 17, 2022 · Updated 3 years ago
zhuohan123 / terapipe
☆79 · May 4, 2021 · Updated 5 years ago
S-Lab-System-Group / HeliosData
Helios Traces from SenseTime
☆63 · Sep 27, 2022 · Updated 3 years ago
SymbioticLab / Tiresias
Tiresias is a GPU cluster manager for distributed deep learning training.
☆165 · May 7, 2020 · Updated 6 years ago
lwangbm / Metis
Metis: Learning to Schedule Long-Running Applications in Shared Container Clusters at Scale
☆19 · May 27, 2020 · Updated 6 years ago
mblo / hire-cluster-simulator
Switches for HIRE: Resource Scheduling for Data Center In-Network Computing
☆13 · Jan 18, 2021 · Updated 5 years ago
SymbioticLab / ModelKeeper
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
☆36 · Jan 9, 2023 · Updated 3 years ago
CentML / DeepView.Predict
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆14 · Dec 16, 2024 · Updated last year
enyac-group / NeuralPower
The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks
☆24 · Jul 10, 2019 · Updated 7 years ago
siasosp23 / artifacts
☆24 · Aug 15, 2023 · Updated 2 years ago
ucbrise / hypersched
Deadline-based hyperparameter tuning on RayTune.
☆32 · Jan 16, 2020 · Updated 6 years ago
hiddenlayer2020 / ML-Job-Scheduler-MLFS
☆13 · Dec 18, 2020 · Updated 5 years ago
petuum-inc / poseidon-release
Release doc/tutorial/wheels for poseidon-tf
☆10 · Jan 18, 2018 · Updated 8 years ago
k82cn / kubesim
A simulator of Kubernetes for batch and service workloads.
☆49 · Mar 26, 2021 · Updated 5 years ago
shen203 / GPU_Microbenchmark
☆25 · Jun 24, 2022 · Updated 4 years ago
columbia / PrivateKube
Privacy Budget Orchestration in Machine Learning Workloads (OSDI '21)
☆27 · Oct 20, 2023 · Updated 2 years ago
microsoft / dist-ir
An IR for efficiently simulating distributed ML computation.
☆33 · Jan 13, 2024 · Updated 2 years ago
ide3a / connecticity
A serious game to support teaching and learning of topics of connected critical infrastructure in urban settings 🏙 🍃 💧 ⚡️ 🚗 👩‍💻
☆10 · Dec 1, 2021 · Updated 4 years ago
Sys-KU / DeepPlan
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆56 · Aug 6, 2025 · Updated 11 months ago
SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆149 · Jul 28, 2025 · Updated last year
NMSU-PEARL / GPUs-Energy
[CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs
☆15 · Dec 11, 2020 · Updated 5 years ago
TonyTangYu / delta-examples
☆12 · Apr 30, 2024 · Updated 2 years ago
casys-kaist / EnvPipe
☆27 · Aug 31, 2023 · Updated 2 years ago
matinraayai / Luthier
Luthier, a GPU binary instrumentation tool for AMD GPUs
☆28 · Updated this week
magruener / reconstructing-proprietary-video-streaming-algorithms
This repo contains the scripts used to create the data for the ATC2020 paper "Reconstructing proprietary video streaming algorithms"
☆14 · Mar 24, 2021 · Updated 5 years ago
PasaLab / Liquid
Intelligent Resource Requirement Estimation and Scheduling for Deep Learning Jobs on Distributed GPU Clusters
☆16 · Nov 18, 2021 · Updated 4 years ago
HKBU-HPML / ddl-benchmarks
ddl-benchmarks: Benchmarks for Distributed Deep Learning
☆36 · May 29, 2020 · Updated 6 years ago