UofT-EcoSystem/hfta

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UofT-EcoSystem/hfta)

UofT-EcoSystem / hfta

Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion

☆32

Alternatives and similar repositories for hfta

Users that are interested in hfta are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UofT-EcoSystem / rlscope
View on GitHub
RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
☆48Apr 7, 2021Updated 5 years ago
tlc-pack / tenset
View on GitHub
☆100Nov 4, 2022Updated 3 years ago
skylineprof / skyline
View on GitHub
🏙 Interactive in-editor performance profiling, visualization, and debugging for PyTorch neural networks.
☆32Dec 11, 2022Updated 3 years ago
awslabs / lorien
View on GitHub
☆42Sep 8, 2023Updated 2 years ago
CentML / DeepView.Explore
View on GitHub
🛠 VSCode plugin that provides visual interface for CentML Tools
☆15Dec 5, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
CentML / DeepView.Profile
View on GitHub
🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
☆65Jan 21, 2025Updated last year
uw-mad-dash / Accordion
View on GitHub
Code for reproducing experiments performed for Accoridon
☆13Jun 11, 2021Updated 5 years ago
awslabs / slapo
View on GitHub
A schedule language for large model training
☆153Aug 21, 2025Updated 11 months ago
SymbioticLab / Fluid
View on GitHub
A Generic Resource-Aware Hyperparameter Tuning Execution Engine
☆15Jan 8, 2022Updated 4 years ago
parasailteam / coconet
View on GitHub
☆85Dec 2, 2022Updated 3 years ago
microsoft / SuperScaler
View on GitHub
An experimental parallel training platform
☆57Mar 25, 2024Updated 2 years ago
Raphael-Hao / brainstorm
View on GitHub
Compiler for Dynamic Neural Networks
☆45Nov 13, 2023Updated 2 years ago
awslabs / raf
View on GitHub
☆144Jan 30, 2025Updated last year
chhzh123 / ptc-tutorial
View on GitHub
PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo
☆17Mar 13, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
anony-sub / chameleon
View on GitHub
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
☆26Nov 7, 2019Updated 6 years ago
SymbioticLab / ModelKeeper
View on GitHub
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
☆36Jan 9, 2023Updated 3 years ago
microsoft / nnfusion
View on GitHub
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆1,002Sep 19, 2024Updated last year
zhuzilin / pytorch-malloc
View on GitHub
An external memory allocator example for PyTorch.
☆16Aug 10, 2025Updated 11 months ago
Soroosh129 / NeuOS
View on GitHub
Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"
☆22Jan 4, 2021Updated 5 years ago
Raphael-Hao / Abacus
View on GitHub
☆38Jun 27, 2025Updated last year
lwangbm / Metis
View on GitHub
Metis: Learning to Schedule Long-Running Applications in Shared Container Clusters with at Scale
☆19May 27, 2020Updated 6 years ago
CentML / DeepView.Predict
View on GitHub
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆14Dec 16, 2024Updated last year
hku-systems / naspipe
View on GitHub
☆14Jan 12, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Lunderberg / tvmcon-2021
View on GitHub
Slides from 2021-12-15 talk, "TVM Developer Bootcamp – Writing Hardware Backends"
☆11Jan 20, 2022Updated 4 years ago
ucbrise / hypersched
View on GitHub
Deadline-based hyperparameter tuning on RayTune.
☆32Jan 16, 2020Updated 6 years ago
uwsampl / SparseTIR
View on GitHub
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆145Mar 31, 2023Updated 3 years ago
magruener / reconstructing-proprietary-video-streaming-algorithms
View on GitHub
This repo contains the scripts used to create the data for the ATC2020 paper "Reconstructing proprietary video streaming algorithms"
☆14Mar 24, 2021Updated 5 years ago
ise-uiuc / tzer
View on GitHub
Tzer: TVM Implementation of "Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation (OOPSLA'22)“.
☆72Mar 9, 2023Updated 3 years ago
he-actlab / polymath
View on GitHub
☆23Feb 18, 2025Updated last year
casys-kaist / EnvPipe
View on GitHub
☆27Aug 31, 2023Updated 2 years ago
UofT-EcoSystem / DietCode
View on GitHub
DietCode Code Release
☆65Jul 21, 2022Updated 4 years ago
ampersand-projects / tilt
View on GitHub
☆11Jun 9, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
msr-fiddle / CheckFreq
View on GitHub
☆57Jan 25, 2021Updated 5 years ago
uwplse / tensat
View on GitHub
Re-implementation of the TASO compiler using equality saturation
☆142Jun 28, 2021Updated 5 years ago
ucbrise / actnn
View on GitHub
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training
☆199Dec 22, 2022Updated 3 years ago
limenghao / AdaTune
View on GitHub
This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).
☆14May 16, 2021Updated 5 years ago
bytedance / QSync
View on GitHub
Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20Feb 23, 2024Updated 2 years ago
HPCRL / ASPLOS_artifact
View on GitHub
☆13Nov 1, 2021Updated 4 years ago
microsoft / hivedscheduler
View on GitHub
Kubernetes Scheduler for Deep Learning
☆263May 22, 2022Updated 4 years ago