This is the proof-of-concept CPU implementation of ASPEN used for the NeurIPS'23 paper ASPEN: Breaking Operator Barriers for Efficient Parallelization of Deep Neural Networks.
☆13Apr 4, 2024Updated last year
Alternatives and similar repositories for ASPEN
Users that are interested in ASPEN are comparing it to the libraries listed below
Sorting:
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆28May 10, 2021Updated 4 years ago
- Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs…☆23Dec 19, 2024Updated last year
- notepad++堆缓冲区溢出漏洞CVE-2023-40031 分析与复现☆15Sep 8, 2023Updated 2 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Mar 12, 2026Updated last week
- ☆12Jan 18, 2026Updated 2 months ago
- A library for finding the maximum common induced subgraph between two graphs and compute their similarity (correlation).☆15Oct 21, 2019Updated 6 years ago
- Resources for deep learning with satellite & aerial imagery☆14Sep 29, 2021Updated 4 years ago
- PyTorch implementation of a 9-layer ResNet for CIFAR-10.☆12May 8, 2024Updated last year
- The official implementation of the paper SimVP: Towards Simple yet Powerful Spatiotemporal Predictive learning.☆11Jan 2, 2024Updated 2 years ago
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆64Aug 5, 2024Updated last year
- ☆26Feb 20, 2024Updated 2 years ago
- PyTorch implementation of federated learning on MNIST☆23Feb 19, 2024Updated 2 years ago
- GLSearch: Maximum Common Subgraph Detection via Learning to Search☆24Jun 25, 2023Updated 2 years ago
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- ☆10Sep 22, 2021Updated 4 years ago
- A new congestion control algorithm for LEO satellite networks.☆34Jan 22, 2026Updated last month
- ☆23Jan 6, 2025Updated last year
- An example of how to use the multiprocessing package along with PyTorch.☆21Jan 15, 2021Updated 5 years ago
- Starlink Prometheus Exporter Monitoring Stack☆35Jan 29, 2026Updated last month
- Neural network compatible DDEs☆13Apr 8, 2025Updated 11 months ago
- Dynamic mode decomposition in Python☆13Jun 9, 2015Updated 10 years ago
- ☆13Jun 25, 2021Updated 4 years ago
- ☆18Mar 4, 2025Updated last year
- [ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennige…☆92Feb 26, 2024Updated 2 years ago
- DECAF is a tool that measure the performance of cloud gaming platforms such as Google Stadia, Amazon Luna, NVIDIA GeForceNow.☆12Dec 17, 2021Updated 4 years ago
- ☆12Jun 29, 2024Updated last year
- Tutorials of Extending and importing TVM with CMAKE Include dependency.☆15Oct 11, 2024Updated last year
- USB 3.0 Stereo Camera☆18Mar 19, 2025Updated last year
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Nov 17, 2025Updated 4 months ago
- [DEPRECATED] A community maintained fork of Community Cellular Manager☆14Sep 7, 2018Updated 7 years ago
- Toolkit for Bayesian scaling analysis☆14Sep 8, 2022Updated 3 years ago
- Extending rllab to event-driven multiagent environments☆13Oct 1, 2018Updated 7 years ago
- Research prototype of PRISM — a cost-efficient multi-LLM serving system with flexible time- and space-based GPU sharing.☆58Updated this week
- A multiphase field model based on machine learning method☆49Feb 10, 2022Updated 4 years ago
- This repository contains the scripts for reproducing the results presented in Costa AC, Ahamed T, Jordan D, Stephens GJ (2023) "A Markov…☆11Sep 25, 2025Updated 5 months ago
- 用于训练中文DeepSeek R1大模型的Lora脚本☆13Mar 20, 2025Updated last year
- [ICML 2022] Learning Efficient and Robust Ordinary Differential \\ Equations via Invertible Neural Networks☆10Apr 14, 2023Updated 2 years ago
- This repo contains the details of the HRPlanesv2, a high-resolution satellite imagery dataset for aircraft detection, as well as the benc…☆28Sep 23, 2023Updated 2 years ago