AgrawalAmey / awesome-ml-for-systems
A curated list of resources dedicated to Machine Learning for Systems research
☆10 Updated 4 years ago
Alternatives and similar repositories for awesome-ml-for-systems:
Users who are interested in awesome-ml-for-systems are comparing it to the repositories listed below
- ☆12 Updated 2 years ago
- This is the (evolving) reading list for the seminar. ☆57 Updated 4 years ago
- ☆21 Updated 6 years ago
- This repo collects state-of-the-art GNN hardware acceleration papers ☆54 Updated 3 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef… ☆61 Updated 2 years ago
- one-shot-tuner ☆8 Updated 2 years ago
- Repo for the IISWC 2018 submission ☆9 Updated 2 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse. ☆24 Updated 4 years ago
- ☆10 Updated 3 years ago
- ☆14 Updated 3 years ago
- ColTraIn HBFP Training Emulator ☆16 Updated 2 years ago
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization ☆28 Updated last year
- ☆14 Updated last year
- ☆21 Updated last year
- Machine Learning System ☆14 Updated 4 years ago
- ICLR 2021 ☆46 Updated 3 years ago
- SOTA Learning-augmented Systems ☆34 Updated 2 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration" ☆18 Updated 3 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation ☆27 Updated 5 years ago
- research, experimentation and implementation of hardware-agnostic accelerated DL framework ☆36 Updated 3 weeks ago
- ☆73 Updated 3 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆15 Updated 5 years ago
- Code for our paper "Binary Graph Neural Networks", CVPR 2021 ☆37 Updated 3 years ago
- This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future. ☆15 Updated 7 months ago
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference ☆17 Updated last year
- ☆13 Updated last year
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult… ☆39 Updated 11 months ago
- G3: A Programmable GNN Training System on GPU ☆43 Updated 4 years ago
- Codebase for ICML'24 paper: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs ☆24 Updated 7 months ago
- PIM-ML is a benchmark for training machine learning algorithms on the UPMEM architecture, which is the first publicly-available real-worl… ☆22 Updated last month