csce585-mlsystems / project-athena
This is the course project for CSCE585: ML Systems. Students will build their machine learning systems based on the provided infrastructure --- Athena.
☆13Updated 4 years ago
Alternatives and similar repositories for project-athena:
Users that are interested in project-athena are comparing it to the libraries listed below
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 3 years ago
- An Attention Superoptimizer☆21Updated 3 months ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆18Updated 2 years ago
- ☆11Updated 4 years ago
- An FPGA integration and acceleration of the popular FAISS framework for approximate similarity search☆23Updated 5 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆26Updated 4 months ago
- Benchmark PyTorch Custom Operators☆14Updated last year
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…☆14Updated 11 months ago
- The quantitative performance comparison among DL compilers on CNN models.☆74Updated 4 years ago
- An external memory allocator example for PyTorch.☆14Updated 3 years ago
- General system research material (not limited to paper) reading notes.☆21Updated 4 years ago
- Github repo backing website for the CS Assistant Professor Handbook☆26Updated 7 months ago
- This is the (evolving) reading list for the seminar.☆57Updated 4 years ago
- ☆22Updated 3 years ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆17Updated 2 years ago
- SyReNN: Symbolic Representations for Neural Networks☆40Updated 2 years ago
- ☆12Updated 2 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- Code for reproducing work of ICML 2019 paper: Memory-Optimal Direct Convolutions for Maximizing Classification Accuracy in Embedded Appli…☆12Updated 5 years ago
- Collection of Papers and Trials on Deep Learning to aid EE design☆44Updated 4 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago
- TVMFuzz: fuzzing tensor-level intermediate representation in TVM☆28Updated 4 years ago
- This is a repo which contains some details about how to use OpenCL backend (Xilinx/Intel).☆24Updated 5 years ago
- Summary for Stanford class CS243 - Program Analysis and Optimizations | Winter 2016☆31Updated 9 years ago
- Metal: Learning a Meta-Solver for Syntax-Guided Program Synthesis☆15Updated 6 years ago
- A GPU (CUDA) implementation, with a python interface, of the approximated KNN graph computation with Random Sample Forest algorithm KNN.☆12Updated 4 months ago
- 📝 "End-to-end Deep Learning of Optimization Heuristics" (🥇 PACT'17 Best Paper)☆73Updated 2 years ago
- ☆11Updated 3 years ago
- 📖 A curated list of resources dedicated to Machine Learning for Systems research☆11Updated 4 years ago
- ☆18Updated 4 years ago