csce585-mlsystems / project-athena
This is the course project for CSCE585: ML Systems. Students build their own machine learning systems on top of the provided infrastructure, Athena.
☆13Updated 4 years ago
Alternatives and similar repositories for project-athena
Users that are interested in project-athena are comparing it to the libraries listed below
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆18Updated 2 years ago
- An Attention Superoptimizer☆21Updated 5 months ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 3 years ago
- 📖 A curated list of resources dedicated to Machine Learning for Systems research☆11Updated 4 years ago
- GPU Task Scheduler (Python library)☆43Updated 4 years ago
- A list of awesome neural symbolic papers.☆47Updated 2 years ago
- Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization☆24Updated this week
- ☆14Updated 3 years ago
- This is the course taught by Prof. John Shen and Prof. Onur Mutlu from CMU☆11Updated 9 years ago
- ☆19Updated 4 years ago
- A minimalistic header only C++11 Neural Network library based on Eigen::Tensor☆20Updated 7 years ago
- An external memory allocator example for PyTorch.☆14Updated 3 years ago
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…☆15Updated last year
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 6 years ago
- This is a repo which contains some details about how to use OpenCL backend (Xilinx/Intel).☆25Updated 5 years ago
- labs and exercises for EECE.6540 Heterogeneous Computing at UMass Lowell☆13Updated 2 years ago
- Fibertree emulator☆12Updated 7 months ago
- ☆11Updated 4 years ago
- An FPGA integration and acceleration of the popular FAISS framework for approximate similarity search☆23Updated 5 years ago
- Collection of Papers and Trials on Deep Learning to aid EE design☆44Updated 4 years ago
- HWASim is a simulator for heterogeneous systems with CPUs and Hardware Accelerators (HWAs). It is released with the DASH memory scheduler…☆19Updated 9 years ago
- Efficient Compute-Communication Overlap for Distributed LLM Inference☆13Updated this week
- propositional satisfiability problem (SAT) goes neural and deep☆13Updated 3 years ago
- A curated list for Efficient Large Language Models☆11Updated last year
- SMASH is a hardware-software cooperative mechanism that enables highly-efficient indexing and storage of sparse matrices. The key idea of…☆16Updated 5 years ago
- The accelerometer analytical model published in ASPLOS 2020 (Accelerometer: Understanding Acceleration Opportunities for Data Center Overh…☆15Updated 5 years ago
- ICLR 2021☆48Updated 4 years ago
- CS294 AI Systems Class Website☆16Updated 3 years ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆14Updated 4 years ago
- ☆21Updated 2 years ago