ConstantPark / DL_Compiler
Study group on deep learning compilers
☆149 · Updated last year
Related projects:
- ☆101 · Updated last year
- NEST Compiler ☆114 · Updated 2 months ago
- Lightweight and Parallel Deep Learning Framework ☆261 · Updated last year
- Study parallel programming - CUDA, OpenMP, MPI, Pthread ☆54 · Updated 2 years ago
- ☆21 · Updated last year
- ☆38 · Updated this week
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together ☆63 · Updated 6 years ago
- ☆56 · Updated last year
- Shared Middle-Layer for Triton Compilation ☆160 · Updated this week
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration ☆191 · Updated 2 years ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations ☆175 · Updated 2 years ago
- System for automated integration of deep learning backends. ☆48 · Updated 2 years ago
- A performance library for machine learning applications. ☆178 · Updated 11 months ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware. ☆98 · Updated 9 months ago
- ☆13 · Updated last year
- Experimental deep learning framework written in Rust ☆13 · Updated last year
- This repository is a meta package to provide Samsung OneMCC (Memory Coupled Computing) infrastructure. ☆25 · Updated 11 months ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning ☆129 · Updated last year
- A self-contained version of the tutorial which can be easily cloned and viewed by others. ☆26 · Updated 5 years ago
- NNtrainer is a software framework for training neural network models on devices. ☆144 · Updated last week
- ☆113 · Updated last year
- Benchmark code for the "Online normalizer calculation for softmax" paper ☆52 · Updated 6 years ago
- one-shot-tuner ☆8 · Updated last year
- NEST-SNN ☆13 · Updated 2 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators ☆100 · Updated last year
- Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23) ☆51 · Updated 5 months ago
- ☆73 · Updated 5 months ago
- ☆33 · Updated 5 months ago
- OpenAI Triton backend for Intel® GPUs ☆126 · Updated this week
- PyTorch emulation library for Microscaling (MX)-compatible data formats ☆143 · Updated last month