KAIST-NCL / Accelerator-Docker
Accelerator-Docker : provides common interface for automatic passthrough of heterogeneous hardware accelerators in docker
☆36Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Accelerator-Docker
- Kubernetes device plugin supporting FPGA and other accelerators☆11Updated 5 years ago
- Neural Network Acceleration such as ASIC, FPGA, GPU, and PIM☆51Updated 4 years ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆58Updated 3 weeks ago
- Neural Network Acceleration using CPU/GPU, ASIC, FPGA☆60Updated 4 years ago
- Study Group of Deep Learning Compiler☆155Updated last year
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆60Updated 3 months ago
- ☆31Updated last year
- nnq_cnd_study stands for Neural Network Quantization & Compact Networks Design Study☆13Updated 4 years ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆22Updated last year
- ☆18Updated 2 years ago
- Experimental deep learning framework written in Rust☆14Updated 2 years ago
- SOTA Learning-augmented Systems☆33Updated 2 years ago
- Post-training sparsity-aware quantization☆33Updated last year
- Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)☆54Updated 7 months ago
- MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions☆15Updated 3 years ago
- ☆12Updated last year
- Model-less Inference Serving☆82Updated last year
- ☆12Updated 4 years ago
- ☆60Updated 3 years ago
- ☆39Updated last month
- ☆100Updated last year
- ☆12Updated 2 years ago
- ☆23Updated last year
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆26Updated 5 years ago
- Nsight Systems in Docker☆17Updated 11 months ago
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks☆21Updated 5 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆10Updated last year
- ☆30Updated last year
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆19Updated 3 years ago
- A 8-/16-/32-/64-bit floating point number family☆16Updated 2 years ago