KAIST-NCL / Accelerator-Docker
Accelerator-Docker : provides common interface for automatic passthrough of heterogeneous hardware accelerators in docker
☆36Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Accelerator-Docker
- Kubernetes device plugin supporting FPGA and other accelerators☆11Updated 5 years ago
- MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions☆15Updated 3 years ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆55Updated 3 weeks ago
- ☆16Updated 4 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆26Updated 5 years ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆19Updated 3 years ago
- Experimental deep learning framework written in Rust☆14Updated 2 years ago
- Cluster simulator with far memory☆12Updated 4 years ago
- Multi-Instance-GPU profiling tool☆53Updated last year
- Modified version of PyTorch able to work with changes to GPGPU-Sim☆45Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆19Updated last year
- ☆12Updated 3 years ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆22Updated last year
- ☆12Updated 2 years ago
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆60Updated 3 months ago
- ☆18Updated 2 years ago
- ☆31Updated last year
- ☆15Updated 3 years ago
- SOTA Learning-augmented Systems☆32Updated 2 years ago
- ☆25Updated 5 years ago
- ☆10Updated 8 months ago
- ☆22Updated last year
- Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)☆54Updated 7 months ago
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks☆21Updated 5 years ago
- [DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation☆11Updated last year
- This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).☆13Updated 3 years ago
- ☆38Updated 4 years ago
- Study Group of Deep Learning Compiler☆152Updated last year
- ☆60Updated 3 years ago
- ☆12Updated last year