kooyunmo / pytorch-uvm
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆13 Updated 4 years ago
Alternatives and similar repositories for pytorch-uvm:
Users who are interested in pytorch-uvm are comparing it to the libraries listed below.
- PyTorch-UVM on super-large language models. ☆14 Updated 4 years ago
- ☆23 Updated 2 years ago
- ☆35 Updated last year
- ☆25 Updated 4 years ago
- ☆31 Updated 7 months ago
- Exploring the Design Space of Page Management for Multi-Tiered Memory Systems (USENIX ATC '21) ☆43 Updated 2 years ago
- ☆44 Updated 3 weeks ago
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated… ☆28 Updated last year
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin… ☆40 Updated 10 months ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training ☆31 Updated last year
- ☆67 Updated 4 years ago
- ☆53 Updated 3 years ago
- ☆23 Updated last year
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche… ☆89 Updated 2 years ago
- Artifacts for our ASPLOS'23 paper ElasticFlow ☆53 Updated 8 months ago
- ☆23 Updated last year
- This is a read-only mirror of the gem5 simulator. The upstream repository is stored in https://gem5.googlesource.com, code reviews shoul… ☆29 Updated 5 months ago
- A Cycle-level simulator for M2NDP ☆22 Updated last month
- Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23) ☆54 Updated 9 months ago
- ☆37 Updated 3 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling ☆9 Updated 10 months ago
- Artifacts for our SIGCOMM'22 paper Muri ☆41 Updated last year
- Cluster Far Mem, framework to execute single job and multi job experiments using fastswap ☆21 Updated last year
- Efficient-Tensor-Management-on-HM-for-Deep-Learning ☆9 Updated 3 years ago
- Thinking is hard - automate it ☆19 Updated 2 years ago
- ☆29 Updated 3 years ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2… ☆15 Updated last year
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering ☆39 Updated 5 months ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling. ☆40 Updated 2 years ago
- Source code for the software implementation of Sibyl proposed in our ISCA 2022 paper: Gagandeep Singh et al., "Sibyl: Adaptive and Exten… ☆33 Updated 2 years ago