kooyunmo / pytorch-uvmLinks
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆15Updated 5 years ago
Alternatives and similar repositories for pytorch-uvm
Users that are interested in pytorch-uvm are comparing it to the libraries listed below
Sorting:
- PyTorch-UVM on super-large language models.☆17Updated 5 years ago
- ☆26Updated 3 years ago
- ☆41Updated 2 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆104Updated 3 years ago
- ☆28Updated last year
- ☆53Updated last year
- ☆66Updated 7 months ago
- Synthesizer for optimal collective communication algorithms☆124Updated last year
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆39Updated 2 years ago
- Repository for MLCommons Chakra schema and tools☆153Updated 3 months ago
- LLM serving cluster simulator☆135Updated last year
- ☆38Updated 7 months ago
- ☆23Updated 2 years ago
- ☆216Updated 2 months ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆80Updated 2 years ago
- ☆52Updated 3 years ago
- An interference-aware scheduler for fine-grained GPU sharing☆159Updated 2 months ago
- ☆166Updated last year
- [ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access☆56Updated 6 months ago
- Efficient-Tensor-Management-on-HM-for-Deep-Learning☆10Updated 4 years ago
- ☆13Updated last year
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆43Updated 3 years ago
- ☆31Updated last year
- ☆84Updated 3 years ago
- ☆33Updated 5 years ago
- ☆81Updated 5 years ago
- Artifacts for our ASPLOS'23 paper ElasticFlow☆55Updated last year
- ☆40Updated 3 years ago
- MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters☆20Updated 2 years ago
- ☆56Updated 5 years ago