casys-kaist / HUVM
☆26 · Aug 19, 2022 · Updated 3 years ago
Alternatives and similar repositories for HUVM
Users who are interested in HUVM are comparing it to the libraries listed below.
- ☆33 · Sep 9, 2020 · Updated 5 years ago
- PyTorch-UVM on super-large language models. ☆17 · Dec 21, 2020 · Updated 5 years ago
- Official code repository for "CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics [USENIX ATC 22]" ☆18 · Sep 19, 2024 · Updated last year
- [IEEE CAL 2025] Accelerating Page Migrations in Operating Systems with Intel DSA ☆16 · Nov 20, 2024 · Updated last year
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆15 · Dec 21, 2020 · Updated 5 years ago
- [USENIX ATC 2021] Exploring the Design Space of Page Management for Multi-Tiered Memory Systems ☆48 · Mar 31, 2022 · Updated 3 years ago
- ☆81 · Nov 16, 2020 · Updated 5 years ago
- [ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access ☆56 · Aug 6, 2025 · Updated 6 months ago
- Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration ☆33 · Jan 8, 2026 · Updated last month
- Secure Inference Resilient Against Malicious Clients ☆15 · May 3, 2022 · Updated 3 years ago
- GVProf: A Value Profiler for GPU-based Clusters ☆52 · Mar 24, 2024 · Updated last year
- ☆11 · Jul 2, 2024 · Updated last year
- ☆13 · Oct 6, 2024 · Updated last year
- GPU Performance Advisor ☆65 · Jul 25, 2022 · Updated 3 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆15 · Jun 21, 2019 · Updated 6 years ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated… ☆67 · Jan 22, 2026 · Updated 3 weeks ago
- ☆18 · May 8, 2021 · Updated 4 years ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated… ☆14 · Jun 24, 2020 · Updated 5 years ago
- ☆16 · Feb 27, 2022 · Updated 3 years ago
- ☆14 · Jan 12, 2022 · Updated 4 years ago
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated… ☆39 · Sep 25, 2023 · Updated 2 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling ☆12 · Mar 7, 2024 · Updated last year
- ☆36 · Jun 10, 2024 · Updated last year
- Enabling pure data parallel training of DLRM via caching and prefetching ☆17 · Oct 29, 2021 · Updated 4 years ago
- ☆19 · Aug 26, 2021 · Updated 4 years ago
- ☆38 · Jun 27, 2025 · Updated 7 months ago
- LineFS: Efficient SmartNIC Offload of a Distributed File System with Pipeline Parallelism ☆89 · Dec 24, 2021 · Updated 4 years ago
- CasHMC: A Cycle-accurate Simulator for Hybrid Memory Cube ☆23 · Aug 10, 2018 · Updated 7 years ago
- [DATE'23] The official code for paper <CLAP: Locality Aware and Parallel Triangle Counting with Content Addressable Memory> ☆23 · Jan 19, 2026 · Updated 3 weeks ago
- CUDAAdvisor: a GPU profiling tool ☆52 · Aug 24, 2018 · Updated 7 years ago
- ngAP's artifact for ASPLOS'24 ☆25 · Jul 29, 2025 · Updated 6 months ago
- Johnny Cache: the End of DRAM Cache Conflicts (in Tiered Main Memory Systems) ☆20 · Aug 2, 2023 · Updated 2 years ago
- Fine-grained GPU sharing primitives ☆148 · Jul 28, 2025 · Updated 6 months ago
- Cheddar: A Swift Fully Homomorphic Encryption (FHE) GPU Library ☆46 · Jan 14, 2026 · Updated last month
- Pond: CXL-Based Memory Pooling Systems for Cloud Platforms (ASPLOS'23) ☆219 · Oct 13, 2024 · Updated last year
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support… ☆50 · Aug 21, 2018 · Updated 7 years ago
- Arbitrary offloads for RDMA NICs ☆99 · Apr 25, 2022 · Updated 3 years ago
- ☆26 · Aug 31, 2023 · Updated 2 years ago
- rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations. ☆58 · Jul 7, 2025 · Updated 7 months ago