atomicapple0/libsmctrl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/atomicapple0/libsmctrl)

atomicapple0 / libsmctrl

Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.

☆67

Alternatives and similar repositories for libsmctrl

Users that are interested in libsmctrl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gofreelee / SpaceServe
View on GitHub
☆32Jul 13, 2026Updated last week
casys-kaist / HUVM
View on GitHub
☆27Aug 19, 2022Updated 3 years ago
eunomia-bpf / cupti-tutorial
View on GitHub
Tutorials for NVIDIA CUPTI samples
☆70Updated this week
eunomia-bpf / nccl-eBPF
View on GitHub
☆20Jul 7, 2026Updated 2 weeks ago
eth-easl / orion
View on GitHub
An interference-aware scheduler for fine-grained GPU sharing
☆164Nov 26, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wu-kan / wuk_cupti_wrapper
View on GitHub
a simple API to use CUPTI
☆10Aug 19, 2025Updated 11 months ago
Multi-LLM / prism-research
View on GitHub
Research prototype of PRISM — a cost-efficient multi-LLM serving system with flexible time- and space-based GPU sharing.
☆71Mar 17, 2026Updated 4 months ago
SJTU-IPADS / reef-artifacts
View on GitHub
A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.
☆43May 29, 2022Updated 4 years ago
xinhao-luo / ClusterFusion
View on GitHub
[NeurIPS 2025] ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive
☆75Dec 11, 2025Updated 7 months ago
zhuzilin / vllm-group
View on GitHub
☆12Nov 5, 2024Updated last year
open-neutrino / neutrino
View on GitHub
☆264Dec 25, 2025Updated 7 months ago
OSU-STARLAB / UVM_benchmark
View on GitHub
☆34Sep 9, 2020Updated 5 years ago
sarchlab / mgpusim
View on GitHub
A highly-flexible GPU simulator for AMD GPUs.
☆260Jul 17, 2026Updated last week
ovg-project / kvcached
View on GitHub
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
☆1,115Updated this week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
HPC-Research-Lab / STMatch
View on GitHub
☆12Aug 17, 2022Updated 3 years ago
eunomia-bpf / gpu_ext
View on GitHub
eBPF for GPU UVM offloading and scheduling in Linux kernel
☆59Apr 15, 2026Updated 3 months ago
thustorage / GPreempt
View on GitHub
☆25May 18, 2025Updated last year
Raphael-Hao / Abacus
View on GitHub
☆38Jun 27, 2025Updated last year
zhuohan123 / terapipe
View on GitHub
☆79May 4, 2021Updated 5 years ago
jamro1149 / Hydra
View on GitHub
Automatic Parallelism Using LLVM
☆10Aug 2, 2014Updated 11 years ago
thustorage / Medusa
View on GitHub
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
☆47May 13, 2025Updated last year
RRZE-HPC / gpu-benches
View on GitHub
collection of benchmarks to measure basic GPU capabilities
☆530Oct 24, 2025Updated 9 months ago
Sys-KU / DeepPlan
View on GitHub
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆56Aug 6, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
llumnix-project / llumnix-ray
View on GitHub
Efficient and easy multi-instance LLM serving
☆563Mar 12, 2026Updated 4 months ago
vancemiller / CUDA-preemption
View on GitHub
Experiments evaluating preemption on the NVIDIA Pascal architecture
☆16Nov 10, 2016Updated 9 years ago
fzyzcjy / torch_memory_saver
View on GitHub
Allow torch tensor memory to be released and resumed later
☆260Updated this week
microsoft / vattention
View on GitHub
Dynamic Memory Management for Serving LLMs without PagedAttention
☆505Jul 17, 2026Updated last week
nicexlab / GeminiFS
View on GitHub
GeminiFS: A Companion File System for GPUs
☆84Jul 8, 2026Updated 2 weeks ago
Bruce-Lee-LY / cuda_hook
View on GitHub
Hooked CUDA-related dynamic libraries by using automated code generation tools.
☆173Dec 12, 2023Updated 2 years ago
jinzh-hust / GraphM
View on GitHub
An efficient storage system for concurrent graph processing
☆10Feb 1, 2021Updated 5 years ago
DiT-Serving / TetriServe
View on GitHub
[ASPLOS' 26] TetriServe: Efficiently Serving Mixed DiT Workloads
☆17Mar 12, 2026Updated 4 months ago
0x5ec1ab / gpu-tlb
View on GitHub
☆84Apr 18, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
sjtu-epcc / Tacker
View on GitHub
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆33Feb 10, 2025Updated last year
0xD0GF00D / DocumentSASS
View on GitHub
Unofficial description of the CUDA assembly (SASS) instruction sets.
☆224Jul 18, 2025Updated last year
pkusys / TGS
View on GitHub
Artifacts for our NSDI'23 paper TGS
☆97Jun 10, 2024Updated 2 years ago
uchuhimo / amanda
View on GitHub
☆18Apr 21, 2024Updated 2 years ago
NVIDIA / cuda-checkpoint
View on GitHub
CUDA checkpoint and restore utility
☆474Jul 6, 2026Updated 2 weeks ago
tongyx361 / symeval
View on GitHub
Evaluation utilities based on SymPy.
☆22Dec 12, 2024Updated last year
inclusionAI / asystem-amem
View on GitHub
A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.
☆113Dec 17, 2025Updated 7 months ago