Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.
☆56Nov 24, 2025Updated 3 months ago
Alternatives and similar repositories for libsmctrl
Users that are interested in libsmctrl are comparing it to the libraries listed below
Sorting:
- Tutorials for NVIDIA CUPTI samples☆59Nov 3, 2025Updated 4 months ago
- ☆25Mar 9, 2026Updated last week
- An interference-aware scheduler for fine-grained GPU sharing☆161Nov 26, 2025Updated 3 months ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆43May 29, 2022Updated 3 years ago
- Automatic Parallelism Using LLVM☆10Aug 2, 2014Updated 11 years ago
- ☆12Aug 17, 2022Updated 3 years ago
- ☆77Apr 18, 2025Updated 11 months ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆34Feb 10, 2025Updated last year
- ☆14Feb 5, 2025Updated last year
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆20Jan 24, 2025Updated last year
- DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit☆91Jan 26, 2026Updated last month
- [ArXiv 2025] A curated list of papers on on-device large language models, focusing on model compression and system optimization technique…☆27Jan 27, 2026Updated last month
- a simple API to use CUPTI☆10Aug 19, 2025Updated 7 months ago
- ☆12May 13, 2025Updated 10 months ago
- A group of students who are interested in Compilers, and they want to improve themselves together.☆25Aug 23, 2022Updated 3 years ago
- Hooked CUDA-related dynamic libraries by using automated code generation tools.☆171Dec 12, 2023Updated 2 years ago
- ☆28Jan 28, 2026Updated last month
- ☆17Apr 9, 2025Updated 11 months ago
- Personal house automation system with a REST/Json interface☆18Feb 20, 2024Updated 2 years ago
- This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…☆38Sep 25, 2023Updated 2 years ago
- A collection of benchmarks and tests for the Patmos processor and compiler☆18Dec 2, 2024Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 4 months ago
- This is a repository of Binary General Matrix Multiply (BGEMM) by customized CUDA kernel. Thank FP6-LLM for the wheels!☆18Aug 30, 2024Updated last year
- Open-source implementation of the CUDA API.☆13May 5, 2012Updated 13 years ago
- 2D and 3D Matrix Convolution and Matrix Multiplication with CUDA☆10Jun 14, 2021Updated 4 years ago
- Potluck with different functions for different purposes that can be shared among C programs☆13Mar 4, 2024Updated 2 years ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆21Oct 15, 2024Updated last year
- Artifacts for our NSDI'23 paper TGS☆98Jun 10, 2024Updated last year
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks☆15May 18, 2022Updated 3 years ago
- Reasoning LLMs optimized for Chisel code generation☆24Jun 19, 2025Updated 9 months ago
- A test case for VFIO_PLATFORM currently based on the PL330 DMA controller. The effort on VFIO_PLATFORM has been partially funded by the S…☆13Dec 12, 2022Updated 3 years ago
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆17Aug 30, 2024Updated last year
- Logging library for C applications☆23Mar 4, 2024Updated 2 years ago
- PyTorch Implementation of GPT-2☆31Sep 4, 2024Updated last year
- Based on research papers which discuss Fuzzy concept☆12Jul 19, 2019Updated 6 years ago
- ☆51Mar 9, 2026Updated last week
- The official repository for the paper Multilingual Mathematical Autoformalization☆38May 20, 2024Updated last year
- Pressio is latin for compression. Libpressio is a C++ library with C compatible bindings to abstract between different lossless and lossy…☆16Dec 30, 2024Updated last year
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year