casys-kaist/glet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/casys-kaist/glet)

casys-kaist / glet

☆53

Alternatives and similar repositories for glet

Users that are interested in glet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SJTU-IPADS / reef
View on GitHub
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆108Dec 24, 2022Updated 3 years ago
icloud-ecnu / igniter
View on GitHub
iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.
☆39Jun 11, 2024Updated 2 years ago
SJTU-IPADS / reef-artifacts
View on GitHub
A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.
☆43May 29, 2022Updated 4 years ago
pkusys / TGS
View on GitHub
Artifacts for our NSDI'23 paper TGS
☆97Jun 10, 2024Updated 2 years ago
casys-kaist / Edge-scheduler
View on GitHub
☆15Aug 5, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
casys-kaist / CoVA
View on GitHub
Official code repository for "CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics [USENIX ATC 22]"
☆18Sep 19, 2024Updated last year
eth-easl / orion
View on GitHub
An interference-aware scheduler for fine-grained GPU sharing
☆164Nov 26, 2025Updated 8 months ago
netx-repo / PipeSwitch
View on GitHub
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127May 9, 2022Updated 4 years ago
SymbioticLab / Salus
View on GitHub
Fine-grained GPU sharing primitives
☆149Jul 28, 2025Updated last year
HuaizhengZhang / MIGProfiler
View on GitHub
Multi-Instance-GPU profiling tool
☆58Apr 16, 2023Updated 3 years ago
SJTU-IPADS / disb
View on GitHub
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆58Aug 21, 2024Updated last year
LLMServe / dLoRA-artifact
View on GitHub
☆32May 28, 2024Updated 2 years ago
uwsampl / nexus
View on GitHub
☆85Feb 5, 2026Updated 5 months ago
BLepers / JohnnyCache
View on GitHub
Johnny Cache: the End of DRAM Cache Conflicts (in Tiered Main Memory Systems)
☆20Aug 2, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
tanzelin430 / libsmctrl
View on GitHub
libsmctrl论文的复现，添加了python端接口，可以在python端灵活调用接口来分配计算资源
☆12May 21, 2024Updated 2 years ago
Sys-KU / DeepPlan
View on GitHub
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆56Aug 6, 2025Updated 11 months ago
Serverless-Federated-Learning / FedLess
View on GitHub
Secure and Scalable Federated Learning using Serverless Computing
☆13Jan 31, 2024Updated 2 years ago
sjtu-epcc / DVABatch
View on GitHub
☆21May 13, 2022Updated 4 years ago
stanford-futuredata / gavel
View on GitHub
Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆139Jul 25, 2024Updated 2 years ago
Raphael-Hao / Abacus
View on GitHub
☆38Jun 27, 2025Updated last year
resource-disaggregation / karma
View on GitHub
Resource Allocation for Dynamic Demands
☆22Dec 26, 2023Updated 2 years ago
UMass-LIDS / Proteus
View on GitHub
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆13Mar 7, 2024Updated 2 years ago
eniac / paella
View on GitHub
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆72May 1, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
stanford-mast / INFaaS
View on GitHub
Model-less Inference Serving
☆94Nov 4, 2023Updated 2 years ago
Vic0428 / Paper-Reading-Lists
View on GitHub
Random collections of my interested research papers / projects
☆20May 20, 2021Updated 5 years ago
DMTCP-CRAC / CRAC-early-development
View on GitHub
☆23Dec 22, 2023Updated 2 years ago
alibaba / GPU-scheduler-for-deep-learning
View on GitHub
GPU-scheduler-for-deep-learning
☆213Nov 5, 2020Updated 5 years ago
msr-fiddle / synergy
View on GitHub
☆54Dec 13, 2022Updated 3 years ago
gajagajago / deepshare
View on GitHub
Network Contention-Aware Cluster Scheduling with Reinforcement Learning (IEEE ICPADS'23)
☆20Jul 8, 2025Updated last year
rtenlab / gcaps-super-repo
View on GitHub
GCAPS: GPU Context-Aware Preemptive Scheduling Approach
☆16Mar 22, 2026Updated 4 months ago
microsoft / edge-video-services
View on GitHub
Edge Video Services (EVS) is a Microsoft platform for developing video analytics solutions that can be deployed across the edge and the c…
☆30Jul 5, 2022Updated 4 years ago
timlee0212 / SiDA-MoE
View on GitHub
Code for MLSys 2024 Paper "SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models"
☆22Apr 13, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
amlatyrngom / SQLIR
View on GitHub
SQL Optimizations using MLIR
☆12Apr 5, 2020Updated 6 years ago
dame-cell / Triformer
View on GitHub
Transformers components but in Triton
☆34May 9, 2025Updated last year
Mr-Linus / SCV
View on GitHub
SCV is a distributed cluster GPU sniffer. SCV是一个分布式GPU嗅探器
☆20Feb 25, 2023Updated 3 years ago
Raphael-Hao / brainstorm
View on GitHub
Compiler for Dynamic Neural Networks
☆45Nov 13, 2023Updated 2 years ago
Tencent / BlazerML-tvm
View on GitHub
Tencent Distribution of TVM
☆16Apr 7, 2023Updated 3 years ago
microsoft / cusync
View on GitHub
☆27Feb 20, 2024Updated 2 years ago
hkust-adsl / kubernetes-scheduler-simulator
View on GitHub
Kubernetes Scheduler Simulator
☆127Jul 31, 2024Updated last year