shriramsb/vDNN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shriramsb/vDNN)

shriramsb / vDNN

☆22

Alternatives and similar repositories for vDNN

Users that are interested in vDNN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shriramsb / vdnn-plus-plus
View on GitHub
Implementation of vDNN++; an improvement over vDNN
☆18Dec 7, 2018Updated 7 years ago
linnanwang / superneurons-release
View on GitHub
this is the release repository of superneurons
☆54Feb 13, 2021Updated 5 years ago
darchr / AutoTM
View on GitHub
Thinking is hard - automate it
☆18Aug 24, 2022Updated 3 years ago
LiuXiaoxuanPKU / Cost-Model-papers
View on GitHub
☆13Feb 22, 2023Updated 3 years ago
saareliad / FTPipe
View on GitHub
FTPipe and related pipeline model parallelism research.
☆44May 16, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Sys-KU / DeepPlan
View on GitHub
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆56Aug 6, 2025Updated 11 months ago
tbd-ai / tbd-tools
View on GitHub
☆12May 3, 2020Updated 6 years ago
pakmarkthub / dragon
View on GitHub
A host-based framework that transparently extends the GPU addressable global memory space beyond the host memory using NVM-backed data po…
☆63Sep 11, 2020Updated 5 years ago
mkuchnik / PlumberApp
View on GitHub
Repository to go along with the paper "Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines"
☆10Mar 31, 2022Updated 4 years ago
ruipeterpan / torch_profiler
View on GitHub
Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo
☆12Feb 12, 2023Updated 3 years ago
thustorage / PetPS
View on GitHub
PetPS: Supporting Huge Embedding Models with Tiered Memory
☆34May 21, 2024Updated 2 years ago
ParCoreLab / ComScribe
View on GitHub
ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.
☆28Jul 6, 2023Updated 3 years ago
TACC / FanStore
View on GitHub
☆20Jul 26, 2021Updated 5 years ago
netx-repo / PipeSwitch
View on GitHub
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127May 9, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
eth-easl / cachew
View on GitHub
ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).
☆41Sep 10, 2024Updated last year
BayesWatch / pytorch-blockswap
View on GitHub
Code for BlockSwap (ICLR 2020).
☆33Mar 25, 2021Updated 5 years ago
perone / pynvm
View on GitHub
Python bindings for the NVML. Non-volatile memory for Python.
☆12May 23, 2016Updated 10 years ago
llnl / direct-fuse
View on GitHub
☆18Mar 15, 2020Updated 6 years ago
msr-fiddle / harmony
View on GitHub
☆17Dec 9, 2022Updated 3 years ago
CDECatapult / ml-performance-prediction
View on GitHub
Code that accompanies the paper "Predicting the Computational Cost of Deep Learning Models"
☆21Dec 14, 2018Updated 7 years ago
mlsys-seo / ooo-backprop
View on GitHub
☆26Dec 5, 2022Updated 3 years ago
efeslab / siloz
View on GitHub
☆11Aug 23, 2023Updated 2 years ago
wangruinju / Double-Fusion
View on GitHub
A bayesian approach to examining default mode network functional connectivity and cognitive performance in major depressive disorder
☆13Aug 23, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hku-systems / SOTER
View on GitHub
☆30Oct 27, 2023Updated 2 years ago
Linestro / GRACE
View on GitHub
Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference
☆18Mar 5, 2023Updated 3 years ago
astra-sim / astra-network-analytical
View on GitHub
☆24Nov 12, 2025Updated 8 months ago
santoshgsk / awesome-ai-up-to-date
View on GitHub
A list of best resources covering broad topics including Python, Data Engineering, Data Analysis, Machine Learning, Deep Learning, RL
☆13Feb 26, 2020Updated 6 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
TonyTang2001 / SixFeetBetween_WWDC20SwiftChallenge
View on GitHub
WWDC 2020 Swift Student Challenge Submission "6 Feet Between" by Tony Tang
☆10Jun 17, 2020Updated 6 years ago
Blaok / soda
View on GitHub
Stencil with Optimized Dataflow Architecture
☆12Feb 27, 2024Updated 2 years ago
zendesk / jekyll-theme-zendesk-garden
View on GitHub
Jekyll theme based on Zendesk Garden
☆13Jan 15, 2026Updated 6 months ago
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pmem / pynvm
View on GitHub
Python bindings for the PMDK. Non-volatile memory for Python.
☆13Mar 22, 2023Updated 3 years ago
Nosayba / kpart
View on GitHub
A hybrid cache sharing-partitioning tool for systems with Intel CAT support.
☆30Mar 28, 2018Updated 8 years ago
SJTU-IPADS / disb
View on GitHub
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆58Aug 21, 2024Updated last year
MovePhilip / Webformer
View on GitHub
unofficial impelement of the webformer: The Web-page Transformer for Structure Information Extraction
☆13Apr 20, 2023Updated 3 years ago
zslwyuan / Hi-DMM
View on GitHub
Hi-DMM: High-Performance Dynamic Memory Management in HLS (High-Level Synthesis)
☆25Oct 30, 2018Updated 7 years ago
SymbioticLab / tensorflow-salus
View on GitHub
tensorflow fork with Salus integration
☆12Jan 7, 2022Updated 4 years ago
sara-nl / DDLBench
View on GitHub
Distributed Deep Learning Benchmark Suite
☆11Oct 31, 2022Updated 3 years ago