linnanwang/superneurons-release

linnanwang / superneurons-release

This is the release repository of superneurons.

☆54

Alternatives and similar repositories for superneurons-release

Users who are interested in superneurons-release are comparing it to the libraries listed below.

shriramsb / vDNN
☆22 · Nov 7, 2018 · Updated 7 years ago
shriramsb / vdnn-plus-plus
Implementation of vDNN++; an improvement over vDNN
☆18 · Dec 7, 2018 · Updated 7 years ago
czkkkkkk / gccl
☆13 · Jan 23, 2021 · Updated 5 years ago
SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆149 · Jul 28, 2025 · Updated last year
tbd-ai / tbd-suite
☆47 · Dec 16, 2022 · Updated 3 years ago
uwsampl / dtr-prototype
Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616
☆133 · Jul 6, 2023 · Updated 3 years ago
alibaba / GPU-scheduler-for-deep-learning
GPU-scheduler-for-deep-learning
☆213 · Nov 5, 2020 · Updated 5 years ago
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127 · May 9, 2022 · Updated 4 years ago
cmikeh2 / grnn
☆13 · Jun 20, 2019 · Updated 7 years ago
S-Lab-System-Group / ChronusArtifact
☆23 · Jan 7, 2022 · Updated 4 years ago
SNU-ARC / flashneuron
☆41 · Nov 28, 2022 · Updated 3 years ago
BoyuanFeng / APNN-TC
☆20 · Aug 26, 2021 · Updated 4 years ago
octoml / synr
A library for syntactically rewriting Python programs, pronounced (sinner).
☆66 · Feb 22, 2022 · Updated 4 years ago
c3sr / tcu_scope
☆50 · Jun 27, 2019 · Updated 7 years ago
zhisbug / Cavs
Cavs: An Efficient Runtime System for Dynamic Neural Networks
☆15 · Sep 18, 2020 · Updated 5 years ago
tbd-ai / tbd-tools
☆12 · May 3, 2020 · Updated 6 years ago
wzsh / wmma_tensorcore_sample
Matrix Multiply-Accumulate with CUDA and WMMA (Tensor Core)
☆147 · Aug 18, 2020 · Updated 5 years ago
NVIDIA / cnmem
A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory
☆298 · Nov 28, 2018 · Updated 7 years ago
xiezhq-hermann / graphiler
Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…
☆59 · Oct 3, 2022 · Updated 3 years ago
CGCL-codes / Frog
Frog is Asynchronous Graph Processing on GPU with Hybrid Coloring Model. The fundamental idea is based on Pareto principle (or 80-20 rule…
☆36 · May 29, 2021 · Updated 5 years ago
gsampler9 / gSampler
☆29 · Aug 14, 2024 · Updated last year
netx-repo / RackSched
RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers
☆24 · Oct 5, 2020 · Updated 5 years ago
pakmarkthub / dragon
A host-based framework that transparently extends the GPU addressable global memory space beyond the host memory using NVM-backed data po…
☆63 · Sep 11, 2020 · Updated 5 years ago
lsds / KungFu
Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.
☆295 · Feb 23, 2024 · Updated 2 years ago
alibaba / llm-scheduling-artifact
Artifact of OSDI '24 paper, “Llumnix: Dynamic Scheduling for Large Language Model Serving”
☆64 · Jun 5, 2024 · Updated 2 years ago
darchr / AutoTM
Thinking is hard - automate it
☆18 · Aug 24, 2022 · Updated 3 years ago
pku-liang / FlexTensor
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
☆184 · Apr 25, 2022 · Updated 4 years ago
spypaul / MQSim_CXL_Linux
☆31 · May 31, 2023 · Updated 3 years ago
iHeartGraph / Enterprise
Enterprise: Breadth-First Graph Traversal on GPUs. SC'15.
☆33 · May 20, 2017 · Updated 9 years ago
msr-fiddle / philly-traces
☆198 · Aug 31, 2019 · Updated 6 years ago
illinois-impact / klap
A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches
☆15 · Jun 21, 2019 · Updated 7 years ago
cuihenggang / geeps
GPU-specialized parameter server for GPU machine learning.
☆102 · Apr 5, 2018 · Updated 8 years ago
linnanwang / BLASX
a heterogeneous multiGPU level-3 BLAS library
☆46 · Dec 9, 2019 · Updated 6 years ago
google-research / sputnik
A library of GPU kernels for sparse matrix operations.
☆289 · Nov 24, 2020 · Updated 5 years ago
casys-kaist / HUVM
☆27 · Aug 19, 2022 · Updated 3 years ago
msr-fiddle / pipedream
☆394 · Nov 4, 2022 · Updated 3 years ago
MITIBMxGraph / SALIENT
The official SALIENT system described in the paper "Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and P…
☆40 · Jun 28, 2023 · Updated 3 years ago
nnzhaocs / DupHunter
☆16 · May 4, 2021 · Updated 5 years ago
saurabhkadekodi / geriatrix
A simple and reproducible, profile-driven file system aging suite.
☆27 · Jul 12, 2018 · Updated 8 years ago