zhisbug / Cavs
Cavs: An Efficient Runtime System for Dynamic Neural Networks
☆13 Updated 4 years ago
Related projects:
- An Attention Superoptimizer ☆19 Updated 4 months ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS ☆17 Updated 2 years ago
- An IR for efficiently simulating distributed ML computation. ☆24 Updated 8 months ago
- ☆23 Updated 8 months ago
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters. ☆33 Updated last year
- ☆45 Updated last year
- ☆14 Updated last year
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks. ☆17 Updated 4 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse. ☆24 Updated 4 years ago
- ☆72 Updated last year
- ☆20 Updated last year
- ☆7 Updated last year
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult… ☆34 Updated 6 months ago
- ☆19 Updated last year
- ☆14 Updated 2 years ago
- ☆13 Updated 2 years ago
- ☆14 Updated 3 months ago
- SOTA Learning-augmented Systems ☆32 Updated 2 years ago
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs. ☆55 Updated last year
- Thinking is hard - automate it ☆18 Updated 2 years ago
- ☆14 Updated 4 months ago
- ☆47 Updated last year
- Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines. ☆41 Updated 9 months ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef… ☆58 Updated last year
- FTPipe and related pipeline model parallelism research. ☆41 Updated last year
- ☆18 Updated 2 years ago
- ☆31 Updated last year
- ☆62 Updated 3 years ago
- Benchmark PyTorch Custom Operators ☆13 Updated last year
- An experimental parallel training platform ☆46 Updated 5 months ago