BaguaSys / tutorials
Bagua tutorials.
☆12Updated 2 years ago
Alternatives and similar repositories for tutorials:
Users that are interested in tutorials are comparing it to the libraries listed below
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆64Updated 2 years ago
- TileFusion is a highly efficient kernel template library designed to elevate the level of abstraction in CUDA C for processing tiles.☆56Updated this week
- Memory Optimizations for Deep Learning (ICML 2023)☆62Updated 11 months ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆130Updated 2 years ago
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.☆89Updated this week
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆43Updated last week
- A Python library transfers PyTorch tensors between CPU and NVMe☆104Updated 2 months ago
- ☆59Updated 2 weeks ago
- Distributed preprocessing and data loading for language datasets☆39Updated 10 months ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆38Updated 11 months ago
- DLPack for Tensorflow☆36Updated 4 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 3 years ago
- ☆38Updated last year
- Home for OctoML PyTorch Profiler☆107Updated last year
- MLPerf™ logging library☆32Updated this week
- An IR for efficiently simulating distributed ML computation.☆27Updated last year
- Distributed DataLoader For Pytorch Based On Ray☆24Updated 3 years ago
- High performance NCCL plugin for Bagua.☆15Updated 3 years ago
- FlexFlow Serve: Low-Latency, High-Performance LLM Serving☆17Updated this week
- GPTQ inference TVM kernel☆38Updated 9 months ago
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…☆28Updated 3 years ago
- Distributed ML Optimizer☆30Updated 3 years ago
- ☆11Updated 3 years ago
- An Attention Superoptimizer☆21Updated last month
- ☆20Updated last year
- ☆44Updated last year
- A library for syntactically rewriting Python programs, pronounced (sinner).☆70Updated 2 years ago
- ☆22Updated 5 years ago
- Ahead of Time (AOT) Triton Math Library☆52Updated this week