soumyadipghosh / eventgrad
Event-Triggered Communication in Parallel Machine Learning
☆25Updated 3 years ago
Alternatives and similar repositories for eventgrad:
Users that are interested in eventgrad are comparing it to the libraries listed below
- ☆18Updated 2 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆147Updated last year
- C++ API to log data in tensorboard format.☆77Updated last month
- Codebase associated with the PyTorch compiler tutorial☆44Updated 5 years ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 4 years ago
- A tracing JIT compiler for PyTorch☆12Updated 3 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆109Updated 7 months ago
- Automatically insert nvtx ranges to PyTorch models☆17Updated 3 years ago
- A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)☆92Updated 5 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆63Updated 2 years ago
- ☆25Updated last year
- ☆15Updated 3 months ago
- a CUDA implementation of a priority queue☆83Updated 4 years ago
- Demo dataset for libtorch☆55Updated 2 years ago
- ☆9Updated 3 months ago
- LLM training in simple, raw C/CUDA☆91Updated 8 months ago
- A fast tensor library for c++.☆11Updated 9 years ago
- Introduction to CUDA programming☆115Updated 7 years ago
- an environment based on XLA for deep learning compiler optimization research.☆23Updated last year
- Customized matrix multiplication kernels☆53Updated 2 years ago
- PyTorch RFCs (experimental)☆131Updated 4 months ago
- Deadline-based hyperparameter tuning on RayTune.☆31Updated 5 years ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆73Updated last year
- A library for syntactically rewriting Python programs, pronounced (sinner).☆70Updated 2 years ago
- Learning about CUDA by writing PTX code.☆31Updated 10 months ago
- MagmaDNN: a simple deep learning framework in c++☆48Updated 4 years ago
- ☆29Updated 3 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆62Updated 10 months ago
- PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021☆55Updated 3 years ago
- Automatic Differentiation C++ Library☆56Updated 4 years ago