microsoft / dist-ir

An IR for efficiently simulating distributed ML computation.

☆28

Alternatives and similar repositories for dist-ir

Users who are interested in dist-ir are comparing it to the libraries listed below.

awslabs / lorien
☆43 Updated last year
chips-compilers-mlsys-21 / chips-compilers-mlsys-21.github.io
☆11 Updated 4 years ago
awslabs / ratex
☆23 Updated 7 months ago
spcl / mlir-dace
Data-Centric MLIR dialect
☆42 Updated last year
zhisbug / Cavs
Cavs: An Efficient Runtime System for Dynamic Neural Networks
☆14 Updated 4 years ago
iree-org / iree-llvm-sandbox
A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM
☆58 Updated 3 months ago
sjtu-epcc / Tacker
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆27 Updated 4 months ago
Lin-Mao / DrGPUM
A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.
☆25 Updated 8 months ago
Funatiq / gossip
gossip: Efficient Communication Primitives for Multi-GPU Systems
☆59 Updated 2 years ago
openxla / shardy
MLIR-based partitioning system
☆97 Updated this week
polymage-labs / mlirx
MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com
☆38 Updated last year
nox-410 / tvm.tl
An extension of TVMScript for writing simple, high-performance GPU kernels with Tensor Cores.
☆50 Updated 11 months ago
octoml / synr
A library for syntactically rewriting Python programs, pronounced (sinner).
☆69 Updated 3 years ago
parasailteam / coconet
☆79 Updated 2 years ago
GVProf / GVProf
GVProf: A Value Profiler for GPU-based Clusters
☆50 Updated last year
Oneflow-Inc / dfccl
☆26 Updated 4 months ago
tlc-pack / tlcpack
☆24 Updated last year
makslevental / nelli
A lightweight, Pythonic frontend for MLIR
☆81 Updated last year
microsoft / torchy
A tracing JIT for PyTorch
☆17 Updated 2 years ago
cmu-catalyst / collage
System for automated integration of deep learning backends.
☆47 Updated 2 years ago
thu-pacman / PET
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆121 Updated 3 years ago
iree-org / iree-experimental
Experiments and prototypes associated with IREE or MLIR
☆50 Updated 10 months ago
andidr / teckyl
An MLIR frontend for tensor expressions
☆25 Updated 4 years ago
microsoft / TileFusion
TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.
☆90 Updated 3 weeks ago
openucx / torch-ucc
PyTorch UCC plugin
☆22 Updated 3 years ago
buaa-hipo / dlcompiler-comparison
A quantitative performance comparison of DL compilers on CNN models.
☆74 Updated 4 years ago
uwsampl / relay-aot
An experimental ahead-of-time compiler for Relay.
☆50 Updated 5 years ago
apuaaChen / EVT_AE
Artifacts of EVT ASPLOS'24
☆26 Updated last year
geoffxy / habitat
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆62 Updated 2 years ago
saareliad / FTPipe
FTPipe and related pipeline model parallelism research.
☆41 Updated 2 years ago