zdevito / custom_loaderLinks
☆13Updated 4 years ago
Alternatives and similar repositories for custom_loader
Users that are interested in custom_loader are comparing it to the libraries listed below
Sorting:
- A tensor-aware point-to-point communication primitive for machine learning☆283Updated last month
- A library for syntactically rewriting Python programs, pronounced (sinner).☆67Updated 3 years ago
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 5 years ago
- A tracing JIT compiler for PyTorch☆13Updated 4 years ago
- An IR for efficiently simulating distributed ML computation.☆32Updated 2 years ago
- An experimental ahead of time compiler for Relay.☆50Updated 5 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆137Updated 3 years ago
- PyTorch RFCs (experimental)☆138Updated 8 months ago
- Python bindings for UCX☆139Updated 4 months ago
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Updated 3 years ago
- Benchmarks to capture important workloads.☆32Updated last week
- MLIR-based partitioning system☆162Updated this week
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆61Updated 10 months ago
- ☆41Updated last year
- A tracing JIT for PyTorch☆17Updated 3 years ago
- gossip: Efficient Communication Primitives for Multi-GPU Systems☆62Updated 3 years ago
- Codebase associated with the PyTorch compiler tutorial☆47Updated 6 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆65Updated 3 years ago
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…☆29Updated 4 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆98Updated 4 years ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆107Updated last month
- TORCH_TRACE parser for PT2☆72Updated last week
- Convert nvprof profiles into about:tracing compatible JSON files☆73Updated 4 years ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆138Updated 2 years ago
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆36Updated last year
- Symbolic Expression and Statement Module for new DSLs☆205Updated 5 years ago
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆116Updated 2 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 7 years ago
- Tools and extensions for CUDA profiling☆67Updated 6 years ago