nod-ai / SRT
Nod.ai 🦈 version of 👻. You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository for mainline development. This repository houses branches and configuration that aren't ready for commit upstream.
☆106 Updated last week
Alternatives and similar repositories for SRT
Users who are interested in SRT are comparing it to the libraries listed below:
- benchmarking some transformer deployments ☆26 Updated this week
- A tracing JIT compiler for PyTorch ☆13 Updated 3 years ago
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1 ☆138 Updated 3 years ago
- PyTorch interface for the IPU ☆181 Updated 2 years ago
- Customized matrix multiplication kernels ☆57 Updated 3 years ago
- ☆13 Updated 4 years ago
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i… ☆182 Updated 3 months ago
- A thin, highly portable toolkit for efficiently compiling dense loop-based computation. ☆149 Updated 2 years ago
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre… ☆362 Updated last week
- ☆54 Updated last year
- Benchmarks to capture important workloads. ☆31 Updated 10 months ago
- PyTorch RFCs (experimental) ☆136 Updated 6 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆312 Updated this week
- Home for OctoML PyTorch Profiler ☆114 Updated 2 years ago
- The Foundation for All Legate Libraries ☆232 Updated last week
- Interactive performance profiling and debugging tool for PyTorch neural networks. ☆64 Updated 10 months ago
- Productionize machine learning predictions, with ONNX or without ☆66 Updated last year
- A tensor-aware point-to-point communication primitive for machine learning ☆274 Updated 3 weeks ago
- ☆74 Updated 2 years ago
- A library for syntactically rewriting Python programs, pronounced (sinner). ☆68 Updated 3 years ago
- Example code and applications for machine learning on Graphcore IPUs ☆332 Updated last year
- Notes and artifacts from the ONNX steering committee ☆27 Updated last week
- Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers ☆153 Updated 11 months ago
- Torch Distributed Experimental ☆117 Updated last year
- Training neural networks in TensorFlow 2.0 with 5x less memory ☆137 Updated 3 years ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆161 Updated 2 months ago
- An Aspiring Drop-In Replacement for Pandas at Scale ☆74 Updated 4 years ago
- Fast sparse deep learning on CPUs ☆56 Updated 3 years ago
- ☆21 Updated 8 months ago
- Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024 ☆184 Updated last year