IntelLabs / SLIDE_opt_iaLinks
β74Updated last year
Alternatives and similar repositories for SLIDE_opt_ia
Users that are interested in SLIDE_opt_ia are comparing it to the libraries listed below
Sorting:
- Nod.ai π¦ version of π» . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository β¦β106Updated 9 months ago
- PyTorch interface for the IPUβ181Updated 2 years ago
- benchmarking some transformer deploymentsβ26Updated 2 years ago
- Lightweight machine learning library based on OpenCL 1.2β75Updated 4 years ago
- π Pytorch code for the Nero optimiser.β20Updated 3 years ago
- A thin, highly portable toolkit for efficiently compiling dense loop-based computation.β148Updated 2 years ago
- Python Research Frameworkβ106Updated 2 years ago
- Customized matrix multiplication kernelsβ57Updated 3 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodesβ241Updated 2 years ago
- SLIDE (Sub-LInear Deep learning Engine) written in Goβ45Updated 5 years ago
- A GPT, made only of MLPs, in Jaxβ58Updated 4 years ago
- Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorchβ182Updated 2 years ago
- β40Updated 2 years ago
- A collection of optimizers, some arcane others well known, for Flax.β29Updated 4 years ago
- a lightweight transformer library for PyTorchβ72Updated 3 years ago
- Productionize machine learning predictions, with ONNX or withoutβ66Updated last year
- Deep learning for the Webβ38Updated 4 years ago
- PyTorch implementation of L2L execution algorithmβ108Updated 2 years ago
- Stride visualizationsβ38Updated 7 years ago
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compreβ¦β353Updated 4 months ago
- Massively Parallel and Asynchronous Architecture for Logic-based AIβ43Updated 2 years ago
- The official page of ROCm/PyTorch will contain information that is always confusing. On this page we will endeavor to describe accurate iβ¦β87Updated 4 years ago
- Alpha Zero equipped with Transformer with various novel techniques for speedup in tree searchβ27Updated 6 years ago
- Hacks for PyTorchβ19Updated 2 years ago
- Large dataset storage format for Pytorchβ45Updated 4 years ago
- tensor4 - pytorch to C++ convertor using lightweight templated tensor libraryβ28Updated 5 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'β38Updated 3 years ago
- Implementation of a Tensorflow XLA rematerialization passβ15Updated 5 years ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)β117Updated 3 years ago
- β52Updated last year