IntelLabs / SLIDE_opt_iaLinks
β74Updated last year
Alternatives and similar repositories for SLIDE_opt_ia
Users that are interested in SLIDE_opt_ia are comparing it to the libraries listed below
Sorting:
- benchmarking some transformer deploymentsβ26Updated 2 years ago
- π Pytorch code for the Nero optimiser.β20Updated 2 years ago
- β39Updated 2 years ago
- Python Research Frameworkβ106Updated 2 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodesβ239Updated 2 years ago
- β68Updated last year
- [JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Championβ40Updated 4 years ago
- β18Updated 2 years ago
- Nod.ai π¦ version of π» . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository β¦β106Updated 5 months ago
- PyTorch implementation of L2L execution algorithmβ107Updated 2 years ago
- A collection of optimizers, some arcane others well known, for Flax.β29Updated 3 years ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)β115Updated 3 years ago
- Customized matrix multiplication kernelsβ56Updated 3 years ago
- A "gym" style toolkit for building lightweight NAS systems.β13Updated 3 years ago
- Automatically insert nvtx ranges to PyTorch modelsβ17Updated 4 years ago
- PyProf2: PyTorch Profiling toolβ82Updated 5 years ago
- Butterfly matrix multiplication in PyTorchβ169Updated last year
- A GPT, made only of MLPs, in Jaxβ58Updated 4 years ago
- DLPack for Tensorflowβ35Updated 5 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memoryβ132Updated 3 years ago
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)β¦β29Updated 3 years ago
- β471Updated 3 years ago
- nGraphβ’ Backend for ONNXβ42Updated 2 years ago
- Texture mapping with variational auto-encodersβ40Updated 3 years ago
- tensor4 - pytorch to C++ convertor using lightweight templated tensor libraryβ28Updated 5 years ago
- Implementation of a Tensorflow XLA rematerialization passβ15Updated 5 years ago
- SLIDE (Sub-LInear Deep learning Engine) written in Goβ44Updated 5 years ago
- Benchmarks to capture important workloads.β31Updated 5 months ago
- Torch Distributed Experimentalβ116Updated 10 months ago
- Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.β85Updated 4 years ago