VoltaML / volta-treesLinks
☆22Updated 2 years ago
Alternatives and similar repositories for volta-trees
Users that are interested in volta-trees are comparing it to the libraries listed below
Sorting:
- ⚡VoltaML is a lightweight library to convert and run your ML/DL deep learning models in high performance inference runtimes like TensorRT…☆1,186Updated 2 years ago
- Lightning HPO & Training Studio App☆18Updated 2 years ago
- [ICLR 2022] "Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining" by Lu Miao*, Xiaolong Luo*, T…☆30Updated 3 years ago
- Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch☆184Updated 2 years ago
- Accelerate PyTorch models with ONNX Runtime☆362Updated 4 months ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated last year
- Running Stable Diffusion with Metaflow☆33Updated last month
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Updated 4 years ago
- Fast sparse deep learning on CPUs☆53Updated 2 years ago
- A GPT, made only of MLPs, in Jax☆58Updated 4 years ago
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.☆56Updated 3 years ago
- D-Adaptation for SGD, Adam and AdaGrad☆522Updated 5 months ago
- Torch Distributed Experimental☆116Updated 10 months ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- PyTorch interface for the IPU☆180Updated last year
- JAX implementation ViT-VQGAN☆59Updated 2 years ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆157Updated this week
- Training material for IPU users: tutorials, feature examples, simple applications☆86Updated 2 years ago
- Customized matrix multiplication kernels☆56Updated 3 years ago
- Implementation of Flash Attention in Jax☆213Updated last year
- Implementation of denoising diffusion models with schedules, improved sampling, and other extensions using Keras.☆119Updated last year
- This repository hosts code for converting the original Vision Transformer models (JAX) to TensorFlow.☆33Updated 3 years ago
- ML/DL Math and Method notes☆61Updated last year
- ☆39Updated 2 years ago
- TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.☆87Updated 3 years ago
- Distributed skorch on Ray Train☆57Updated 2 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆214Updated 2 years ago
- Text to Image Diffusion Models in Keras☆77Updated last year
- Playing around with stable diffusion. Generated images are reproducible because I save the metadata and latent information. You can gener…☆207Updated 2 years ago
- ☆89Updated 2 years ago