sayakpaul / keras-xla-benchmarks
Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.
☆37Updated 2 years ago
Alternatives and similar repositories for keras-xla-benchmarks
Users who are interested in keras-xla-benchmarks are comparing it to the libraries listed below.
- ☆75Updated 2 years ago
- ☆24Updated 3 years ago
- This repository hosts code for converting the original Vision Transformer models (JAX) to TensorFlow.☆33Updated 3 years ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- ML/DL Math and Method notes☆64Updated last year
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.☆42Updated 8 months ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Updated 3 years ago
- ☆59Updated last year
- A miniature AI training framework for PyTorch☆42Updated 9 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆123Updated last year
- ☆134Updated 2 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆82Updated 2 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.☆12Updated 2 years ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆39Updated last month
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆91Updated 2 years ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆119Updated last year
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆89Updated last year
- Lightning HPO & Training Studio App☆18Updated 2 years ago
- ☆51Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆68Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆188Updated 3 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker containers.☆20Updated 4 years ago
- Various transformers for FSDP research☆38Updated 2 years ago
- High-performance, asynchronous Python HTTP client library designed for faster file transfers using concurrency, semaphores, and fault-tol…☆58Updated 5 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Updated last year
- Quantization of LLMs and benchmarking.☆10Updated last year
- ☆31Updated 4 months ago
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.☆58Updated 3 years ago
- Automatically take good care of your preemptible TPUs☆37Updated 2 years ago