sayakpaul / keras-xla-benchmarksLinks
Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.
☆37Updated last year
Alternatives and similar repositories for keras-xla-benchmarks
Users that are interested in keras-xla-benchmarks are comparing it to the libraries listed below
Sorting:
- ☆74Updated 2 years ago
- Contains materials for my talk "You don't know TensorFlow".☆9Updated 2 years ago
- Quantization of LLMs and benchmarking.☆10Updated last year
- ☆24Updated 2 years ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Updated 2 years ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.☆42Updated 5 months ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.☆12Updated 2 years ago
- ☆51Updated last year
- ☆16Updated 2 years ago
- A miniture AI training framework for PyTorch☆41Updated 6 months ago
- ☆133Updated last year
- This repository hosts code for converting the original Vision Transformer models (JAX) to TensorFlow.☆33Updated 3 years ago
- ML/DL Math and Method notes☆62Updated last year
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- Various transformers for FSDP research☆37Updated 2 years ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆75Updated last year
- ☆59Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆86Updated last year
- ☆81Updated last year
- Automatically take good care of your preemptible TPUs☆36Updated 2 years ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 2 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆35Updated 9 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆66Updated 10 months ago
- ☆15Updated 3 years ago
- Triton Implementation of HyperAttention Algorithm☆48Updated last year
- Implementation of DreamBooth in KerasCV and TensorFlow.☆88Updated 2 years ago
- ☆48Updated 9 months ago