sayakpaul / keras-xla-benchmarks
Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.
☆37Updated last year
Alternatives and similar repositories for keras-xla-benchmarks:
Users who are interested in keras-xla-benchmarks are comparing it to the repositories listed below
- Contains materials for my talk "You don't know TensorFlow".☆9Updated 2 years ago
- ☆16Updated 2 years ago
- ☆73Updated 2 years ago
- Various transformers for FSDP research☆37Updated 2 years ago
- ML/DL Math and Method notes☆60Updated last year
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.☆12Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆31Updated 6 months ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- ☆24Updated 2 years ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Updated 2 years ago
- ☆58Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- Utilities for PyTorch distributed☆24Updated 2 months ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆82Updated last year
- Notebooks for fine-tuning PaliGemma☆100Updated 3 weeks ago
- This repository hosts code for converting the original Vision Transformer models (JAX) to TensorFlow.☆33Updated 3 years ago
- Template repo for Python projects, especially those focusing on machine learning and/or deep learning.☆15Updated 2 weeks ago
- Triton Implementation of HyperAttention Algorithm☆47Updated last year
- Notebooks to demonstrate TimmWrapper☆16Updated 3 months ago
- A miniature AI training framework for PyTorch☆42Updated 3 months ago
- Quantization of LLMs and benchmarking.☆10Updated last year
- A library that includes Keras3 layers, blocks and models with pretrained weights, providing support for transfer learning, feature extrac…☆45Updated 4 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 6 months ago
- Collection of autoregressive model implementations☆85Updated last week
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆118Updated 6 months ago
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.☆40Updated 2 months ago
- This is a port of the Mistral-7B model to JAX☆32Updated 10 months ago
- ☆27Updated 9 months ago
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago