Amos optimizer with JEstimator lib.
☆83May 15, 2024Updated 2 years ago
Alternatives and similar repositories for jestimator
Users that are interested in jestimator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 2 years ago
- ☆16Dec 10, 2022Updated 3 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Jul 31, 2023Updated 2 years ago
- ☆13May 4, 2026Updated 3 weeks ago
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆33Oct 16, 2023Updated 2 years ago
- bumble bee transformer☆14Apr 19, 2021Updated 5 years ago
- Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization☆12Nov 29, 2024Updated last year
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- Train very large language models in Jax.☆208Oct 21, 2023Updated 2 years ago
- Very Simple and Basic Implementation of Compositional Pattern Producing Network in TensorFlow☆11Nov 27, 2019Updated 6 years ago
- ☆196May 4, 2026Updated 3 weeks ago
- ☆22Nov 9, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks☆29Oct 26, 2022Updated 3 years ago
- ☆32May 2, 2026Updated 3 weeks ago
- Accelerated replay buffers in JAX☆46Sep 17, 2022Updated 3 years ago
- An implementation of shampoo☆79Mar 9, 2018Updated 8 years ago
- ☆13Jan 23, 2017Updated 9 years ago
- ☆18Aug 24, 2024Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- diffGrad: An Optimization Method for Convolutional Neural Networks☆55Oct 12, 2022Updated 3 years ago
- A collection of matrix games in JAX☆13Apr 13, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Example codes in the medium post titled "Optuna meets Weights and Biases."☆24Aug 11, 2022Updated 3 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Dec 16, 2022Updated 3 years ago
- PhD Publications and Thesis on LASSO Model Predictive Control☆20Jun 2, 2019Updated 6 years ago
- SOTA model implementations in JAX/FLAX☆302Aug 28, 2024Updated last year
- Reimplementation of `Improving language models by retrieving from trillions of tokens`☆19Nov 16, 2022Updated 3 years ago
- Multiple dispatch over abstract array types in JAX.☆141May 19, 2026Updated last week
- A metrics library for the JAX ecosystem☆41Mar 16, 2023Updated 3 years ago
- Pytorch LSTM implementation powered by Libtorch☆18Dec 26, 2022Updated 3 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆218Apr 4, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This tool displays tflite signatures and rewrites the input/output OP name to the name of the signature. There is no need to install Tens…☆14Dec 13, 2023Updated 2 years ago
- Reproduction of "Scyclone" with PyTorch☆16Jan 6, 2021Updated 5 years ago
- Transformers at any scale☆42Jan 18, 2024Updated 2 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆130Nov 12, 2022Updated 3 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆25Apr 15, 2023Updated 3 years ago
- ☆50Oct 22, 2020Updated 5 years ago
- Implicit Differentiable Optimal Control (IDOC) with JAX☆12May 11, 2022Updated 4 years ago