Amos optimizer with JEstimator lib.
☆82May 15, 2024Updated last year
Alternatives and similar repositories for jestimator
Users that are interested in jestimator are comparing it to the libraries listed below.
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 2 years ago
- ☆13Jan 15, 2025Updated last year
- c++ mosestokenizer☆18Mar 13, 2024Updated 2 years ago
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆33Oct 16, 2023Updated 2 years ago
- ☆27Apr 12, 2023Updated 2 years ago
- Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization☆12Nov 29, 2024Updated last year
- ☆21Jan 23, 2024Updated 2 years ago
- ☆16Jul 16, 2024Updated last year
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- Train very large language models in Jax.☆210Oct 21, 2023Updated 2 years ago
- ☆195Mar 10, 2026Updated 2 weeks ago
- ☆22Nov 9, 2024Updated last year
- ☆31Mar 9, 2026Updated 3 weeks ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆15Jun 3, 2020Updated 5 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25May 5, 2024Updated last year
- ☆13Jan 23, 2017Updated 9 years ago
- ☆18Aug 24, 2024Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- diffGrad: An Optimization Method for Convolutional Neural Networks☆54Oct 12, 2022Updated 3 years ago
- Example codes in the medium post titled "Optuna meets Weights and Biases."☆24Aug 11, 2022Updated 3 years ago
- ☆10Apr 3, 2024Updated last year
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Nov 12, 2020Updated 5 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Dec 16, 2022Updated 3 years ago
- Using Rainbow implementation in Chainer RL for Slime Volleyball Pixel Environment☆23Jun 8, 2020Updated 5 years ago
- PhD Publications and Thesis on LASSO Model Predictive Control☆20Jun 2, 2019Updated 6 years ago
- Reimplementation of `Improving language models by retrieving from trillions of tokens`☆19Nov 16, 2022Updated 3 years ago
- Train your own sub-1B foundation models JAX/GCP/TPUS in hours☆302Aug 28, 2024Updated last year
- Multiple dispatch over abstract array types in JAX.☆138Dec 15, 2025Updated 3 months ago
- A metrics library for the JAX ecosystem☆41Mar 16, 2023Updated 3 years ago
- Pytorch LSTM implementation powered by Libtorch☆18Dec 26, 2022Updated 3 years ago
- ☆12Nov 10, 2023Updated 2 years ago
- Equivariant Steerable CNNs Library for Pytorch https://quva-lab.github.io/escnn/☆32Jun 28, 2023Updated 2 years ago
- ☆11May 1, 2022Updated 3 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆218Apr 4, 2021Updated 4 years ago
- ☆17Mar 22, 2025Updated last year
- Transformers at any scale☆42Jan 18, 2024Updated 2 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆130Nov 12, 2022Updated 3 years ago