Amos optimizer with JEstimator lib.
☆83May 15, 2024Updated last year
Alternatives and similar repositories for jestimator
Users that are interested in jestimator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Oct 15, 2022Updated 3 years ago
- ☆16Dec 10, 2022Updated 3 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Jul 31, 2023Updated 2 years ago
- ☆13Updated this week
- c++ mosestokenizer☆18Mar 13, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆33Oct 16, 2023Updated 2 years ago
- bumble bee transformer☆14Apr 19, 2021Updated 5 years ago
- Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization☆12Nov 29, 2024Updated last year
- ☆22Jan 23, 2024Updated 2 years ago
- ☆16Jul 16, 2024Updated last year
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- Train very large language models in Jax.☆209Oct 21, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Very Simple and Basic Implementation of Compositional Pattern Producing Network in TensorFlow☆11Nov 27, 2019Updated 6 years ago
- ☆196Updated this week
- ☆22Nov 9, 2024Updated last year
- AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks☆29Oct 26, 2022Updated 3 years ago
- ☆32May 2, 2026Updated last week
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆123Mar 5, 2023Updated 3 years ago
- Accelerated replay buffers in JAX☆46Sep 17, 2022Updated 3 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25May 5, 2024Updated 2 years ago
- Github for the conference paper GLOD-Gaussian Likelihood OOD detector☆16Apr 18, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An implementation of shampoo☆79Mar 9, 2018Updated 8 years ago
- ☆18Aug 24, 2024Updated last year
- Peregrine is a rapid, append-only logging and note-taking app, inspired by @thesephist's Inc.☆23Aug 23, 2025Updated 8 months ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- diffGrad: An Optimization Method for Convolutional Neural Networks☆55Oct 12, 2022Updated 3 years ago
- A collection of matrix games in JAX☆13Apr 13, 2026Updated 3 weeks ago
- Example codes in the medium post titled "Optuna meets Weights and Biases."☆24Aug 11, 2022Updated 3 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Nov 12, 2020Updated 5 years ago
- PhD Publications and Thesis on LASSO Model Predictive Control☆20Jun 2, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An alternative to convolution in neural networks☆261Mar 28, 2024Updated 2 years ago
- SOTA model implementations in JAX/FLAX☆301Aug 28, 2024Updated last year
- Reimplementation of `Improving language models by retrieving from trillions of tokens`☆19Nov 16, 2022Updated 3 years ago
- Multiple dispatch over abstract array types in JAX.☆141May 2, 2026Updated last week
- A metrics library for the JAX ecosystem☆41Mar 16, 2023Updated 3 years ago
- ☆12Nov 10, 2023Updated 2 years ago
- Pytorch LSTM implementation powered by Libtorch☆18Dec 26, 2022Updated 3 years ago