Amos optimizer with JEstimator lib.
☆83May 15, 2024Updated 2 years ago
Alternatives and similar repositories for jestimator
Users that are interested in jestimator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 3 years ago
- ☆16Dec 10, 2022Updated 3 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Jul 31, 2023Updated 2 years ago
- ☆13May 4, 2026Updated last month
- c++ mosestokenizer☆18Mar 13, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 3 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- bumble bee transformer☆14Apr 19, 2021Updated 5 years ago
- Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization☆12Nov 29, 2024Updated last year
- ☆22Jan 23, 2024Updated 2 years ago
- ☆16Jul 16, 2024Updated last year
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- Train very large language models in Jax.☆208Oct 21, 2023Updated 2 years ago
- Very Simple and Basic Implementation of Compositional Pattern Producing Network in TensorFlow☆11Nov 27, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆197Jun 11, 2026Updated last week
- ☆22Nov 9, 2024Updated last year
- ☆35May 2, 2026Updated last month
- AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks☆30Oct 26, 2022Updated 3 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆122Mar 5, 2023Updated 3 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆15Jun 3, 2020Updated 6 years ago
- Accelerated replay buffers in JAX☆46Sep 17, 2022Updated 3 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25May 5, 2024Updated 2 years ago
- Github for the conference paper GLOD-Gaussian Likelihood OOD detector☆16Apr 18, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Jan 23, 2017Updated 9 years ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- diffGrad: An Optimization Method for Convolutional Neural Networks☆55Oct 12, 2022Updated 3 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Nov 12, 2020Updated 5 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Dec 16, 2022Updated 3 years ago
- Using Rainbow implementation in Chainer RL for Slime Volleyball Pixel Environment☆23Jun 8, 2020Updated 6 years ago
- ☆16Mar 24, 2023Updated 3 years ago
- An alternative to convolution in neural networks☆264Mar 28, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- JAX library for training sub-4B foundation models for edge☆302Aug 28, 2024Updated last year
- Reimplementation of `Improving language models by retrieving from trillions of tokens`☆19Nov 16, 2022Updated 3 years ago
- ☆18Aug 27, 2023Updated 2 years ago
- A metrics library for the JAX ecosystem☆41Mar 16, 2023Updated 3 years ago
- Pytorch LSTM implementation powered by Libtorch☆18Dec 26, 2022Updated 3 years ago
- ☆12Nov 10, 2023Updated 2 years ago
- Equivariant Steerable CNNs Library for Pytorch https://quva-lab.github.io/escnn/☆33Jun 28, 2023Updated 2 years ago