Amos optimizer with JEstimator lib.
☆82May 15, 2024Updated last year
Alternatives and similar repositories for jestimator
Users that are interested in jestimator are comparing it to the libraries listed below
Sorting:
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 2 years ago
- ☆10Apr 3, 2024Updated last year
- Implicit Differentiable Optimal Control (IDOC) with JAX☆12May 11, 2022Updated 3 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆15Jun 3, 2020Updated 5 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- ☆27Apr 12, 2023Updated 2 years ago
- ☆16Dec 10, 2022Updated 3 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Jul 31, 2023Updated 2 years ago
- ☆13Jan 15, 2025Updated last year
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆25Apr 15, 2023Updated 2 years ago
- bumble bee transformer☆14Apr 19, 2021Updated 4 years ago
- ☆31Updated this week
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- A Learnable LSH Framework for Efficient NN Training☆34Jul 22, 2021Updated 4 years ago
- Estimate resources needed to train LLMs☆14Feb 10, 2026Updated last month
- ☆15Dec 3, 2024Updated last year
- ☆193Feb 27, 2026Updated last week
- ☆21Jan 23, 2024Updated 2 years ago
- Graph Learning with JAX☆14Jul 11, 2022Updated 3 years ago
- A collection of matrix games in JAX☆13Nov 28, 2024Updated last year
- Create synthetic datasets from scratch using AI-powered generation. Define topics, customize prompts, and generate high-quality reasoning…☆29Mar 1, 2026Updated last week
- Building a Social Network with SwiftUI☆15May 27, 2020Updated 5 years ago
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- Train very large language models in Jax.☆210Oct 21, 2023Updated 2 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆42Jun 6, 2024Updated last year
- [ICML 2024] Codes for C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models☆18Jun 4, 2024Updated last year
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Dec 16, 2022Updated 3 years ago
- Materials for the BSc course "Analysis, Design, and Software Architecture" at IT University of Copenhagen, fall 2022☆15Dec 13, 2022Updated 3 years ago
- ☆18Feb 7, 2021Updated 5 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Feb 7, 2023Updated 3 years ago
- Reimplementation of `Improving language models by retrieving from trillions of tokens`☆19Nov 16, 2022Updated 3 years ago
- The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".☆17Jun 20, 2024Updated last year
- ☆19Apr 22, 2024Updated last year
- Accelerated replay buffers in JAX☆46Sep 17, 2022Updated 3 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Oct 6, 2021Updated 4 years ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆21May 18, 2025Updated 9 months ago
- Official Pytorch implementation of "Neural Optimal Transport with General Cost Functionals" (ICLR 2024)☆24Aug 29, 2024Updated last year
- Multiple dispatch over abstract array types in JAX.☆137Dec 15, 2025Updated 2 months ago
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 9 months ago