minGPT in JAX
☆48 · Jan 10, 2022 · Updated 4 years ago
Alternatives and similar repositories for minGPT
Users that are interested in minGPT are comparing it to the libraries listed below.
- An implementation of MuZero in JAX. ☆57 · Nov 8, 2022 · Updated 3 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation ☆12 · May 24, 2021 · Updated 4 years ago
- A collection of meta-learning algorithms in Jax ☆24 · Sep 3, 2022 · Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 · Aug 12, 2023 · Updated 2 years ago
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku ☆13 · Aug 14, 2024 · Updated last year
- A JAX implementation of stochastic addition. ☆14 · Aug 15, 2022 · Updated 3 years ago
- A dataloader, but for JAX ☆20 · May 17, 2024 · Updated last year
- ☆18 · Mar 18, 2026 · Updated last week
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆37 · Dec 3, 2023 · Updated 2 years ago
- Contrastive Language-Image Pretraining ☆144 · Sep 6, 2022 · Updated 3 years ago
- GPT-jax based on the official huggingface library ☆13 · Jun 22, 2021 · Updated 4 years ago
- A metrics library for the JAX ecosystem ☆41 · Mar 16, 2023 · Updated 3 years ago
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024) ☆19 · Oct 13, 2024 · Updated last year
- ☆15 · Mar 15, 2021 · Updated 5 years ago
- Hypercube Viewer is a program that draws a hypercube of 3 to 10 dimensions. ☆13 · Mar 30, 2025 · Updated 11 months ago
- Code for "What really matters in matrix-whitening optimizers?" ☆23 · Oct 31, 2025 · Updated 4 months ago
- A set of Python scripts that makes your experience on TPU better ☆56 · Sep 18, 2025 · Updated 6 months ago
- CuratorNet: Visually-aware Recommendation of Art Images ☆13 · Dec 14, 2021 · Updated 4 years ago
- JMP is a Mixed Precision library for JAX. ☆212 · Jan 30, 2025 · Updated last year
- Clockwork VAEs in JAX/Flax ☆32 · Jul 16, 2021 · Updated 4 years ago
- ☆13 · Jan 16, 2025 · Updated last year
- ☆53 · Jan 18, 2024 · Updated 2 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes ☆241 · May 12, 2023 · Updated 2 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019 ☆10 · Jan 25, 2024 · Updated 2 years ago
- Minimal open-source implementation of AlphaProof and HyperTree Proof Search. ☆77 · Mar 9, 2026 · Updated 2 weeks ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX. ☆21 · Sep 18, 2020 · Updated 5 years ago
- ☆63 · Mar 4, 2022 · Updated 4 years ago
- sigma-MoE layer ☆21 · Jan 5, 2024 · Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆23 · Apr 19, 2024 · Updated last year
- ☆18 · Jul 10, 2022 · Updated 3 years ago
- Jax/Flax rewrite of Karpathy's nanoGPT ☆64 · Feb 15, 2023 · Updated 3 years ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes. ☆22 · Feb 19, 2026 · Updated last month
- This is the public repo for the course HMMA238 'Software Development' ☆11 · Apr 20, 2021 · Updated 4 years ago
- ☆17 · Jun 11, 2025 · Updated 9 months ago
- ☆35 · Jul 5, 2023 · Updated 2 years ago
- Train very large language models in Jax. ☆210 · Oct 21, 2023 · Updated 2 years ago
- ☆21 · Nov 19, 2025 · Updated 4 months ago
- Named Tensors for Legible Deep Learning in JAX ☆217 · Nov 8, 2025 · Updated 4 months ago
- Easy Hypernetworks in Pytorch and Jax ☆106 · Jan 27, 2023 · Updated 3 years ago