minGPT in JAX
☆49Jan 10, 2022Updated 4 years ago
Alternatives and similar repositories for minGPT
Users that are interested in minGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An implementation of MuZero in JAX.☆58Nov 8, 2022Updated 3 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12May 24, 2021Updated 4 years ago
- A collection of meta-learning algorithms in Jax☆25Sep 3, 2022Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A dataloader, but for JAX☆20May 17, 2024Updated last year
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- ☆18Apr 17, 2026Updated 2 weeks ago
- Contrastive Language-Image Pretraining☆146Sep 6, 2022Updated 3 years ago
- GPT-jax based on the official huggingface library☆13Jun 22, 2021Updated 4 years ago
- A metrics library for the JAX ecosystem☆41Mar 16, 2023Updated 3 years ago
- This project is a implementation in PyTorch for ZO-AdaMU optimization: Adapting Perturbation with the Momentum and Uncertainty in Zeroth-…☆14Dec 12, 2023Updated 2 years ago
- ☆15Mar 15, 2021Updated 5 years ago
- A set of Python scripts that makes your experience on TPU better☆56Sep 18, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- ☆13Jan 16, 2025Updated last year
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆241May 12, 2023Updated 2 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- ☆62Mar 4, 2022Updated 4 years ago
- ☆28Nov 18, 2022Updated 3 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆47May 31, 2024Updated last year
- PyTorch interface for TrueGrad Optimizers☆43Aug 8, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch☆45Jul 21, 2022Updated 3 years ago
- ☆18Jul 10, 2022Updated 3 years ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 2 months ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆64Feb 15, 2023Updated 3 years ago
- This is the public repo for the course HMMA238 'Software Development'☆11Apr 20, 2021Updated 5 years ago
- ☆35Jul 5, 2023Updated 2 years ago
- Train very large language models in Jax.☆209Oct 21, 2023Updated 2 years ago
- ☆20Nov 19, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Script and models for clustering LAION-400m CLIP embeddings.☆26Jan 10, 2022Updated 4 years ago
- Easy Hypernetworks in Pytorch and Jax☆106Jan 27, 2023Updated 3 years ago
- Named Tensors for Legible Deep Learning in JAX☆219Nov 8, 2025Updated 5 months ago
- Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"☆63Feb 18, 2026Updated 2 months ago
- ☆13May 8, 2023Updated 2 years ago
- Minimal library to train LLMs on TPU in JAX with pjit().☆299Dec 20, 2023Updated 2 years ago
- ☆10Apr 8, 2021Updated 5 years ago