The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆38 · Dec 3, 2023 · Updated 2 years ago
Alternatives and similar repositories for nanoGPT-jax
Users that are interested in nanoGPT-jax are comparing it to the libraries listed below.
- Jax/Flax rewrite of Karpathy's nanoGPT ☆65 · Feb 15, 2023 · Updated 3 years ago
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents. ☆15 · Jan 3, 2023 · Updated 3 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning ☆16 · Apr 30, 2023 · Updated 3 years ago
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox. ☆25 · Sep 29, 2024 · Updated last year
- ☆13 · Jan 16, 2025 · Updated last year
- ☆19 · Apr 16, 2022 · Updated 4 years ago
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku ☆13 · Aug 14, 2024 · Updated last year
- Actor-Sharer-Learner training framework for off-policy DRL algorithms ☆22 · Dec 29, 2024 · Updated last year
- Adaptation of DQN, DDQN and COMA for multi-agent Gym environments ☆10 · Oct 3, 2023 · Updated 2 years ago
- ☆25 · Jan 2, 2019 · Updated 7 years ago
- ☆23 · Jun 8, 2021 · Updated 4 years ago
- Platform to run interactive Reinforcement Learning agents in a Minecraft Server ☆57 · Apr 21, 2026 · Updated last month
- Minimal but scalable implementation of large language models in JAX ☆34 · Nov 28, 2025 · Updated 6 months ago
- Heatmap text in Julia. ☆11 · Jul 3, 2025 · Updated 10 months ago
- ☆23 · Aug 19, 2022 · Updated 3 years ago
- Integrates Imbue's Cost Aware pareto-Region Bayesian Search (CARBS) with Weights and Biases (WanDB) ☆12 · Mar 17, 2025 · Updated last year
- Monitor web pages and get notified when a page has changed ☆12 · Dec 2, 2022 · Updated 3 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc... ☆16 · May 28, 2025 · Updated last year
- coloring terminal text with intensities (used for plotting probability, entropy with tokens) ☆12 · Oct 11, 2024 · Updated last year
- MLJ Interface for ScikitLearn.jl ☆13 · May 22, 2024 · Updated 2 years ago
- Train very large language models in Jax. ☆208 · Oct 21, 2023 · Updated 2 years ago
- ☆10 · Apr 5, 2024 · Updated 2 years ago
- Repo for materials for coordinating work on improving Julia's function documentation ☆10 · Jul 30, 2022 · Updated 3 years ago
- The first large scale formally verified reasoning dataset for Verilog ☆21 · May 16, 2025 · Updated last year
- ☆13 · Jan 16, 2019 · Updated 7 years ago
- text preprocessing library with framework for composable tokenizations ☆13 · Jun 15, 2024 · Updated last year
- [ 👾 ] ➡️ 💾 ➡️ { 🎮🕹️ } Extra Stable-Baselines3 buffer classes. Reducing RL memory usage drastically with minimal overhead. ☆23 · Dec 9, 2025 · Updated 5 months ago
- Benchmarks for the FluxML ecosystem for deep learning, scientific machine learning, differentiable programming etc including AD and CUDA … ☆15 · Jun 4, 2022 · Updated 3 years ago
- Simple single file implementations of Reinforcement Learning algorithms in Julia ☆23 · Feb 15, 2025 · Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆42 · Aug 27, 2022 · Updated 3 years ago
- An implementation of a Brownian motion using ClojureScript with re-frame and Highcharts ☆11 · Feb 8, 2019 · Updated 7 years ago
- Julia API for Ray ☆12 · Mar 6, 2026 · Updated 2 months ago
- A research project exploring fine-tuning BERT-style models for text generation ☆41 · Nov 30, 2025 · Updated 6 months ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings ☆10 · Feb 28, 2023 · Updated 3 years ago
- Minimal, lightweight JAX implementations of popular models. ☆235 · Mar 27, 2026 · Updated 2 months ago
- ☆26 · Jun 14, 2022 · Updated 3 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M … ☆44 · Dec 11, 2021 · Updated 4 years ago
- ☆22 · Apr 6, 2025 · Updated last year
- JAX implementation of the Llama 2 model ☆217 · Feb 2, 2024 · Updated 2 years ago