AllanYangZhou/midGPT

AllanYangZhou / midGPT

Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.

☆27

Alternatives and similar repositories for midGPT

Users who are interested in midGPT are comparing it to the repositories listed below.

young-geng / mintext
Minimal but scalable implementation of large language models in JAX
☆34 · Nov 28, 2025 · Updated 7 months ago
yixiaoer / einshard
Einsum-like high-level array sharding API for JAX
☆35 · Jul 16, 2024 · Updated 2 years ago
young-geng / tpu_pod_commander
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.
☆20 · Sep 24, 2025 · Updated 10 months ago
samblouir / birdie
☆15 · Jun 8, 2026 · Updated last month
okarthikb / state-space-models
☆27 · Jul 9, 2024 · Updated 2 years ago
kvfrans / jax-fid-parallel
Frechet inception distance (FID) evaluation in JAX
☆14 · May 28, 2024 · Updated 2 years ago
nestordemeure / jochastic
A JAX implementation of stochastic addition.
☆13 · Aug 15, 2022 · Updated 3 years ago
real-stanford / umi-arx
Minimal UMI deployment environment for ARX5 robot arm
☆23 · Feb 25, 2025 · Updated last year
justindomke / numbat
NumPy+Jax with named axes and an uncompromising attitude
☆23 · Mar 4, 2025 · Updated last year
j-towns / scanagram
Tidy autoregressive inference in JAX
☆15 · Sep 1, 2025 · Updated 10 months ago
hannahxchen / automatic-paraphrase-dataset-augmentation
Code and data for automatic paraphrase dataset augmentation.
☆11 · Mar 8, 2021 · Updated 5 years ago
UmerHA / triton_util
Make triton easier
☆49 · Jun 12, 2024 · Updated 2 years ago
cgarciae / nanoGPT-jax
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆38 · Dec 3, 2023 · Updated 2 years ago
bitwiseshiftleft / crandom
Fast, simple, cryptographically strong random numbers in C++. Experimental.
☆19 · Dec 12, 2013 · Updated 12 years ago
swairshah / Intensify
coloring terminal text with intensities (used for plotting probability, entropy with tokens)
☆12 · Oct 11, 2024 · Updated last year
mrtzh / Ladder.jl
A reliable leaderboard algorithm for machine learning competitions
☆17 · May 19, 2015 · Updated 11 years ago
google-deepmind / nanodo
☆304 · Jul 15, 2024 · Updated 2 years ago
thecharlieblake / lovely-llama
An implementation of the Llama architecture, to instruct and delight
☆21 · May 31, 2025 · Updated last year
epfl-labos / eagle
☆13 · Jan 16, 2019 · Updated 7 years ago
dlwh / jax_sourceror
Turn jitted jax functions back into python source code
☆23 · Dec 16, 2024 · Updated last year
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆208 · Oct 21, 2023 · Updated 2 years ago
araffin / datasaurust
Blazingly fast implementation of the Datasaurus paper. Same Stats, Different Graphs.
☆19 · Mar 22, 2026 · Updated 4 months ago
danqi / drqa-datasets
The QA datasets used for DrQA evaluation.
☆14 · Nov 30, 2018 · Updated 7 years ago
catid / dataloader
High-performance tokenized language data-loader for Python C++ extension
☆15 · Jul 22, 2024 · Updated 2 years ago
jax-ml / jax-llm-examples
Minimal yet performant LLM examples in pure JAX
☆269 · Jul 4, 2026 · Updated 3 weeks ago
ShannonAI / mrc-for-dependency-parsing
☆18 · May 28, 2021 · Updated 5 years ago
cgarciae / treeo
A small library for creating and manipulating custom JAX Pytree classes
☆56 · Feb 26, 2023 · Updated 3 years ago
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆217 · Feb 2, 2024 · Updated 2 years ago
marin-community / haliax
Named Tensors for Legible Deep Learning in JAX
☆227 · Nov 8, 2025 · Updated 8 months ago
lucidrains / hyena-dna
Fork of HyenaDNA, a long-range genomic foundation model built with Hyena
☆10 · Aug 14, 2023 · Updated 2 years ago
nebius / kvax
A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.
☆174 · Nov 11, 2025 · Updated 8 months ago
Zhiyuan-Zeng / RLVE
[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
☆226 · Apr 30, 2026 · Updated 2 months ago
NeilGirdhar / efax
Exponential families for JAX
☆77 · Updated this week
moldyn / FastPCA
Fast, parallelized implementation of Principal Component Analysis with constant memory consumption for large data sets.
☆12 · Feb 18, 2022 · Updated 4 years ago
catid / lllm
Latent Large Language Models
☆19 · Aug 24, 2024 · Updated last year
3outeille / GPTQ-for-RWKV
☆13 · Jun 3, 2023 · Updated 3 years ago
Deep-Learning-Profiling-Tools / triton-samples
☆14 · Mar 8, 2025 · Updated last year
jax-ml / australis
☆28 · Nov 18, 2022 · Updated 3 years ago
kenkenpa2126 / vanilla_transformer_from_scratch_with_JAX
☆10 · Dec 18, 2023 · Updated 2 years ago