nanoGPT using Equinox
☆15Mar 3, 2023Updated 3 years ago
Alternatives and similar repositories for nanoGPT-equinox
Users that are interested in nanoGPT-equinox are comparing it to the libraries listed below
Sorting:
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Schedule free optimiser implemented in JAX using Optimistix☆15May 29, 2024Updated last year
- ☆18Aug 24, 2024Updated last year
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 9 months ago
- Scalable and Stable Parallelization of Nonlinear RNNS☆29Updated this week
- ☆23Jun 18, 2024Updated last year
- A port of muP to JAX/Haiku☆25Oct 23, 2022Updated 3 years ago
- Code for "Log Neural Controlled Differential Equations" (ICML 2024) and "Structured Linear CDEs" (NeurIPS 2025, Spotlight)☆30Jan 26, 2026Updated last month
- A collection of niche / personally useful PyTorch optimizers with modified code.☆27Oct 25, 2025Updated 4 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 9 months ago
- ☆15Dec 3, 2022Updated 3 years ago
- [ICLR 2025] SDTT: a simple and effective distillation method for discrete diffusion models☆47Feb 26, 2026Updated last week
- // clone this repo with --depth=1 to save disk size // toolchain compatible with Ubuntu 20.04+ //☆15Apr 28, 2022Updated 3 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- TensorFlow Lab for the BMVA Summer School☆13Jul 8, 2025Updated 8 months ago
- Econ5821 2026☆13Updated this week
- An implementation of Tiny Recursive Models (TRM)☆101Feb 16, 2026Updated 3 weeks ago
- ☆11Feb 24, 2026Updated 2 weeks ago
- An awesome list that curates the best Flet tools, tutorials, blogs and more.☆10Jan 8, 2023Updated 3 years ago
- A fine-mapping method integrating GWAS summary statistics and functional annotation data☆11Dec 28, 2023Updated 2 years ago
- Python package to process videos as in Hu and Ma (2024)☆20Sep 29, 2024Updated last year
- Computer Modern Mono proportional font☆10Jan 2, 2013Updated 13 years ago
- Enemies for your LLM☆35Jan 20, 2026Updated last month
- (READ ONLY MIRROR) The ProB Model Checker and Animator Plugin for Rodin☆19Feb 26, 2026Updated last week
- ☆16Jul 23, 2023Updated 2 years ago
- CVPR 2023: PAniC-3D, Vtubers dataset downloader☆13Apr 22, 2023Updated 2 years ago
- ☆40Jan 5, 2024Updated 2 years ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 4 months ago
- This is the numerical approach proposed in the paper "Optimal Incentives to Mitigate Epidemics: A Stackelberg Mean Field Game Approach" b…☆12Nov 22, 2021Updated 4 years ago
- ☆11May 29, 2025Updated 9 months ago
- Optimized primitives for collective multi-GPU communication☆10May 8, 2024Updated last year
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- Discord bot for twitter/twitcasting/twitch tracking...☆14Nov 8, 2025Updated 4 months ago
- ☆11Oct 11, 2023Updated 2 years ago
- ☆12Apr 26, 2024Updated last year
- ICLR 2023: Learning to Extrapolate: A Transductive Approach☆11Aug 15, 2023Updated 2 years ago
- This repository contains the code to generate results from the paper "Artificial Neural Networks to solve dynamic programming problems: a…☆10May 24, 2024Updated last year
- The MAFAT challenge, by the Israeli Department of Defense. Deep Learning based approach to classify radar signatures of humans and animal…☆10Nov 10, 2020Updated 5 years ago