A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.
☆118Dec 29, 2025Updated 2 months ago
Alternatives and similar repositories for JAXformer
Users that are interested in JAXformer are comparing it to the libraries listed below
Sorting:
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 2 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 7 months ago
- ☆566Jul 11, 2024Updated last year
- A simple molecular dynamics code in python☆15Nov 14, 2025Updated 3 months ago
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated 9 months ago
- Open Source Mycetoma's First Series of Molecules☆10Sep 22, 2025Updated 5 months ago
- Hackable AlphaFold 3 inference pipeline.☆35Jun 18, 2025Updated 8 months ago
- From the fundamentals of diffusion to flow matching in pi0☆51Jan 13, 2025Updated last year
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL☆28Jul 14, 2025Updated 7 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆201Jun 1, 2025Updated 9 months ago
- A n body simulation of our solar system completed in python☆11Dec 6, 2021Updated 4 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆74Aug 18, 2024Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 3 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆129Jun 24, 2025Updated 8 months ago
- Metal Activity Heuristic of Metalloprotein and Enzymatic Sites (MAHOMES) - Predicts if a protein bound metal ion is enzymatic or non-enzy…☆11Apr 19, 2022Updated 3 years ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37May 18, 2025Updated 9 months ago
- Program to plot a Ramachandran plot of all dihedral angles from a given PDB file. Background is empirically generated from the peptides …☆12Feb 25, 2025Updated last year
- ☆83Apr 16, 2024Updated last year
- Calculate allowed interactions in QED☆10Nov 2, 2022Updated 3 years ago
- ☆10Sep 9, 2023Updated 2 years ago
- Research sources on graph-based anomaly detection☆13Nov 29, 2022Updated 3 years ago
- ☆12Jul 8, 2024Updated last year
- ☆93Jul 5, 2024Updated last year
- ☆47Feb 26, 2026Updated last week
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Feb 13, 2026Updated 3 weeks ago
- ☆292Jul 15, 2024Updated last year
- Minimal yet performant LLM examples in pure JAX☆240Jan 14, 2026Updated last month
- seqax = sequence modeling + JAX☆187Jul 23, 2025Updated 7 months ago
- A rake task to interactive create a GraphQL Schema for Rails☆11Nov 2, 2016Updated 9 years ago
- Taxi fare prediction using tensorflow probability☆15Jul 23, 2019Updated 6 years ago
- ☆10Oct 19, 2020Updated 5 years ago
- NMT based SMILES to IUPAC Translator☆16Jul 16, 2025Updated 7 months ago
- VeighNa框架的LevelDB数据库接口☆13Apr 23, 2023Updated 2 years ago
- ☆18Jul 3, 2025Updated 8 months ago
- ☆12Mar 8, 2022Updated 4 years ago
- ☆12May 12, 2023Updated 2 years ago
- ☆11Jul 21, 2024Updated last year
- Tiny AI model embedded in NES ROMs to generate character names in-game.☆29Sep 28, 2025Updated 5 months ago
- ☆25Updated this week