divyamakkar0 / JAXformerView external linksLinks
A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.
☆115Dec 29, 2025Updated last month
Alternatives and similar repositories for JAXformer
Users that are interested in JAXformer are comparing it to the libraries listed below
Sorting:
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated last month
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 6 months ago
- ☆563Jul 11, 2024Updated last year
- A simple molecular dynamics code in python☆15Nov 14, 2025Updated 2 months ago
- llms can learn their own context compression via RL☆41Nov 26, 2025Updated 2 months ago
- Open Source Mycetoma's First Series of Molecules☆10Sep 22, 2025Updated 4 months ago
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL☆28Jul 14, 2025Updated 7 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆198Jun 1, 2025Updated 8 months ago
- A n body simulation of our solar system completed in python☆11Dec 6, 2021Updated 4 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆74Aug 18, 2024Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 2 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆129Jun 24, 2025Updated 7 months ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆22Nov 13, 2025Updated 3 months ago
- The docs about the crystaline coin are collected here!☆10May 12, 2025Updated 9 months ago
- Use Rust in React Native through WebAssembly☆11Jan 7, 2023Updated 3 years ago
- ☆12Jul 8, 2024Updated last year
- This project compares the performance of Swin-Transformer v2 implemented in JAX and PyTorch.☆12Jun 8, 2022Updated 3 years ago
- Community Eventing and Scripting examples☆18Aug 11, 2025Updated 6 months ago
- Tiny AI model embedded in NES ROMs to generate character names in-game.☆26Sep 28, 2025Updated 4 months ago
- ☆92Jul 5, 2024Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Aug 6, 2024Updated last year
- ☆13Nov 5, 2024Updated last year
- ☆291Jul 15, 2024Updated last year
- ☆24Jan 29, 2026Updated 2 weeks ago
- React Map Component for Searchkit☆11Apr 27, 2018Updated 7 years ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- A visualization experience of AI/ML academic papers hosted on ArXiV - for project work at the University of California, Berkeley MIDS pro…☆10Feb 10, 2023Updated 3 years ago
- Taxi fare prediction using tensorflow probability☆15Jul 23, 2019Updated 6 years ago
- A Kubernetes operator for managing Prefect servers and work pools☆17Feb 2, 2026Updated last week
- ☆18Mar 2, 2025Updated 11 months ago
- ☆11Oct 11, 2023Updated 2 years ago
- Visual autonomous workflows for Claude Code and Codex.☆23Dec 10, 2025Updated 2 months ago
- Let GPT-4 run your Minecraft server!☆10Apr 15, 2023Updated 2 years ago
- Use hodge decomposition for trajectory analysis☆23Nov 12, 2025Updated 3 months ago
- Cross Runtime system service installer.☆13Nov 21, 2024Updated last year
- The official baseline implementations for Chronocept☆10Dec 21, 2025Updated last month
- A rake task to interactive create a GraphQL Schema for Rails☆11Nov 2, 2016Updated 9 years ago
- ☆28Dec 15, 2025Updated last month
- ☆11Jan 28, 2024Updated 2 years ago