Implementing scalable LLMs in pure JAX (no third-party libraries)
☆49Apr 23, 2026Updated this week
Alternatives and similar repositories for nanoGPTJAX
Users that are interested in nanoGPTJAX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a port of Mistral-7B model in JAX☆33Jul 1, 2024Updated last year
- ☆18Nov 10, 2023Updated 2 years ago
- ☆55Apr 22, 2026Updated last week
- ☆12Jan 11, 2018Updated 8 years ago
- Jax implementation of x-LSTM: Extended Long Short-Term Memory by Beck et al. (2024)☆17Aug 6, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Topology library for Coq☆12Dec 24, 2015Updated 10 years ago
- ☆13Jul 12, 2024Updated last year
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- ☆36Feb 5, 2024Updated 2 years ago
- A clean no-jargon mathematical definition of transforrmer language model with a Python implementation that focuses on clarity rather than…☆11Jul 23, 2022Updated 3 years ago
- A Golang replacement for the Kubeflow Jupyter Web APIs / Un remplacement Golang pour les API de Web de Jupyter, partie de Kubeflow☆16Mar 30, 2026Updated 3 weeks ago
- Vehicle control☆11Jun 8, 2019Updated 6 years ago
- Contains package that allows converting ROS disparity images to depth images.☆12Mar 23, 2020Updated 6 years ago
- pointMass pybullet RL environment for simple experiments☆23Jan 12, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for Fooling Contrastive Language-Image Pre-trainined Models with CLIPMasterPrints☆15Jan 25, 2026Updated 3 months ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- ☆11Jun 17, 2016Updated 9 years ago
- Diffusion Probabilistic Model in Jax☆13Apr 20, 2024Updated 2 years ago
- This framework provides out-of-the-box implementations of Referential Games variants in order to study the emergence of artificial langua…☆23Dec 2, 2025Updated 4 months ago
- An end-to-end computational pipeline for large Perturb-seq screens☆15Apr 25, 2025Updated last year
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Jun 14, 2018Updated 7 years ago
- Variant impact phenotyping using Perturb-seq☆10Apr 22, 2024Updated 2 years ago
- Open source Java framework to create, process and manage mixtures of exponential family☆14Aug 4, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Accompanying code for AAAI 2021 publication - High-Dimensional Bayesian Optimization via Tree-Structured Additive Models☆11Jun 19, 2024Updated last year
- Probabilistic inference for models of behaviour☆13Mar 5, 2026Updated last month
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆15May 26, 2025Updated 11 months ago
- ☆11Dec 17, 2019Updated 6 years ago
- fast combinations calculation in jax☆39Jul 12, 2024Updated last year
- Bioinformatic MCP server that wraps the most useful functions of the gget library☆27Oct 27, 2025Updated 6 months ago
- MDP and RL interface for PDDL domains via PDDL.jl + POMDPs.jl.☆16Jun 14, 2024Updated last year
- Source files for my experiments not limited to computer graphics.☆13May 11, 2025Updated 11 months ago
- ☆19Apr 13, 2026Updated 2 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Interactive TooManyCells Trees☆14Dec 6, 2024Updated last year
- A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jax☆23Jun 8, 2025Updated 10 months ago
- Official PyTorch implementation for "TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors" [ACL 2026]☆46Apr 14, 2026Updated 2 weeks ago
- dcEmb, the Embecosm Dynamic Causal Modelling library☆13Aug 26, 2024Updated last year
- Contrastive Language-Image Pretraining☆146Sep 6, 2022Updated 3 years ago
- ☆14Jul 9, 2024Updated last year
- minimalist vector ad☆11Feb 11, 2024Updated 2 years ago