JAX/Flax implementation of 'Attention Is All You Need' by Vaswani et al. (https://arxiv.org/abs/1706.03762)
★15 · Aug 16, 2021 · Updated 4 years ago
Alternatives and similar repositories for vanilla-transformer-jax
Users who are interested in vanilla-transformer-jax are comparing it to the libraries listed below.
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! · ★14 · Jun 15, 2024 · Updated last year
- Simple, extensible implementations of some meta-learning algorithms in Jax · ★11 · Oct 6, 2020 · Updated 5 years ago
- TD-DMRG and VHCI package · ★11 · Jul 24, 2025 · Updated 8 months ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX. · ★21 · Sep 18, 2020 · Updated 5 years ago
- [ICML 2024] Official PyTorch implementation of the Vectorized Conditional Neural Field. · ★18 · Aug 1, 2024 · Updated last year
- ★42 · Jul 22, 2024 · Updated last year
- minimalistic AI library that resembles HF's transformers · ★13 · Dec 31, 2024 · Updated last year
- Practicum on Supervised Learning in Function Spaces · ★35 · Feb 17, 2022 · Updated 4 years ago
- A pathway and collection of resources for learning Jax, from beginner to advanced. · ★11 · Jan 2, 2021 · Updated 5 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. · ★14 · Dec 21, 2021 · Updated 4 years ago
- Flexible, general-purpose VMC framework, built on JAX. · ★32 · Apr 9, 2026 · Updated last week
- Layered distributions using FLAX/JAX · ★10 · Dec 13, 2020 · Updated 5 years ago
- A JAX implementation of stochastic addition. · ★14 · Aug 15, 2022 · Updated 3 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation · ★12 · May 24, 2021 · Updated 4 years ago
- Obsidian plugin to apply spaced repetition to incrementally develop your notes. · ★22 · Dec 19, 2025 · Updated 4 months ago
- ★24 · Jul 17, 2025 · Updated 9 months ago
- Code for "DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training" · ★55 · Jun 10, 2024 · Updated last year
- Paper introducing jax-cosmo · ★13 · Apr 27, 2023 · Updated 2 years ago
- JAX implementation of Graph Attention Networks · ★13 · Jan 29, 2022 · Updated 4 years ago
- Repository containing code for the NAACL 2021 paper (Incorporating External Knowledge to Enhance Tabular Reasoning) · ★16 · Jun 20, 2021 · Updated 4 years ago
- ★15 · Oct 29, 2019 · Updated 6 years ago
- ★10 · Mar 14, 2021 · Updated 5 years ago
- Speech in Flax/JAX · ★15 · Jul 11, 2022 · Updated 3 years ago
- A GPT-powered conversational agent with an inner monologue, demonstrating artificial consciousness · ★25 · Jun 9, 2023 · Updated 2 years ago
- Short-time Fourier transform (STFT) for JAX · ★15 · Dec 20, 2021 · Updated 4 years ago
- Implementation of language model papers along with several examples [NOT ALL WRITTEN FROM SCRATCH]. · ★12 · Oct 2, 2024 · Updated last year
- JAX-SPH: A Differentiable Smoothed Particle Hydrodynamics Framework · ★79 · Oct 29, 2025 · Updated 5 months ago
- Clockwork VAEs in JAX/Flax · ★32 · Jul 16, 2021 · Updated 4 years ago
- Implementation of the ICML 2023 paper: Coarse-to-Fine: a Hierarchical Diffusion Model for Molecule Generation in 3D · ★51 · Jul 17, 2023 · Updated 2 years ago
- PyTorch implementation of the Position-induced Transformer for operator learning in partial differential equations · ★26 · Jun 3, 2025 · Updated 10 months ago
- Frame-independent vector-cloud neural network for nonlocal constitutive modelling on arbitrary grids. · ★11 · Oct 31, 2021 · Updated 4 years ago
- Applications of PINOs · ★147 · Oct 10, 2022 · Updated 3 years ago
- A quick test of SIREN on out-of-sample tasks · ★13 · Jul 4, 2020 · Updated 5 years ago
- Water wave models in one dimension · ★10 · Apr 2, 2026 · Updated 2 weeks ago
- ★15 · Nov 20, 2023 · Updated 2 years ago
- Pre-training BART in Flax on The Pile dataset · ★22 · Jul 24, 2021 · Updated 4 years ago
- ★10 · Dec 17, 2019 · Updated 6 years ago
- ★12 · Jul 6, 2022 · Updated 3 years ago
- Local Attention - Flax module for Jax · ★22 · May 26, 2021 · Updated 4 years ago