JAX/Flax implementation of 'Attention Is All You Need' by Vaswani et al. (https://arxiv.org/abs/1706.03762)
☆15 · Aug 16, 2021 · Updated 4 years ago
Alternatives and similar repositories for vanilla-transformer-jax
Users that are interested in vanilla-transformer-jax are comparing it to the libraries listed below.
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! ☆14 · Jun 15, 2024 · Updated last year
- Simple, extensible implementations of some meta-learning algorithms in Jax ☆11 · Oct 6, 2020 · Updated 5 years ago
- TD-DMRG and VHCI package ☆11 · Jul 24, 2025 · Updated 10 months ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX. ☆21 · Sep 18, 2020 · Updated 5 years ago
- Keras-like APIs for JAX framework ☆50 · Mar 25, 2023 · Updated 3 years ago
- [ICML 2024] Official PyTorch implementation of the Vectorized Conditional Neural Field. ☆18 · Aug 1, 2024 · Updated last year
- ☆11 · Nov 4, 2012 · Updated 13 years ago
- minimalistic AI library that resembles HF's transformers ☆13 · Dec 31, 2024 · Updated last year
- ☆42 · Jul 22, 2024 · Updated last year
- Practicum on Supervised Learning in Function Spaces ☆35 · Feb 17, 2022 · Updated 4 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024) ☆10 · Oct 16, 2024 · Updated last year
- A pathway and collection of resources to learning Jax from beginning to advance. ☆11 · Jan 2, 2021 · Updated 5 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 · Dec 21, 2021 · Updated 4 years ago
- Flexible, general-purpose VMC framework, built on JAX. ☆32 · Apr 27, 2026 · Updated last month
- Implementation of Variational Monte Carlo (VMC) for quantum many-body dynamics using JAX. ☆71 · Sep 26, 2025 · Updated 8 months ago
- Korean sentence analysis system BCD-KL-Parser ☆10 · Jun 23, 2020 · Updated 5 years ago
- Procgen2: A community maintained fork of procgen ☆12 · Aug 25, 2022 · Updated 3 years ago
- Deep NLP 2 (2019.3-5) ☆10 · Feb 19, 2019 · Updated 7 years ago
- Benchmarking Autoregressive Conditional Diffusion Models for Turbulent Flow Simulation ☆115 · Dec 13, 2024 · Updated last year
- ☆10 · Aug 6, 2022 · Updated 3 years ago
- ☆14 · Mar 13, 2024 · Updated 2 years ago
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p… ☆14 · Jan 3, 2025 · Updated last year
- Layered distributions using FLAX/JAX ☆10 · Dec 13, 2020 · Updated 5 years ago
- A JAX implementation of stochastic addition. ☆14 · Aug 15, 2022 · Updated 3 years ago
- Expanded KR-BERT by adding more training data ☆13 · Apr 23, 2021 · Updated 5 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jul 22, 2023 · Updated 2 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha… ☆11 · Nov 26, 2024 · Updated last year
- ☆25 · Jul 17, 2025 · Updated 10 months ago
- An RPG Maker MZ plugin ☆12 · Nov 2, 2023 · Updated 2 years ago
- Paper introducing jax-cosmo ☆13 · Apr 27, 2023 · Updated 3 years ago
- Attend - to what matters. ☆17 · Feb 22, 2025 · Updated last year
- JAX implementation of Graph Attention Networks ☆13 · Jan 29, 2022 · Updated 4 years ago
- ☆15 · Oct 29, 2019 · Updated 6 years ago
- ☆10 · Mar 14, 2021 · Updated 5 years ago
- Code for the blog post ☆12 · Jan 15, 2021 · Updated 5 years ago
- Speech in Flax/JAX ☆15 · Jul 11, 2022 · Updated 3 years ago
- ☆46 · Oct 5, 2024 · Updated last year
- An implementation of the transformer quantum state, a multi-purpose model for quantum many-body problems ☆38 · Sep 8, 2023 · Updated 2 years ago
- PyTorch code for the Nero optimiser. ☆22 · Oct 12, 2022 · Updated 3 years ago