JAX/Flax implementation of 'Attention Is All You Need' by Vaswani et al. (https://arxiv.org/abs/1706.03762)
☆15Aug 16, 2021Updated 4 years ago
Alternatives and similar repositories for vanilla-transformer-jax
Users that are interested in vanilla-transformer-jax are comparing it to the libraries listed below.
- Simple, extensible implementations of some meta-learning algorithms in Jax☆11Oct 6, 2020Updated 5 years ago
- TD-DMRG and VHCI package☆11Jul 24, 2025Updated 8 months ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆21Sep 18, 2020Updated 5 years ago
- Keras-like APIs for JAX framework☆50Mar 25, 2023Updated 3 years ago
- Benchmarking of diffusion models for global field reconstruction from sparse observations☆34Mar 22, 2026Updated last week
- [ICML 2024] Official PyTorch implementation of the Vectorized Conditional Neural Field.☆17Aug 1, 2024Updated last year
- The PyTorch implementation of paper "KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation"☆15Jul 4, 2025Updated 8 months ago
- ☆11Nov 4, 2012Updated 13 years ago
- Practicum on Supervised Learning in Function Spaces☆35Feb 17, 2022Updated 4 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- A pathway and collection of resources for learning Jax from beginner to advanced.☆11Jan 2, 2021Updated 5 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Unofficial JAX implementation of the SOAP optimizer (https://arxiv.org/abs/2409.11321)☆25Jan 9, 2026Updated 2 months ago
- Flexible, general-purpose VMC framework, built on JAX.☆31Nov 25, 2025Updated 4 months ago
- Implementation of Variational Monte Carlo (VMC) for quantum many-body dynamics using JAX.☆71Sep 26, 2025Updated 6 months ago
- Korean sentence analysis system BCD-KL-Parser☆10Jun 23, 2020Updated 5 years ago
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- Deep NLP 2 (2019.3-5)☆10Feb 19, 2019Updated 7 years ago
- Benchmarking Autoregressive Conditional Diffusion Models for Turbulent Flow Simulation☆111Dec 13, 2024Updated last year
- ☆14Mar 13, 2024Updated 2 years ago
- Layered distributions using FLAX/JAX☆10Dec 13, 2020Updated 5 years ago
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- ☆13Jul 31, 2023Updated 2 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- ☆23Jul 17, 2025Updated 8 months ago
- Code for "DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training"☆54Jun 10, 2024Updated last year
- Paper introducing jax-cosmo☆13Apr 27, 2023Updated 2 years ago
- An RPG Maker MZ plugin☆12Nov 2, 2023Updated 2 years ago
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago
- ☆14Sep 30, 2021Updated 4 years ago
- ☆10Mar 14, 2021Updated 5 years ago
- Code for the blog post☆12Jan 15, 2021Updated 5 years ago
- Short-time Fourier transform (STFT) for JAX☆15Dec 20, 2021Updated 4 years ago
- An implementation of the transformer quantum state, a multi-purpose model for quantum many-body problems☆37Sep 8, 2023Updated 2 years ago
- ☆15Mar 3, 2022Updated 4 years ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- Frame-independent vector-cloud neural network for nonlocal constitutive modelling on arbitrary grids.☆11Oct 31, 2021Updated 4 years ago