JAX/Flax implementation of 'Attention Is All You Need' by Vaswani et al. (https://arxiv.org/abs/1706.03762)
☆15 · Aug 16, 2021 · Updated 4 years ago
Alternatives and similar repositories for vanilla-transformer-jax
Users that are interested in vanilla-transformer-jax are comparing it to the libraries listed below
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! ☆14 · Jun 15, 2024 · Updated last year
- Simple, extensible implementations of some meta-learning algorithms in Jax ☆11 · Oct 6, 2020 · Updated 5 years ago
- [ICML 2024] Official PyTorch implementation of the Vectorized Conditional Neural Field. ☆17 · Aug 1, 2024 · Updated last year
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX. ☆21 · Sep 18, 2020 · Updated 5 years ago
- Benchmarking of diffusion models for global field reconstruction from sparse observations ☆31 · Dec 4, 2024 · Updated last year
- An RPG Maker MZ plugin ☆12 · Nov 2, 2023 · Updated 2 years ago
- Practicum on Supervised Learning in Function Spaces ☆35 · Feb 17, 2022 · Updated 4 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha… ☆11 · Nov 26, 2024 · Updated last year
- The PyTorch implementation of paper "KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation" ☆15 · Jul 4, 2025 · Updated 8 months ago
- ☆14 · Mar 3, 2022 · Updated 4 years ago
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p… ☆14 · Jan 3, 2025 · Updated last year
- Water wave models in one dimension ☆10 · Feb 24, 2026 · Updated last week
- Source code and datasets for Circuit Design Completion using GNNs paper ☆10 · Jan 26, 2023 · Updated 3 years ago
- Benchmarking Autoregressive Conditional Diffusion Models for Turbulent Flow Simulation ☆109 · Dec 13, 2024 · Updated last year
- Layered distributions using FLAX/JAX ☆10 · Dec 13, 2020 · Updated 5 years ago
- Code for the blog post ☆12 · Jan 15, 2021 · Updated 5 years ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries". ☆12 · Jan 9, 2025 · Updated last year
- Repo for the NeurIPS Learning to Predict Structural Vibrations paper, provides a dataset and method for vibration prediction in plate str… ☆11 · Oct 7, 2025 · Updated 5 months ago
- ☆10 · Mar 14, 2021 · Updated 4 years ago
- ☆11 · Nov 4, 2012 · Updated 13 years ago
- ☆15 · Dec 31, 2023 · Updated 2 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024) ☆10 · Oct 16, 2024 · Updated last year
- ☆16 · Mar 14, 2025 · Updated 11 months ago
- ☆10 · Aug 22, 2023 · Updated 2 years ago
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- [Review] Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environment ☆10 · Dec 22, 2018 · Updated 7 years ago
- ☆12 · Nov 29, 2021 · Updated 4 years ago
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong… ☆11 · Jun 18, 2018 · Updated 7 years ago
- PyTorch implementation of NASA: NEURAL ARTICULATED SHAPE APPROXIMATION ☆12 · May 4, 2021 · Updated 4 years ago
- Simple implementation of an AABB Tree (Axis Aligned Bounding Box Tree) to optimize 3d collision detection ☆10 · Oct 22, 2024 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jul 22, 2023 · Updated 2 years ago
- Learning Generalized Physical Representation from a Few Examples ☆18 · Feb 9, 2026 · Updated last month
- minimalistic AI library that resembles HF's transformers ☆13 · Dec 31, 2024 · Updated last year
- ☆16 · Jun 25, 2025 · Updated 8 months ago
- Attend - to what matters. ☆17 · Feb 22, 2025 · Updated last year
- Published in IEEE Trans. Comput. Imag. ☆10 · Oct 22, 2024 · Updated last year
- ☆11 · Mar 5, 2024 · Updated 2 years ago
- An introductory course on computational physics taught at the University of Vermont ☆11 · Dec 3, 2021 · Updated 4 years ago
- Generate Parametric Ship Hull with a Guided Tabular Diffusion Model ☆15 · Dec 5, 2024 · Updated last year