HomebrewNLP in JAX flavour for maintable TPU-Training
☆51Jan 20, 2024Updated 2 years ago
Alternatives and similar repositories for Olmax
Users that are interested in Olmax are comparing it to the libraries listed below
Sorting:
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆132Aug 6, 2022Updated 3 years ago
- ☆13Dec 15, 2025Updated 2 months ago
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- ☆16Jul 16, 2024Updated last year
- Your fruity companion for transformers☆14May 25, 2022Updated 3 years ago
- One stop shop for all things carp☆59Sep 9, 2022Updated 3 years ago
- Create synthetic datasets from scratch using AI-powered generation. Define topics, customize prompts, and generate high-quality reasoning…☆29Feb 14, 2026Updated 2 weeks ago
- Train vision models using JAX and 🤗 transformers☆100Dec 14, 2025Updated 2 months ago
- Easily turn large sets of audio urls to an audio dataset.☆21Dec 27, 2022Updated 3 years ago
- Reimplementation of `Improving language models by retrieving from trillions of tokens`☆19Nov 16, 2022Updated 3 years ago
- Contrastive Language-Image Pretraining☆144Sep 6, 2022Updated 3 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆73Feb 17, 2026Updated last week
- PyTorch interface for TrueGrad Optimizers☆43Aug 8, 2023Updated 2 years ago
- Pytorch-like dataloaders for JAX.☆101Dec 16, 2025Updated 2 months ago
- Benchmark scripts for comparing tutorials in PyTorch and JAX☆14Aug 25, 2022Updated 3 years ago
- NEAL (Nature+Energy Audio Labeller) is an open-source interactive audio data annotation tool.☆16Apr 7, 2025Updated 10 months ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- Modular optimization library for PyTorch (work-in-progress).☆13Feb 4, 2026Updated 3 weeks ago
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆190Jan 11, 2026Updated last month
- Multi-agent simulator in Jax for research and teaching in AI & ALife☆31Updated this week
- ☆29Jul 9, 2024Updated last year
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Nov 10, 2020Updated 5 years ago
- Extracting minimal DFA's from well-trained RNN's☆11Nov 26, 2018Updated 7 years ago
- Gym env for Slay the Spire☆16Dec 31, 2024Updated last year
- ☆13Apr 25, 2024Updated last year
- ☆18Apr 3, 2023Updated 2 years ago
- Train a bidirectional or normal LSTM recurrent neural network to generate text on a free GPU using any dataset. Just upload your text fil…☆12Jan 29, 2019Updated 7 years ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Nov 24, 2025Updated 3 months ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- An annotated implementation of the Hyena Hierarchy paper☆34May 28, 2023Updated 2 years ago
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated last year
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- [EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…☆14Oct 17, 2023Updated 2 years ago
- Collect papers about Mamba (a selective state space model).☆14Aug 6, 2024Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Jan 25, 2024Updated 2 years ago