A collection of optimizers, some arcane, others well known, for Flax.
☆29Aug 6, 2021Updated 4 years ago
Alternatives and similar repositories for flaxOptimizers
Users that are interested in flaxOptimizers are comparing it to the libraries listed below.
- Your fruity companion for transformers☆14May 25, 2022Updated 3 years ago
- Jax implementation of the AdaHessian optimizer☆20Mar 11, 2021Updated 5 years ago
- ☆12Mar 16, 2022Updated 4 years ago
- Training HuggingFace models using fastai☆11Jul 22, 2021Updated 4 years ago
- A toolkit for interpreting and analyzing neural networks (vision)☆31Jul 28, 2020Updated 5 years ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- To be a next-generation DL-based phenotype prediction from genome mutations.☆19May 17, 2021Updated 4 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆218Apr 4, 2021Updated 5 years ago
- ☆25Jan 29, 2026Updated 2 months ago
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- PyTorch Code for the Paper: "Exploiting Uncertainty of Loss Landscape for Stochastic Optimization" [Bhaskara et al. (2019)]☆16Dec 8, 2025Updated 4 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 10 months ago
- ☆19Nov 27, 2020Updated 5 years ago
- Code for the paper: "Invertible CNN-Based Super Resolution with Downsampling Awareness" by Andrew Geiss and Joseph C. Hardin, Nov 2020☆12Nov 11, 2020Updated 5 years ago
- ☆18Dec 2, 2024Updated last year
- ☆14Aug 28, 2019Updated 6 years ago
- A custom transfer agent for git-lfs that uses rclone to transfer files.☆12Jan 14, 2025Updated last year
- Image scraper for DuckDuckGo and Google for creating DL datasets☆22Sep 18, 2020Updated 5 years ago
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)☆86Feb 10, 2026Updated 2 months ago
- Mobile Viewer for W&B, built on top of Flutter.☆41Mar 2, 2024Updated 2 years ago
- Ranger deep learning optimizer rewrite to use newest components☆341Mar 17, 2026Updated last month
- Estimate derivatives with finite differences☆17Nov 18, 2024Updated last year
- Practical Deep Learning for Time Series / Sequential Data using fastai2/ Pytorch☆12Nov 12, 2020Updated 5 years ago
- ☆10Apr 5, 2024Updated 2 years ago
- A recognition-oriented deep learning framework for biometric sample quality assessment☆12Aug 24, 2023Updated 2 years ago
- TPU support for the fastai library☆13Apr 15, 2021Updated 5 years ago
- A react frontend app template/boilerplate, using Material-UI for UI.☆10May 13, 2024Updated last year
- Perceiver (transformer variant) implemented in JAX and Flax☆13Mar 29, 2021Updated 5 years ago
- Deep learning lectures I am holding for the MSc on Data Science and Scientific Computing☆15Jul 2, 2022Updated 3 years ago
- ☆62Mar 4, 2022Updated 4 years ago
- Code you can use jointly with fastai☆93Nov 30, 2020Updated 5 years ago
- RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network☆15Oct 18, 2022Updated 3 years ago
- Code for OGB competition☆12Feb 24, 2024Updated 2 years ago
- "A Discrete Variational Recurrent Topic Model without the Reparametrization Trick" (NeurIPS 2020)☆11Apr 26, 2021Updated 4 years ago
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL☆12Jun 12, 2025Updated 10 months ago
- A collection of inference modules for fastai2☆91Oct 6, 2022Updated 3 years ago
- ☆15Apr 26, 2022Updated 3 years ago
- A Citation Manager and Zotero Integration for RemNote! Cite research all within your knowledge base!☆30Jan 22, 2026Updated 2 months ago
- Starlight: A Kernel Optimizer for GPU Processing☆16Jan 10, 2024Updated 2 years ago