[Poster; ICLR 2026] [Oral; Neurips OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers
☆16Apr 15, 2026Updated last month
Alternatives and similar repositories for mu_learned_optimization
Users that are interested in mu_learned_optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An efficient implementation of learned optimizers in PyTorch☆45Apr 21, 2026Updated last month
- ☆11Oct 11, 2023Updated 2 years ago
- Code for the paper "Function-Space Learning Rates"☆24Jun 3, 2025Updated 11 months ago
- ☆50Jan 18, 2024Updated 2 years ago
- A deep learning-powered visual navigation engine to enables autonomous navigation of pocket-size quadrotor - running on PULP☆13Oct 30, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the paper Normalizing Flows are Capable Models for RL☆19Jun 3, 2025Updated 11 months ago
- Reinforcement learning in pure JAX.☆13Dec 24, 2025Updated 4 months ago
- A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal.☆13Aug 28, 2023Updated 2 years ago
- ☆24Sep 25, 2024Updated last year
- ISMIR 2021: Curriculum Learning for Imbalanced Classification in Large Vocabulary Automatic Chord Recognition☆10Nov 8, 2021Updated 4 years ago
- A port of muP to JAX/Haiku☆25Oct 23, 2022Updated 3 years ago
- ACCO: An optimization algorithm for sharded distributed LLM training.☆13May 22, 2025Updated last year
- ☆13Apr 7, 2022Updated 4 years ago
- ☆30Feb 27, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- DeMo: Decoupled Momentum Optimization☆201Dec 2, 2024Updated last year
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆57Mar 10, 2025Updated last year
- Unofficial JAX implementation of the SOAP optimizer (https://arxiv.org/abs/2409.11321)☆25Jan 9, 2026Updated 4 months ago
- Web上に公開されている小説をスクレイピングして青空文庫形式のテキストにする☆19Feb 9, 2017Updated 9 years ago
- notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and produc…☆10Dec 25, 2024Updated last year
- Official code for the paper "Attention as a Hypernetwork"☆57Feb 24, 2026Updated 2 months ago
- Memory Replay with Data Compression (ICLR 2022)☆16Sep 26, 2023Updated 2 years ago
- Generative model for 3D objects.☆18Aug 12, 2023Updated 2 years ago
- ☆10Aug 18, 2016Updated 9 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Stick-breaking attention☆63Jul 1, 2025Updated 10 months ago
- ☆13Oct 8, 2021Updated 4 years ago
- Code for "Optimizing Quantum Variational Circuits with Deep Reinforcement Learning"☆20May 10, 2024Updated 2 years ago
- HGRN2: Gated Linear RNNs with State Expansion☆57Aug 20, 2024Updated last year
- Implementation for robust ViT and scaled attention☆21Apr 4, 2025Updated last year
- Temporal WaSR-T model for maritime obstacle detection via semantic segmentation☆27Nov 29, 2023Updated 2 years ago
- ☆20Oct 21, 2022Updated 3 years ago
- Implementation for Object Permanence Emerges in a Random Walk along Memory☆23Dec 11, 2022Updated 3 years ago
- [WACV'24] Object Re-Identification from Point Clouds☆19Jan 16, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Generic build server☆65May 25, 2014Updated 11 years ago
- For optimization algorithm research and development.☆568May 6, 2026Updated 2 weeks ago
- 🧱 Modula software package☆329Aug 18, 2025Updated 9 months ago
- recipe for training fully-featured self supervised image jepa models☆14Jun 4, 2025Updated 11 months ago
- Pipeline parallelism for the minimalist☆39Aug 6, 2025Updated 9 months ago
- simple dmabuf eglimage example☆10Sep 18, 2014Updated 11 years ago
- AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)☆42Sep 30, 2025Updated 7 months ago