Reinforcement learning in pure JAX.
☆13Dec 24, 2025Updated 3 months ago
Alternatives and similar repositories for dopamax
Users that are interested in dopamax are comparing it to the libraries listed below.
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆37Mar 11, 2026Updated last month
- Code for the paper Normalizing Flows are Capable Models for RL☆19Jun 3, 2025Updated 10 months ago
- A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal.☆13Aug 28, 2023Updated 2 years ago
- ☆10Jul 7, 2025Updated 9 months ago
- ☆31Mar 11, 2026Updated last month
- [Oral; Neurips OPT2024 ] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆16Feb 12, 2026Updated 2 months ago
- Slides and demo code for my presentation entitled Beyond the Basics with Azure ML.☆12Aug 3, 2023Updated 2 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Aug 31, 2020Updated 5 years ago
- LOANΞR is a Loan DApp built with smart contracts that runs on the Ethereum blockchain.☆11Jan 24, 2023Updated 3 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Jun 5, 2023Updated 2 years ago
- Classic MCTS example with mctx☆24May 25, 2023Updated 2 years ago
- official implementation of GenPO☆31Jan 7, 2026Updated 3 months ago
- Estimators to perform off-policy evaluation☆13Sep 3, 2024Updated last year
- RL Environments in JAX 🌍☆880Apr 2, 2026Updated 2 weeks ago
- ☆23Aug 19, 2022Updated 3 years ago
- A practical step-by-step guide to applying RUDDER☆35Nov 12, 2019Updated 6 years ago
- Machine learning models implemented from the ground up.☆15Jul 18, 2020Updated 5 years ago
- book for Halide language programming☆13Sep 8, 2021Updated 4 years ago
- Code for "Optimizing Quantum Variational Circuits with Deep Reinforcement Learning"☆20May 10, 2024Updated last year
- Machine Learning from Human Preferences☆31Mar 23, 2026Updated 3 weeks ago
- reinforcement learning for bridge☆18Jul 25, 2024Updated last year
- I moved this folder. Keeping this repo up for archival purposes only.☆17Jun 5, 2024Updated last year
- Personal machine learning survey in Japanese☆23Apr 24, 2017Updated 8 years ago
- An efficient implementation of learned optimizers in PyTorch☆46Apr 5, 2026Updated last week
- ☆13Dec 31, 2023Updated 2 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆15May 28, 2025Updated 10 months ago
- Code from PLDI '21 paper "Provable Repair of Deep Neural Networks."☆10Nov 26, 2022Updated 3 years ago
- Zephyr is a declarative neural network library on top of JAX allowing for easy and fast neural network designing, creation, and manipulat…☆39Mar 21, 2026Updated 3 weeks ago
- ☆21Oct 6, 2021Updated 4 years ago
- Scala staging framework☆18Jul 13, 2018Updated 7 years ago
- Different approaches for finetuning, evaluating, optimizations for code generation model - codestral☆11Jun 18, 2024Updated last year
- ☆17Dec 19, 2019Updated 6 years ago
- It's your data, look at it anywhere☆66May 25, 2013Updated 12 years ago
- Exploring learned cooperation, coevolution and free-riding. Learning is achieved through Multi-Agent Deep Reinforcement Learning (MADRL) …☆25Mar 19, 2026Updated 3 weeks ago
- Two agents shooting at each other, controlled by a neural network optimized with a genetic algorithm.☆24Dec 17, 2023Updated 2 years ago
- Efficiently send large arrays across machines☆17Jul 24, 2024Updated last year
- ☆16Jul 26, 2017Updated 8 years ago
- Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018)☆21Nov 3, 2019Updated 6 years ago