Reinforcement learning in pure JAX.
☆13 Dec 24, 2025 Updated 3 months ago
Alternatives and similar repositories for dopamax
Users that are interested in dopamax are comparing it to the libraries listed below.
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax. ☆37 Mar 11, 2026 Updated 2 weeks ago
- Code for the paper Normalizing Flows are Capable Models for RL ☆19 Jun 3, 2025 Updated 9 months ago
- A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal. ☆13 Aug 28, 2023 Updated 2 years ago
- ☆10 Jul 7, 2025 Updated 8 months ago
- ☆31 Mar 11, 2026 Updated 2 weeks ago
- [Oral; Neurips OPT2024 ] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers ☆16 Feb 12, 2026 Updated last month
- Slides and demo code for my presentation entitled Beyond the Basics with Azure ML. ☆12 Aug 3, 2023 Updated 2 years ago
- ☆22 Nov 8, 2021 Updated 4 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning ☆12 Aug 31, 2020 Updated 5 years ago
- LOANΞR is a Loan DApp built with smart contracts that runs on the Ethereum blockchain. ☆11 Jan 24, 2023 Updated 3 years ago
- Toolkit of Causal Model-based Reinforcement Learning. ☆33 Jun 5, 2023 Updated 2 years ago
- Classic MCTS example with mctx ☆24 May 25, 2023 Updated 2 years ago
- official implementation of GenPO ☆30 Jan 7, 2026 Updated 2 months ago
- Estimators to perform off-policy evaluation ☆13 Sep 3, 2024 Updated last year
- RL Environments in JAX 🌍 ☆873 May 30, 2025 Updated 9 months ago
- ☆23 Aug 19, 2022 Updated 3 years ago
- A practical step-by-step guide to applying RUDDER ☆35 Nov 12, 2019 Updated 6 years ago
- Machine learning models implemented from the ground up. ☆15 Jul 18, 2020 Updated 5 years ago
- book for Halide language programming ☆13 Sep 8, 2021 Updated 4 years ago
- Code for "Optimizing Quantum Variational Circuits with Deep Reinforcement Learning" ☆20 May 10, 2024 Updated last year
- Machine Learning from Human Preferences ☆30 Feb 13, 2026 Updated last month
- reinforcement learning for bridge ☆18 Jul 25, 2024 Updated last year
- I moved this folder. Keeping this repo up for archival purposes only. ☆17 Jun 5, 2024 Updated last year
- Personal machine learning survey in Japanese ☆23 Apr 24, 2017 Updated 8 years ago
- Zephyr is a declarative neural network library on top of JAX allowing for easy and fast neural network designing, creation, and manipulat… ☆39 Updated this week
- An efficient implementation of learned optimizers in PyTorch ☆45 Dec 2, 2025 Updated 3 months ago
- ☆13 Dec 31, 2023 Updated 2 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc... ☆14 May 28, 2025 Updated 9 months ago
- Code from PLDI '21 paper "Provable Repair of Deep Neural Networks." ☆10 Nov 26, 2022 Updated 3 years ago
- ☆21 Oct 6, 2021 Updated 4 years ago
- Scala staging framework ☆18 Jul 13, 2018 Updated 7 years ago
- Different approaches for finetuning, evaluating, optimizations for code generation model - codestral ☆11 Jun 18, 2024 Updated last year
- ☆17 Dec 19, 2019 Updated 6 years ago
- It's your data, look at it anywhere ☆66 May 25, 2013 Updated 12 years ago
- Exploring learned cooperation, coevolution and free-riding. Learning is achieved through Multi-Agent Deep Reinforcement Learning (MADRL) … ☆25 Mar 19, 2026 Updated last week
- Two agents shooting at each other, controlled by a neural network optimized with a genetic algorithm. ☆24 Dec 17, 2023 Updated 2 years ago
- Efficiently send large arrays across machines ☆17 Jul 24, 2024 Updated last year
- ☆16 Jul 26, 2017 Updated 8 years ago
- Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018) ☆21 Nov 3, 2019 Updated 6 years ago