Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)
☆43 · Dec 8, 2022 · Updated 3 years ago
Alternatives and similar repositories for decentralized-rl
Users that are interested in decentralized-rl are comparing it to the libraries listed below.
- Implementation of Receding Horizon Curiosity Algorithm ☆13 · Mar 24, 2023 · Updated 3 years ago
- Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arXiv: 2002.02693) ☆18 · Jul 6, 2023 · Updated 2 years ago
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025). ☆14 · Apr 4, 2025 · Updated 11 months ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning ☆20 · Jan 11, 2023 · Updated 3 years ago
- MuJoCo models for Unitree Robots ☆12 · Nov 24, 2021 · Updated 4 years ago
- ☆18 · Feb 7, 2021 · Updated 5 years ago
- Repo for the multi-agent PressurePlate environment ☆18 · Feb 4, 2022 · Updated 4 years ago
- A small library for creating and manipulating custom JAX Pytree classes ☆56 · Feb 26, 2023 · Updated 3 years ago
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo… ☆16 · Nov 27, 2024 · Updated last year
- A set of environments utilizing pybullet for simulation of robotic manipulation tasks. ☆29 · Mar 8, 2021 · Updated 5 years ago
- ☆16 · Updated this week
- POMDP wrappers for OpenAI Gym ☆15 · Nov 4, 2019 · Updated 6 years ago
- A brief JAX tutorial with examples from control theory ☆12 · Nov 17, 2022 · Updated 3 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions ☆30 · Jun 30, 2020 · Updated 5 years ago
- Neural Fixed-Point Acceleration for Convex Optimization ☆29 · Oct 6, 2022 · Updated 3 years ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020 ☆16 · Jun 22, 2022 · Updated 3 years ago
- ☆14 · Jun 8, 2023 · Updated 2 years ago
- Extending rllab to event-driven multiagent environments ☆13 · Oct 1, 2018 · Updated 7 years ago
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs ☆20 · Oct 19, 2025 · Updated 5 months ago
- Code for magnetic mirror descent. ☆18 · Oct 5, 2023 · Updated 2 years ago
- Scaling scaling laws with board games. ☆53 · Jul 17, 2023 · Updated 2 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models ☆30 · Apr 30, 2021 · Updated 4 years ago
- Quasi-Newton Algorithm for Stochastic Optimization ☆11 · May 20, 2022 · Updated 3 years ago
- AGAC: Adversarially Guided Actor-Critic ☆47 · Sep 16, 2021 · Updated 4 years ago
- Clockwork VAEs in JAX/Flax ☆32 · Jul 16, 2021 · Updated 4 years ago
- Model-Free-Episodic-Control implementation. ☆17 · Jun 3, 2019 · Updated 6 years ago
- ☆25 · Jan 2, 2019 · Updated 7 years ago
- A videogame made with PyGame turned into an OpenAI Gym Learning Environment for Reinforcement Learning agents. ☆15 · Jan 3, 2023 · Updated 3 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆55 · Dec 27, 2020 · Updated 5 years ago
- PyTorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ). ☆21 · Jan 15, 2020 · Updated 6 years ago
- Differentiable Gaussian Process Motion Planning ☆51 · Sep 1, 2021 · Updated 4 years ago
- Experiments in protein folding through language modeling ☆10 · Dec 10, 2021 · Updated 4 years ago
- PyTorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019) ☆22 · Aug 4, 2022 · Updated 3 years ago
- ☆17 · Mar 21, 2021 · Updated 5 years ago
- ☆22 · Nov 8, 2021 · Updated 4 years ago
- Code for the paper Continual Learning from Demonstration of Robotic Skills ☆34 · May 3, 2023 · Updated 2 years ago
- Public Release of Plan2vec Implementation in PyTorch ☆57 · Oct 28, 2022 · Updated 3 years ago
- Code for "Learning Control-Oriented Dynamical Structure from Data" by Spencer M. Richards, Jean-Jacques Slotine, Navid Azizan, and Marco … ☆16 · Oct 23, 2023 · Updated 2 years ago
- ☆17 · Sep 12, 2025 · Updated 6 months ago