Official PyTorch implementation of AlberDICE
☆23Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for alberdice
Users that are interested in alberdice are comparing it to the libraries listed below.
- ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"☆33Nov 30, 2023Updated 2 years ago
- Repository (preliminary codes) for DSTC10 SIMMC track.☆19Dec 9, 2022Updated 3 years ago
- DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.☆27Jun 3, 2024Updated last year
- Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.☆36Oct 7, 2024Updated last year
- ☆12Nov 21, 2023Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆73Jun 13, 2024Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆41Feb 18, 2025Updated last year
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆47Sep 11, 2024Updated last year
- Reinforcement learning for batch bioprocess optimization (Computers & Chemical Engineering, 2020)☆16Jun 14, 2022Updated 3 years ago
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Oct 31, 2024Updated last year
- Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning☆14Dec 8, 2022Updated 3 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆26Jan 16, 2023Updated 3 years ago
- ☆28Jul 28, 2022Updated 3 years ago
- A collection of matrix games in JAX☆13Apr 13, 2026Updated 3 weeks ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆132Nov 12, 2025Updated 5 months ago
- [ICML2025] Official implementation of Efficient Online Reinforcement Learning for Diffusion Policies appearing in ICML 2025.☆59Apr 25, 2026Updated 2 weeks ago
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13May 2, 2024Updated 2 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆46Jul 27, 2023Updated 2 years ago
- [NeurIPS 2022] ASPiRe: Adaptive Skill Priors for Reinforcement Learning☆13Oct 19, 2022Updated 3 years ago
- SWATgym: a reinforcement learning environment for crop management.☆14Sep 17, 2025Updated 7 months ago
- PyPSA Model of the South African Energy System☆24Apr 8, 2026Updated last month
- Comparison between GFlowNets & Maximum Entropy RL☆19Feb 19, 2024Updated 2 years ago
- ☆17Dec 30, 2024Updated last year
- An OpenAI gym environment for crop management☆24Jul 12, 2021Updated 4 years ago
- code for ROMANCE☆14Oct 12, 2024Updated last year
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆28Nov 23, 2024Updated last year
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆20Sep 12, 2025Updated 7 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆89Oct 15, 2023Updated 2 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19May 10, 2024Updated last year
- Emergency Vehicle Smart Grid to provide faster movement to emergency vehicles.☆11Dec 12, 2019Updated 6 years ago
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.☆23Jun 16, 2023Updated 2 years ago
- Lecture slides for the MARL book (www.marl-book.com)☆170May 14, 2025Updated 11 months ago
- This is the code implementation of the Neural ordinary differential equations-based Lyapunov-Barrier Actor-Critic (NLBAC)☆16Sep 4, 2024Updated last year
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Jul 27, 2021Updated 4 years ago
- ☆115Aug 6, 2024Updated last year
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 4 years ago
- Safe Reinforcement Learning with Natural Language Constraints☆15Oct 24, 2021Updated 4 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆34Oct 10, 2020Updated 5 years ago
- M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025☆19Mar 17, 2025Updated last year