PyTorch Implementation of the Sequential Multiagent Rollout algorithm
☆11Jun 28, 2024Updated last year
Alternatives and similar repositories for bertsekas-marl
Users that are interested in bertsekas-marl are comparing it to the libraries listed below.
- A reinforcement learning environment for discrete MDPs.☆25Nov 10, 2024Updated last year
- Creates radiative transfer operator LUTs for the Optimal Retrieval of Aerosol and Cloud (ORAC) code and various other retrieval systems.☆10May 19, 2024Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022)☆22Nov 18, 2022Updated 3 years ago
- Cost-aware Bayesian optimization via the Pandora's box Gittins index☆14Aug 8, 2025Updated 8 months ago
- Generative models and other stuff too, maybe, perhaps even probably☆16Dec 12, 2015Updated 10 years ago
- Repo for Stochastic Processes & Optimization Lab (Public Repo)☆11May 18, 2025Updated 11 months ago
- Code for the paper "Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning…☆13Nov 15, 2023Updated 2 years ago
- ☆18Apr 7, 2025Updated last year
- A Unified and General Framework for Continual Learning, ICLR 2024☆15Mar 22, 2024Updated 2 years ago
- LaTeX poster for S3-NeRF (NeurIPS 2022)☆19Feb 14, 2023Updated 3 years ago
- Improved Hypergradient optimizers for ML, providing better generalization and faster convergence.☆16Apr 3, 2024Updated 2 years ago
- Reproduction of the paper "Reinforcement Learning of Sequential Price Mechanisms"☆12Nov 3, 2022Updated 3 years ago
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".☆17Oct 12, 2022Updated 3 years ago
- Jax implementation of the AdaHessian optimizer☆20Mar 11, 2021Updated 5 years ago
- ☆13Feb 24, 2023Updated 3 years ago
- [TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"…☆17Jan 4, 2023Updated 3 years ago
- Capstone Project for Fall 2017. Name will be changed after project is solidified☆15Dec 7, 2022Updated 3 years ago
- ☆13Oct 23, 2025Updated 6 months ago
- Selected list of papers on World Models that I found interesting and/or useful.☆37Feb 8, 2025Updated last year
- ☆12Apr 12, 2026Updated 3 weeks ago
- Julia library for quantum circuit simulation using tensor networks☆16Jan 7, 2020Updated 6 years ago
- JAX/Haiku implementation of "Auction Learning as a Two-Player Game"☆11Jul 6, 2024Updated last year
- ☆13Mar 12, 2024Updated 2 years ago
- SAP Security research sample code and tutorials for generating differentially private synthetic datasets using generative deep learning m…☆22Mar 7, 2024Updated 2 years ago
- AlexNet, VGG Blocks, Network In Network (NIN), GoogLeNet, ResNet, DenseNet using PyTorch☆12Dec 2, 2019Updated 6 years ago
- Implementation of Deep Q-network to play the game 2048 using Keras.☆13Oct 3, 2021Updated 4 years ago
- PyPARC - A Python Package for Piecewise Affine Regression and Classification☆12Jul 4, 2023Updated 2 years ago
- Deep universal probabilistic programming with Python and PyTorch☆12Apr 1, 2020Updated 6 years ago
- PyTorch implementation of Expectation over Transformation☆13Jul 18, 2025Updated 9 months ago
- Anti-recall and flash-photo parsing plugin for 真寻Bot (Zhenxun Bot)☆10Mar 29, 2023Updated 3 years ago
- Code for replicating experiments from the paper, Preference Exploration for Efficient Bayesian Optimization with Multiple Outcomes, publi…☆13Jun 22, 2023Updated 2 years ago
- code for BINOCULARS and Multi-Step BO☆12Dec 7, 2020Updated 5 years ago
- ☆12Mar 17, 2024Updated 2 years ago
- ☆14Jun 10, 2022Updated 3 years ago
- NeurIPS 2019 Paper☆12Dec 9, 2019Updated 6 years ago
- The Python implementation of the proposed framework in the paper Evolutionary Multi-Objective Deep Reinforcement Learning for Autonomous …☆32Aug 31, 2023Updated 2 years ago
- Unofficial baselines for ManiSkill, including RL and BC algorithms.☆20Jun 6, 2024Updated last year
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Oct 9, 2024Updated last year
- The official code release for Unsupervised Out-of-distribution Detection with Diffusion Inpainting (ICML 2023)☆28Aug 16, 2023Updated 2 years ago