Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''
☆13Oct 12, 2023Updated 2 years ago
Alternatives and similar repositories for Actor-Critic-Alignment
Users that are interested in Actor-Critic-Alignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆60Feb 3, 2023Updated 3 years ago
- This is the unofficial implementation of LEMON (ICLR'2024).☆13Apr 14, 2024Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆64Apr 4, 2023Updated 3 years ago
- Code base for NeurIPS 2022 paper Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation.☆11Aug 21, 2023Updated 2 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆58Feb 8, 2025Updated last year
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆12Aug 20, 2024Updated last year
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆11Jan 3, 2023Updated 3 years ago
- ☆14Nov 16, 2024Updated last year
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- RL for Energy Management of Microgrids☆11Mar 28, 2020Updated 6 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 6 years ago
- Hierarchical Framework for Interpretable Deep Reinforcement Learning Based- Predictive Maintenance (Applied to NASA Turbofan engine datas…☆14Feb 9, 2024Updated 2 years ago
- Exploiting Transformer in Reinforcement Learning for Interpretable Temporal Logic Motion Planning (RAL 2023)☆13Jul 17, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ORIentate tutorial: SLAM and factor graphs☆16Nov 2, 2024Updated last year
- Tools for GPS navigation in ROS☆13Feb 1, 2023Updated 3 years ago
- Official Implementation of Paper "Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling" (ICML 2023)☆10Jun 6, 2023Updated 2 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14May 24, 2021Updated 4 years ago
- Accompanying code for the paper "Conditional Unscented Autoencoders for Trajectory Prediction"☆16Sep 6, 2024Updated last year
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20)☆10Dec 3, 2025Updated 5 months ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An accurate and reliable wind power forecasting model that can handle the variability and uncertainty of the wind resource. An ensemble …☆13Jul 6, 2023Updated 2 years ago
- Code repository for article: "URA*: Uncertainty-aware Path Planning using Image-based Aerial-to-Ground Traversability Estimation for Off-…☆11Sep 21, 2023Updated 2 years ago
- A helper package to get information of scholarly articles from DBLP using its public API☆16May 13, 2025Updated last year
- ☆18Oct 28, 2023Updated 2 years ago
- Probabilistic planning in continuous state-action MDPs in TensorFlow.☆13Jun 21, 2022Updated 3 years ago
- Decoupled Q-Chunking☆67May 3, 2026Updated 2 weeks ago
- A tour of Pomdpland☆10Aug 10, 2022Updated 3 years ago
- ☆14May 30, 2019Updated 6 years ago
- DeepVO - An RCNN approach to visual odometry☆14Dec 19, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 5 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- ☆11Nov 18, 2023Updated 2 years ago
- A web page to collect reproduced papers in one place with their codes☆14Mar 8, 2023Updated 3 years ago
- ☆18Apr 20, 2025Updated last year
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 6 years ago
- SemiDefinite Programming Algorithm (SDPA) for Python☆12Jan 27, 2025Updated last year