RLA is a tool for managing your RL experiments automatically
☆31Jan 11, 2025Updated last year
Alternatives and similar repositories for RLAssistant
Users that are interested in RLAssistant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically☆71Feb 7, 2023Updated 3 years ago
- ☆12May 14, 2024Updated last year
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆23Jan 22, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Toolkit of Causal Model-based Reinforcement Learning.☆33Jun 5, 2023Updated 2 years ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆31Sep 28, 2024Updated last year
- LAMPOS, a strategy-based solution approach for mp-MILPs for real-time mixed-integer MPC with sub-optimality quantification☆11Jun 25, 2023Updated 2 years ago
- ☆30Mar 1, 2022Updated 4 years ago
- ☆12Sep 15, 2021Updated 4 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆11Jun 2, 2022Updated 3 years ago
- ☆35Oct 23, 2022Updated 3 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- [ICLR 2021] "Learning a Minimax Optimizer: A Pilot Study" by Jiayi Shen*, Xiaohan Chen*, Howard Heaton*, Tianlong Chen, Jialin Liu, Wotao…☆15Dec 30, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14May 24, 2021Updated 4 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets☆135Nov 21, 2024Updated last year
- Benchmarked implementations of Offline RL Algorithms.☆77Mar 4, 2025Updated last year
- A beamer template for LAMDA lab at NJU☆16Oct 17, 2020Updated 5 years ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 5 months ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- ☆19Oct 27, 2025Updated 6 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.or…☆41Oct 11, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- Algorithm for multiple-shooting differential dynamic programming (MS-DDP) implemented in MATLAB, with a few robotics examples.☆25Apr 10, 2024Updated 2 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Nov 22, 2025Updated 5 months ago
- Neural Fixed-Point Acceleration for Convex Optimization☆30Oct 6, 2022Updated 3 years ago
- 关于Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee这篇论文的详细代码解读☆11Dec 27, 2023Updated 2 years ago
- ChatGPT技术介绍☆21May 9, 2023Updated 2 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Nov 3, 2021Updated 4 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 5 years ago
- Official implementation of NeurIPS'22 paper "Monte Carlo Tree Search based Variable Selection for High-Dimensional Bayesian Optimization"☆42Mar 6, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- SeqGAN but with more bells and whistles☆24Feb 15, 2018Updated 8 years ago
- Tutorial on Multi-Objective Recommender Systems @ KDD 2021☆19Dec 4, 2022Updated 3 years ago
- ☆24Feb 16, 2022Updated 4 years ago
- Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou☆11Jul 20, 2021Updated 4 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆22Oct 3, 2022Updated 3 years ago
- Code for FOCAL Paper Published at ICLR 2021☆55Dec 4, 2023Updated 2 years ago
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year