Pendu / ContainerGymLinks
A RL benchmark framework based on real world problem
☆11Updated 2 years ago
Alternatives and similar repositories for ContainerGym
Users that are interested in ContainerGym are comparing it to the libraries listed below
Sorting:
- ☆23Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆80Updated last year
- Lecture slides for the MARL book (www.marl-book.com)☆104Updated last month
- Test LLMs automatically with Giskard and CI/CD☆30Updated 10 months ago
- Minimal code for A Generalist Agent☆42Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆37Updated 6 months ago
- Additional code for Stable-baselines3 to load and upload models from the Hub.☆87Updated 11 months ago
- Pretrain Vision and Large Language Models in Python, Published by Packt☆88Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated last year
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆21Updated 6 months ago
- ☆12Updated 4 years ago
- Very little code to make PyTorch Lightning models☆16Updated last year
- Unity Machine Learning Agents Toolkit☆48Updated 2 years ago
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆33Updated last year
- ☆48Updated 11 months ago
- RLlib tutorials☆66Updated 3 years ago
- Explainable Reinforcement Learning (XRL) Resources☆41Updated 9 months ago
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆57Updated 2 months ago
- Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)☆82Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- In this repository, we try to solve musculoskeletal tasks with `Double DQN reinforcement learning` by using a `transformer` model has bee…☆16Updated last year
- A Residual Network Design with less than 5 million trainable parameters achieving an accuracy of 96.04% on CIFAR-10.☆28Updated 11 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆28Updated 3 weeks ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- ☆18Updated last year
- Demo for Using GitHub Actions in MLOps☆40Updated 2 years ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆92Updated last week
- ☆20Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.☆49Updated 2 years ago