Pendu / ContainerGym
A RL benchmark framework based on real world problem
☆10Updated last year
Alternatives and similar repositories for ContainerGym:
Users that are interested in ContainerGym are comparing it to the libraries listed below
- Computer Vision Papers of the week☆17Updated 2 years ago
- Lecture slides for the MARL book (www.marl-book.com)☆69Updated last month
- Repo to reproduce the First-Explore paper results☆37Updated 3 weeks ago
- This repository contains a better implementation of Kolmogorov-Arnold networks☆59Updated 8 months ago
- Wave height prediction for the Huntington beach in California, USA.☆51Updated last year
- Minimal code for A Generalist Agent☆38Updated 2 years ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 7 months ago
- Gradient Boosting Reinforcement Learning (GBRL)☆95Updated 2 months ago
- A collection of hand on notebook for LLMs practitioner☆41Updated last week
- In this repository, we try to solve musculoskeletal tasks with `Double DQN reinforcement learning` by using a `transformer` model has bee…☆15Updated last year
- A set of jupyter notebooks☆23Updated last month
- ☆25Updated last year
- Test LLMs automatically with Giskard and CI/CD☆29Updated 5 months ago
- General multi-task deep RL Agent☆174Updated 7 months ago
- CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning☆13Updated 6 months ago
- Pretrain Vision and Large Language Models in Python, Published by Packt☆86Updated last year
- ☆13Updated 9 months ago
- Computer Vision Industry Use Cases☆60Updated 2 years ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆26Updated 2 months ago
- ☆44Updated 6 months ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆30Updated this week
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆11Updated 3 months ago
- MLFlow End to End Workshop at Chandigarh University☆11Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness☆39Updated last year
- ☆24Updated 9 months ago
- ☆12Updated 3 years ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆89Updated 3 months ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- ☆27Updated last year
- A hands on advanced RAG tutorials☆22Updated 3 months ago