Pendu / ContainerGym
A RL benchmark framework based on real world problem
β10Updated last year
Related projects β
Alternatives and complementary repositories for ContainerGym
- Additional code for Stable-baselines3 to load and upload models from the Hub.β77Updated 4 months ago
- Complete implementation of Llama2 with/without KV cache & inference πβ47Updated 5 months ago
- Gradient Boosting Reinforcement Learning (GBRL)β88Updated this week
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGOβ47Updated last month
- Minimal code for A Generalist Agentβ36Updated 2 years ago
- β21Updated last year
- A paper list of sample-efficient reinforcement learningβ13Updated 2 years ago
- This FastAPI-based RAG service processes OCR data, generates embeddings using OpenAI, and utilizes Pinecone as a vector database for searβ¦β11Updated 4 months ago
- Repo to reproduce the First-Explore paper resultsβ36Updated 3 weeks ago
- Test LLMs automatically with Giskard and CI/CDβ28Updated 3 months ago
- In this repository, we try to solve musculoskeletal tasks with `Double DQN reinforcement learning` by using a `transformer` model has beeβ¦β14Updated last year
- β25Updated last year
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility andβ¦β76Updated last month
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effeβ¦β19Updated 9 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."β23Updated 3 weeks ago
- Unity Machine Learning Agents Toolkitβ45Updated last year
- Collection of python scripts to demonstrate asynchronous programming in pythonβ11Updated 2 years ago
- AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL.β32Updated 10 months ago
- β20Updated 8 months ago
- β12Updated 3 years ago
- Clean RL implementation using MLXβ27Updated 8 months ago
- Coherent Soft Imitation Learningβ17Updated 3 months ago
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.β26Updated 9 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"β31Updated last year
- Building GPT ...β17Updated 3 months ago
- A collection of hand on notebook for LLMs practitionerβ38Updated 2 months ago
- β44Updated 4 months ago
- The open source implementation of the base model behind GPT-4 from OPENAI [Language + Multi-Modal]β11Updated last year