huggingface / gym-hil
Human in the loop Reinforcement Learning suite
β13Updated this week
Alternatives and similar repositories for gym-hil:
Users that are interested in gym-hil are comparing it to the libraries listed below
- π§© Create your own puzzle, use my agents to solve it π€ try them out! π§©β9Updated 3 years ago
- β¨π² Hierarchical extreme multiclass and multi-label classification.β17Updated 2 years ago
- Robust Reinforcement Learning Suiteβ29Updated 4 months ago
- DiffuLab is designed to provide a simple and flexible way to train diffusion models while allowing full customization of its core componeβ¦β26Updated last week
- Python package for emotion analysis in Frenchβ14Updated 3 years ago
- Tidy up your machine learning experimentsβ17Updated 5 years ago
- Toy environment set for multi-agent reinforcement learning and moreβ38Updated 5 months ago
- WIPβ34Updated 9 months ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)β40Updated last year
- Reinforcement learning training framework for entity-gym environments.β17Updated last year
- Build and train Lipschitz-constrained networks: PyTorch implementation of 1-Lipschitz layers. For TensorFlow/Keras implementation, see htβ¦β29Updated 2 months ago
- β35Updated last month
- Efficiently Composable Data Augmentation on the GPU with Jaxβ33Updated 9 months ago
- π Code for the paper: "Look at the Variance! Efficient Black-box Explanations with Sobol-based Sensitivity Analysis" (NeurIPS 2021)β30Updated 2 years ago
- π Explain why metrics change by unpacking themβ38Updated last week
- Baselines for gymnax π€β66Updated 2 years ago
- WandB sweeps integration with Hydra sweeperβ48Updated last year
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BCβ55Updated last year
- Autoregressive Bayesian linear modelβ21Updated 4 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]β100Updated last year
- Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022β28Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.β20Updated 10 months ago
- πͺ The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAXβ57Updated last year
- Reduce multiple PyTorch TensorBoard runs to new event (or CSV) files.β72Updated 3 weeks ago
- An implementation of AlphaZero and MCTS with neural networks for Tetrisβ19Updated last month
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - ββ68Updated 2 months ago
- Simple tools to mix and match PyTorch and Jax - Get the best of both worlds!β28Updated this week
- Simple single-file baselines for Q-Learning in pure-GPU settingβ159Updated last month
- Comparison between GFlowNets & Maximum Entropy RLβ16Updated last year
- An interactive framework to visualize and analyze your AutoML process in real-time.β87Updated last week