hbonnavaud / sciborg
Reinforcement learning framework.
☆12Updated 3 months ago
Related projects
Alternatives and complementary repositories for sciborg
- Explainable Reinforcement Learning (XRL) Resources☆33Updated last month
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆33Updated 8 months ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆89Updated last year
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies☆63Updated 2 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆63Updated 3 months ago
- An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Gradients, in Pytorch☆46Updated this week
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Corax: Core RL in JAX☆35Updated 9 months ago
- Reinforcement Learning inside a 3D soccer simulation☆24Updated 2 months ago
- Modular Single-file Reinforcement Learning Algorithms Library☆37Updated last year
- ☆17Updated 5 months ago
- Robust Reinforcement Learning Suite☆19Updated 5 months ago
- a modular reinforcement learning library with JAX agents☆22Updated last year
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆26Updated 3 years ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆72Updated 3 weeks ago
- Baselines for gymnax 🤖☆60Updated last year
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆39Updated 2 years ago
- ☆28Updated 2 years ago
- ☆63Updated 3 months ago
- Repo to reproduce the First-Explore paper results☆36Updated 2 weeks ago
- ☆55Updated last month
- A2C is a special case of PPO!☆19Updated 2 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆91Updated last year
- Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023☆51Updated 2 weeks ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated 2 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆23Updated 3 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆46Updated last year
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/☆90Updated 3 years ago
- ☆20Updated 6 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆52Updated 7 months ago