Farama-Foundation / CrowdPlay
A web-based platform for collecting human actions in reinforcement learning environments
☆26 Updated last year
Related projects
Alternatives and complementary repositories for CrowdPlay
- ☆42 Updated 2 years ago
- Vectorization techniques for fast population-based training. ☆54 Updated 2 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL. ☆91 Updated last year
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 Updated 3 years ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆37 Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆34 Updated 2 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆12 Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆35 Updated last week
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆31 Updated 4 years ago
- ☆28 Updated 2 years ago
- MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957 ☆63 Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 5 months ago
- Proto-RL: Reinforcement Learning with Prototypical Representations ☆82 Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX ☆21 Updated 6 months ago
- Causal Analysis of Agent Behavior for AI Safety ☆17 Updated last year
- ☆30 Updated 3 months ago
- Baselines for gymnax 🤖 ☆58 Updated last year
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences" ☆23 Updated 3 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch ☆17 Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 Updated 3 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆52 Updated 7 months ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆22 Updated 7 months ago
- The official implementation of MeDQN algorithm. ☆10 Updated 9 months ago
- AGAC: Adversarially Guided Actor-Critic ☆47 Updated 3 years ago
- Generalised UDRL ☆37 Updated 2 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021 ☆14 Updated 3 years ago
- a modular reinforcement learning library with JAX agents ☆22 Updated 11 months ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments) ☆18 Updated 3 years ago
- A2C is a special case of PPO! ☆19 Updated 2 years ago
- A PyTorch Implementation of Skipper ☆20 Updated last month