facebookresearch / xbanditsrl
Contextual Bandit Spectral Representation Learner
☆10 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for xbanditsrl
- TaskMet: Task-driven Metric Learning for Model Learning ☆18 · Updated 9 months ago
- ☆29 · Updated 2 years ago
- ☆16 · Updated 3 years ago
- A2C is a special case of PPO! ☆19 · Updated 2 years ago
- Gym wrapper for pysc2 ☆10 · Updated 2 years ago
- ☆14 · Updated last month
- Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020) ☆44 · Updated last year
- Generalised UDRL ☆37 · Updated 2 years ago
- Automatically generate simple meta-learning tasks from a very large space ☆15 · Updated last year
- Evaluating different engineering tricks that make RL work ☆15 · Updated 3 years ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze ☆10 · Updated 2 years ago
- ☆11 · Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆26 · Updated 2 years ago
- Toy environment set for multi-agent reinforcement learning and more ☆38 · Updated 2 years ago
- Clockwork VAEs in JAX/Flax ☆31 · Updated 3 years ago
- Neuroevolution Benchmark in JAX 🦕 ☆36 · Updated last year
- ☆20 · Updated 5 years ago
- ☆29 · Updated 2 years ago
- Causal Analysis of Agent Behavior for AI Safety ☆17 · Updated last year
- Understanding RL vision Distill article ☆23 · Updated last year
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic… ☆11 · Updated 3 years ago
- Simple implementations of multi-agent evolutionary strategies using PyTorch. ☆15 · Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learning ☆60 · Updated 5 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL ☆43 · Updated 3 years ago
- Gym environment for playing Wordle with RL agents ☆38 · Updated 2 years ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3 ☆25 · Updated this week
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems" ☆34 · Updated last year
- PyTorch-based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al. ☆11 · Updated 4 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning" ☆18 · Updated 2 years ago