zhoubin-me / agent0
Agent Zero RL Framework
☆15 Updated 3 months ago
Alternatives and similar repositories for agent0:
Users who are interested in agent0 are comparing it to the libraries listed below.
- PyTorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆27 Updated 2 years ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆37 Updated last year
- Deep Reinforcement Learning Framework done with PyTorch ☆32 Updated last week
- ☆16 Updated 3 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆93 Updated 3 months ago
- Baselines for gymnax 🤖 ☆63 Updated last year
- Behavioural cloning solution to MineRL2020 competition ☆16 Updated 3 years ago
- Repo to reproduce the First-Explore paper results ☆37 Updated last month
- ☆41 Updated 7 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆39 Updated 2 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO ☆12 Updated last year
- Episodic Control ☆19 Updated 2 years ago
- On-the-fly conversions between JAX and NumPy tensors ☆49 Updated last year
- Reinforcement learning training framework for entity-gym environments. ☆17 Updated 11 months ago
- A tool for recording RL trajectories. ☆101 Updated 3 months ago
- Code for the paper "Harnessing Discrete Representations for Continual Reinforcement Learning" ☆12 Updated 8 months ago
- A2C is a special case of PPO! ☆19 Updated 2 years ago
- Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d… ☆36 Updated 3 years ago
- A toolkit for practical Human-AI cooperation research ☆13 Updated 10 months ago
- Efficient baselines for autocurricula in JAX. ☆179 Updated 5 months ago
- Modular framework for Reinforcement Learning in Python ☆171 Updated 2 years ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making ☆80 Updated last week
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆137 Updated 2 months ago
- ☆13 Updated last year
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in JAX. ☆30 Updated 6 months ago
- PyTorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆60 Updated last year
- ☆20 Updated 8 months ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3. ☆12 Updated 2 years ago
- Adaptable tools to make reinforcement learning and evolutionary computation algorithms. ☆56 Updated 2 years ago
- ☆71 Updated 6 months ago