Farama-Foundation / Procgen-Staging
Procgen2: A community maintained fork of procgen
☆11Updated 2 years ago
Related projects: ⓘ
- Standard interface for entity based reinforcement learning environments.☆35Updated 6 months ago
- Baselines for gymnax 🤖☆57Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- Fast and procedurally generated side-scroller-game-like graphical environments (formerly Procgen)☆28Updated last year
- General Modules for JAX☆57Updated last month
- An implementation of MuZero in JAX.☆52Updated last year
- Collection of in-progress libraries for entity neural networks.☆29Updated 2 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated last year
- ☆56Updated 3 weeks ago
- Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory☆14Updated 3 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆31Updated 4 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆54Updated last year
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Updated 4 years ago
- Reinforcement Learning inside a 3D soccer simulation☆19Updated this week
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 3 months ago
- Levin tree search guided by both a policy and a heuristic function☆14Updated last year
- Collection of reinforcement learning algorithms☆15Updated 2 years ago
- Accelerated replay buffers in JAX☆39Updated 2 years ago
- An Open-Ended Agentic Simulator☆17Updated last month
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆27Updated last month
- Episodic Control☆19Updated 2 years ago
- An environment for benchmarking commonsense agents☆28Updated 4 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆77Updated 2 years ago
- ☆33Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆30Updated 4 years ago
- GPT implementation in Flax☆18Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆27Updated 2 years ago
- a modular reinforcement learning library with JAX agents☆20Updated 10 months ago