Creating fixed-length vectors to describe RL/GA policies
☆20Oct 23, 2021Updated 4 years ago
Alternatives and similar repositories for policy-supervectors
Users that are interested in policy-supervectors are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluating different engineering tricks that make RL work☆15Jun 3, 2021Updated 4 years ago
- A TF2.0 implementation of RL baselines.☆10Sep 24, 2021Updated 4 years ago
- ☆16Aug 7, 2021Updated 4 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆30May 26, 2020Updated 6 years ago
- Code for the experiments done in the paper "GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters"☆24May 13, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆21Jul 14, 2020Updated 5 years ago
- Toribash Learning Environment☆53Aug 31, 2023Updated 2 years ago
- StarCraft 2 Imitation Learning☆29Jul 2, 2021Updated 4 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory☆14Jul 9, 2021Updated 4 years ago
- ☆22Mar 28, 2025Updated last year
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"☆11Aug 7, 2023Updated 2 years ago
- Reinforcement learning training framework for entity-gym environments.☆17Mar 18, 2024Updated 2 years ago
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- implementation of our self-guided and self-regularized actor-critic algorithm☆29Jan 1, 2023Updated 3 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Submission code of UEFDRL team to NeurIPS 2019 MineRL challenge (5th place)☆13Nov 13, 2020Updated 5 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆44Dec 11, 2021Updated 4 years ago
- The source code for mastering the game of Chutes and Ladders☆19Apr 2, 2021Updated 5 years ago
- Blazingly fast implementation of the Datasaurus paper. Same Stats, Different Graphs.☆19Mar 22, 2026Updated 2 months ago
- ☆13Aug 9, 2022Updated 3 years ago
- Code and exercises for the Computational Neurodynamics course at Imperial College London☆28Nov 22, 2016Updated 9 years ago
- Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)☆13Oct 8, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Behavioural cloning experiments with video games☆32Apr 15, 2020Updated 6 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆42Aug 27, 2022Updated 3 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆152Mar 19, 2021Updated 5 years ago
- Library for controlling and capturing images from video games☆28Jan 7, 2020Updated 6 years ago
- Small extensions of the Bellman-Ford routines in NetworkX, primarily for convenience☆13May 7, 2018Updated 8 years ago
- ☆20Sep 8, 2023Updated 2 years ago
- Render a JSON with jq patterns.☆20Aug 20, 2023Updated 2 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Feb 21, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆27Jan 23, 2022Updated 4 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆91Nov 21, 2023Updated 2 years ago
- Code for Expert Supervised Reinforcement Learning☆10Apr 7, 2021Updated 5 years ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 7 years ago
- Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.☆21Feb 13, 2023Updated 3 years ago
- ☆28Jun 23, 2020Updated 5 years ago