Miffyli/policy-supervectors

Creating fixed-length vectors to describe RL/GA policies

☆20

Alternatives and similar repositories for policy-supervectors

Users who are interested in policy-supervectors are comparing it to the libraries listed below.

Miffyli / rl-human-prior-tricks
Evaluating different engineering tricks that make RL work
☆15 · Jun 3, 2021 · Updated 5 years ago
Ending2015a / unstable_baselines
A TF2.0 implementation of RL baselines.
☆10 · Sep 24, 2021 · Updated 4 years ago
MichalOp / MineRL2020
☆16 · Aug 7, 2021 · Updated 4 years ago
Miffyli / rl-action-space-shaping
Experiment code for testing the effect of various action space transformations in reinforcement learning
☆30 · May 26, 2020 · Updated 6 years ago
Miffyli / gan-aimbots
Code for the experiments done in the paper "GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters"
☆24 · May 13, 2022 · Updated 4 years ago
amiranas / minerl_imitation_learning
☆21 · Jul 14, 2020 · Updated 6 years ago
chscheller / sc2_imitation_learning
StarCraft 2 Imitation Learning
☆29 · Jul 2, 2021 · Updated 5 years ago
Danielhp95 / Regym
☆12 · Jan 3, 2022 · Updated 4 years ago
hegde95 / Agents_that_Listen
Train an agent to play VizDoom with multi-sensory inputs. Trained using Sample Factory.
☆14 · Jul 9, 2021 · Updated 5 years ago
LucasAlegre / mbcd
Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"
☆11 · Aug 7, 2023 · Updated 2 years ago
stanford-iprl-lab / GRAC
Implementation of our self-guided and self-regularized actor-critic algorithm
☆29 · Jan 1, 2023 · Updated 3 years ago
bhairavmehta95 / ant-env
Ant Gather and Ant Maze envs, separated from RLLab
☆11 · Aug 2, 2018 · Updated 7 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 · Nov 27, 2019 · Updated 6 years ago
entity-neural-network / incubator
Collection of in-progress libraries for entity neural networks.
☆29 · Jun 24, 2022 · Updated 4 years ago
codingfisch / flashrl
Fast reinforcement learning 💨
☆29 · Jul 15, 2025 · Updated last year
Miffyli / minecraft-bc
Submission code of the UEFDRL team to the NeurIPS 2019 MineRL challenge (5th place)
☆13 · Nov 13, 2020 · Updated 5 years ago
Miffyli / mastering-chutes-and-ladders
The source code for mastering the game of Chutes and Ladders
☆19 · Apr 2, 2021 · Updated 5 years ago
schmidtdominik / Rainbow
Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …
☆44 · Dec 11, 2021 · Updated 4 years ago
araffin / datasaurust
Blazingly fast implementation of the Datasaurus paper. Same Stats, Different Graphs.
☆19 · Mar 22, 2026 · Updated 4 months ago
pmediano / ComputationalNeurodynamics
Code and exercises for the Computational Neurodynamics course at Imperial College London
☆28 · Nov 22, 2016 · Updated 9 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆67 · Jun 21, 2018 · Updated 8 years ago
resibots / kaushik_2018_multi-dex
Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)
☆13 · Oct 8, 2018 · Updated 7 years ago
manantomar / Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
☆43 · Oct 31, 2020 · Updated 5 years ago
joonaspu / video-game-behavioural-cloning
Behavioural cloning experiments with video games
☆32 · Apr 15, 2020 · Updated 6 years ago
hari-sikchi / LOOP
Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]
☆42 · Aug 27, 2022 · Updated 3 years ago
takuseno / d4rl-pybullet
Datasets for data-driven deep reinforcement learning with PyBullet environments
☆152 · Mar 19, 2021 · Updated 5 years ago
nelsonuhan / bellmanford
Small extensions of the Bellman-Ford routines in NetworkX, primarily for convenience
☆13 · May 7, 2018 · Updated 8 years ago
Misterio77 / dotfiles
Dotfiles for my Arch setup.
☆19 · Jul 7, 2021 · Updated 5 years ago
IRLL / HIPPO_Gym
☆20 · Sep 8, 2023 · Updated 2 years ago
KMarino / hrl-ep3
Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies
☆15 · Feb 21, 2019 · Updated 7 years ago
mxu34 / mbrl-gpmm
☆28 · Jun 23, 2020 · Updated 6 years ago
wkentaro / jqk
Render a JSON with jq patterns.
☆20 · Aug 20, 2023 · Updated 2 years ago
ryanxhr / BEAR
PyTorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11 · Oct 29, 2019 · Updated 6 years ago
kachayev / gym-microrts-paper-sb3
RL agent to play μRTS with Stable-Baselines3 and PyTorch
☆28 · Jan 23, 2022 · Updated 4 years ago
zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆93 · Nov 21, 2023 · Updated 2 years ago
RuohanW / RED
Implementation of Random Expert Distillation
☆29 · May 11, 2019 · Updated 7 years ago
asonabend / ESRL
Code for Expert Supervised Reinforcement Learning
☆10 · Apr 7, 2021 · Updated 5 years ago
qgallouedec / deep_rl
Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.
☆21 · Feb 13, 2023 · Updated 3 years ago
MultiPath / NMT-RDPG
Neural machine translation with Recurrent Deterministic Policy Gradient
☆10 · Aug 18, 2016 · Updated 9 years ago