vwxyzjn/a2c_is_a_special_case_of_ppo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vwxyzjn/a2c_is_a_special_case_of_ppo)

vwxyzjn / a2c_is_a_special_case_of_ppo

A2C is a special case of PPO!

☆23

Alternatives and similar repositories for a2c_is_a_special_case_of_ppo

Users that are interested in a2c_is_a_special_case_of_ppo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vwxyzjn / gym-pysc2
View on GitHub
Gym wrapper for pysc2
☆10Sep 16, 2022Updated 3 years ago
philipjball / OffCon3
View on GitHub
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25Jun 20, 2021Updated 5 years ago
Miffyli / rl-action-space-shaping
View on GitHub
Experiment code for testing effect of various action space transformations in reinforcement learning
☆30May 26, 2020Updated 6 years ago
RyanNavillus / reward-surfaces
View on GitHub
☆19Apr 22, 2024Updated 2 years ago
jsw7460 / sb3_jax
View on GitHub
☆13Aug 9, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
tsmatz / minecraft-rl-example
View on GitHub
Applying Reinforcement Learning in Minecraft - Project Malmo Tutorial (Mar 2021)
☆21Nov 8, 2023Updated 2 years ago
EleutherAI / equivariance
View on GitHub
A framework for implementing equivariant DL
☆10May 25, 2021Updated 5 years ago
xingchenwan / bgpbt
View on GitHub
[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)
☆31Sep 16, 2022Updated 3 years ago
p-kar / a2c-acktr-vizdoom
View on GitHub
A2C, ACKTR and A2T implementations for ViZDoom
☆10Dec 18, 2017Updated 8 years ago
vwxyzjn / jupyter_disqus
View on GitHub
Add Disqus to your Jupyter notebook.
☆14Feb 14, 2018Updated 8 years ago
RyanNavillus / PPO-v3
View on GitHub
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
☆16May 19, 2023Updated 3 years ago
markub3327 / rl-toolkit
View on GitHub
RL-Toolkit: A Research Framework for Robotics
☆21Jan 22, 2026Updated 6 months ago
IouJenLiu / HTS-RL
View on GitHub
☆21Dec 22, 2020Updated 5 years ago
vwxyzjn / SC2AI
View on GitHub
Integrated Tensorforce and OpenAI Gym to train SC II game agents.
☆13Oct 26, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
IgnisDa / rust-libs
View on GitHub
A set of packages for different rust needs
☆15Jan 11, 2023Updated 3 years ago
citizenhicks / openai_DDDQN
View on GitHub
☆14Mar 24, 2021Updated 5 years ago
levelupai / rl-slg
View on GitHub
Reinforcement learning training project for a SLG game
☆13Dec 21, 2017Updated 8 years ago
mwydmuch / PyOblige
View on GitHub
PyOblige is Python wrapper for OBLIGE - random level generator for Doom
☆11Jul 2, 2018Updated 8 years ago
entity-neural-network / entity-gym
View on GitHub
Standard interface for entity based reinforcement learning environments.
☆39Feb 28, 2024Updated 2 years ago
IgnisDa / ocean-pv
View on GitHub
A website that can visualize your personality
☆12Feb 15, 2023Updated 3 years ago
IgnisDa / printr
View on GitHub
The smarter echo alternative
☆12Jan 28, 2022Updated 4 years ago
newera-001 / motor-system
View on GitHub
A project copied from google-research which named motion-imitation was rewrited with PyTorch
☆10Sep 30, 2022Updated 3 years ago
instadeepai / fastpbrl
View on GitHub
Vectorization techniques for fast population-based training.
☆57Apr 26, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
cwher / RL-Sekiro
View on GitHub
☆12Jun 30, 2022Updated 4 years ago
NathanGavenski / IL-Datasets
View on GitHub
This is a project for creating and using IL datasets based on HuggingFace weights with multithreads for performance, and benchmarking
☆13Jun 23, 2026Updated last month
jetnew / visrl
View on GitHub
A simple wrapper to analyse and visualise reinforcement learning agents' behaviour in the environment.
☆14Jan 8, 2022Updated 4 years ago
davevad93 / pass-gen
View on GitHub
A simple password generator web app
☆16Mar 30, 2026Updated 3 months ago
vwxyzjn / gym-microrts-paper
View on GitHub
The source code for the gym-microrts paper.
☆44Aug 5, 2022Updated 3 years ago
ketatam / Exploring-Munchausen-Reinforcement-Learning
View on GitHub
PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces
☆15Oct 3, 2021Updated 4 years ago
kkhetarpal / ioc
View on GitHub
Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020
☆25Jul 31, 2020Updated 5 years ago
ingambe / RayEnvWrapper
View on GitHub
OpenAi's gym environment wrapper to vectorize them with Ray
☆23May 25, 2023Updated 3 years ago
uic-nlp-lab / virtualcoachdata
View on GitHub
☆15Dec 12, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
vwxyzjn / invalid-action-masking
View on GitHub
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
☆168May 9, 2023Updated 3 years ago
evgenii-nikishin / rl_with_resets
View on GitHub
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆106May 17, 2022Updated 4 years ago
MasterScrat / rl-insights
View on GitHub
🤖 Reinforcement Learning paper summaries, notebooks, and articles.
☆26Apr 16, 2020Updated 6 years ago
davevad93 / rest-countries-django-app
View on GitHub
Small project built with Django that retrieves data from the REST Countries API and the Wikipedia API.
☆22Mar 26, 2026Updated 4 months ago
zhoubin-me / agent0
View on GitHub
Agent Zero RL Framework
☆15Nov 22, 2024Updated last year
polixir / causal-mbrl
View on GitHub
Toolkit of Causal Model-based Reinforcement Learning.
☆33Jun 5, 2023Updated 3 years ago
johanobandoc / revisiting_rainbow
View on GitHub
Revisiting Rainbow
☆76Jun 9, 2021Updated 5 years ago