General implementation of Advantage Actor Critic using Pytorch
☆28Dec 7, 2021Updated 4 years ago
Alternatives and similar repositories for PyTorch-A2C
Users that are interested in PyTorch-A2C are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆47Nov 25, 2017Updated 8 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- A well-documented A2C written in PyTorch☆53Jun 3, 2019Updated 7 years ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆17Jan 3, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Jun 7, 2024Updated 2 years ago
- OpenAI Gym environment for graph search problems such as shortest path.☆11Dec 24, 2019Updated 6 years ago
- Centralized cooperative reinforcement learning☆13Jan 8, 2023Updated 3 years ago
- Study to test if Volume leak index (VLI) is a marker of severity of illness in sepsis.☆14Sep 29, 2022Updated 3 years ago
- PyTorch implementation of both discrete and continuous ACER☆25Jan 27, 2019Updated 7 years ago
- This repository contains the R code used analyse the eICU and MIMIC-III databases for the Sarkar et al paper "Performance of intensive ca…☆10Nov 27, 2020Updated 5 years ago
- RL Algorithms☆13Mar 19, 2023Updated 3 years ago
- Course Homepage☆11Aug 29, 2016Updated 9 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Map-Elites based on Evolution Strategies☆33Feb 11, 2022Updated 4 years ago
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago
- Distributed implementation of popular evolutionary methods☆64Dec 26, 2017Updated 8 years ago
- ☆11Jan 5, 2023Updated 3 years ago
- Code for the Reset-free Trial and Error learning paper (RTE) experiments☆10Jan 3, 2018Updated 8 years ago
- PyTorch Implementation of Global Neuron Shape Reasoning with Point Affinity Transformers☆14Mar 16, 2026Updated 2 months ago
- Fast asynchronous GPU monitoring tool across multiple machines through SSH☆11Nov 26, 2024Updated last year
- A2C is a special case of PPO!☆23May 20, 2022Updated 4 years ago
- ☆18Apr 6, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code companion of Multi-task Learning for Aggregated Data using Gaussian Processes paper☆11Apr 6, 2020Updated 6 years ago
- ☆11Sep 29, 2021Updated 4 years ago
- ☆15Oct 9, 2022Updated 3 years ago
- Awesome list of Semantic Communications (SemCom) for Resource Allocation☆11Aug 19, 2024Updated last year
- Attempting to predict player action in multiplayer pot limit Texas Hold'em☆11Dec 4, 2011Updated 14 years ago
- ☆15Dec 3, 2022Updated 3 years ago
- Solving the Stable Marriage/Matching Problem with the Gale–Shapley algorithm☆13Jul 14, 2019Updated 6 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- prediction-correction scheme based on Lagrange multiplier☆10Aug 24, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- GAN: An example for generating Gaussian distribution by a simple generating adversarial network.☆12Dec 28, 2020Updated 5 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆31Jan 19, 2023Updated 3 years ago
- Implementation of the VIPER algorithm introduced in "Verifiable Reinforcement Learning via Policy Extraction" by Bastani et al.☆21Nov 9, 2025Updated 7 months ago
- Normaized X-Corr Model for person reidentification implementation in keras with tensorflow as backend.☆11Jan 17, 2018Updated 8 years ago
- Inexact Block Coordinate Descent Methods For Symmetric Nonnegative Matrix Factorization☆15Mar 1, 2017Updated 9 years ago
- Multi Agent Task sharing implementation using RRT algorithm. Implementation in MatLab☆12Oct 18, 2016Updated 9 years ago
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆302Feb 13, 2024Updated 2 years ago