General implementation of Advantage Actor-Critic (A2C) using PyTorch
☆28 · Dec 7, 2021 · Updated 4 years ago
Alternatives and similar repositories for PyTorch-A2C
Users that are interested in PyTorch-A2C are comparing it to the libraries listed below
- Third-place submission to the NeurIPS MineRL competition 2019 ☆10 · Mar 24, 2023 · Updated 2 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C) ☆47 · Nov 25, 2017 · Updated 8 years ago
- A first bare-bones parallelized implementation of Go-Explore as described by the Uber Engineering blog post ☆46 · Jan 25, 2019 · Updated 7 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms ☆22 · Dec 29, 2024 · Updated last year
- A well-documented A2C written in PyTorch ☆52 · Jun 3, 2019 · Updated 6 years ago
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper ☆17 · Jan 3, 2022 · Updated 4 years ago
- ☆13 · Jun 7, 2024 · Updated last year
- OpenAI Gym environment for graph search problems such as shortest path. ☆11 · Dec 24, 2019 · Updated 6 years ago
- Centralized cooperative reinforcement learning ☆13 · Jan 8, 2023 · Updated 3 years ago
- Study to test whether the volume leak index (VLI) is a marker of severity of illness in sepsis. ☆14 · Sep 29, 2022 · Updated 3 years ago
- PyTorch implementation of both discrete and continuous ACER ☆25 · Jan 27, 2019 · Updated 7 years ago
- This repository contains the R code used to analyse the eICU and MIMIC-III databases for the Sarkar et al. paper "Performance of intensive ca… ☆10 · Nov 27, 2020 · Updated 5 years ago
- ☆10 · Dec 10, 2021 · Updated 4 years ago
- Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation. ☆16 · Mar 28, 2020 · Updated 5 years ago
- MAP-Elites based on Evolution Strategies ☆33 · Feb 11, 2022 · Updated 4 years ago
- Harry Potter Deep Learning Experiment ☆10 · Mar 25, 2023 · Updated 2 years ago
- Environments with IC3Net paper ☆15 · Jan 8, 2019 · Updated 7 years ago
- This is a PyTorch implementation of our AAAI paper for learned image transmission with HVAE ☆11 · Mar 2, 2026 · Updated 2 weeks ago
- Distributed implementation of popular evolutionary methods ☆64 · Dec 26, 2017 · Updated 8 years ago
- ☆15 · May 20, 2025 · Updated 10 months ago
- Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation" ☆10 · Mar 27, 2018 · Updated 7 years ago
- Fast asynchronous GPU monitoring tool across multiple machines through SSH ☆11 · Nov 26, 2024 · Updated last year
- A2C is a special case of PPO! ☆22 · May 20, 2022 · Updated 3 years ago
- ☆29 · Apr 16, 2021 · Updated 4 years ago
- Paddle CIFAR-100 training ☆14 · May 28, 2021 · Updated 4 years ago
- Code companion to the paper "Multi-task Learning for Aggregated Data using Gaussian Processes" ☆10 · Apr 6, 2020 · Updated 5 years ago
- ☆11 · Sep 29, 2021 · Updated 4 years ago
- Solving the Stable Marriage/Matching Problem with the Gale–Shapley algorithm ☆13 · Jul 14, 2019 · Updated 6 years ago
- ☆30 · Jan 17, 2022 · Updated 4 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards ☆32 · Jan 19, 2023 · Updated 3 years ago
- Implementation of the VIPER algorithm introduced in "Verifiable Reinforcement Learning via Policy Extraction" by Bastani et al. ☆20 · Nov 9, 2025 · Updated 4 months ago
- ☆25 · Sep 2, 2022 · Updated 3 years ago
- A template engine for LLM prompts with support for writing prompts with prompts ☆23 · Mar 31, 2025 · Updated 11 months ago
- Multi-agent task sharing implementation using the RRT algorithm, implemented in MATLAB ☆12 · Oct 18, 2016 · Updated 9 years ago
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping ☆301 · Feb 13, 2024 · Updated 2 years ago
- Python and MATLAB code ☆13 · Jan 30, 2022 · Updated 4 years ago
- Exploration by Random Network Distillation ☆15 · Dec 30, 2018 · Updated 7 years ago
- ☆17 · Mar 22, 2024 · Updated last year
- Accelerated replay buffers in JAX ☆46 · Sep 17, 2022 · Updated 3 years ago