rgilman33 / baselines-A2CView external linksLinks
[DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation
☆53Feb 4, 2020Updated 6 years ago
Alternatives and similar repositories for baselines-A2C
Users that are interested in baselines-A2C are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Advantage Actor-Critic (A2C)☆47Nov 25, 2017Updated 8 years ago
- Single shot neural network pruning before training the model, based on connection sensitivity☆11Aug 7, 2019Updated 6 years ago
- Multiplayer Game in Unity3D☆11Dec 13, 2020Updated 5 years ago
- Gold Loss Correction for training neural networks with labels corrupted with severe noise☆13Aug 17, 2019Updated 6 years ago
- Reinforcement learning - Batched Impala - PyTorch - Mario Kart☆13Jul 21, 2020Updated 5 years ago
- A blockchain based land registry application.☆18Jan 5, 2023Updated 3 years ago
- PyTorch implementation of Proximal Policy Optimization☆53Dec 20, 2017Updated 8 years ago
- Simple change of a3c to a2c☆15Jun 18, 2017Updated 8 years ago
- This is an implimentation of Value Iteration Networks (NIPS2016 best paper) in keras☆17Jan 6, 2018Updated 8 years ago
- ☆17Jun 28, 2018Updated 7 years ago
- OpenAI Gym Environments for the Application of Reinforcement Learning in the Simulation of Wireless Networked Feedback Control Loops☆14Feb 5, 2021Updated 5 years ago
- Manifold-Mixup implementation for fastai V1☆19Oct 1, 2020Updated 5 years ago
- Implementation of PPO in Pytorch☆41Dec 6, 2017Updated 8 years ago
- My website☆18Nov 29, 2024Updated last year
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Oct 30, 2020Updated 5 years ago
- my udacity projects☆22Jan 22, 2017Updated 9 years ago
- Official Implementation of "Random Path Selection for Incremental Learning" paper. NeurIPS 2019☆53Dec 15, 2022Updated 3 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,314Sep 25, 2019Updated 6 years ago
- Multi-Target Embodied Question Answering☆26Jul 17, 2020Updated 5 years ago
- This repository contains code for my only numpy basic.☆25Aug 24, 2024Updated last year
- Training Deep AutoEncoders for Collaborative Filtering☆22Sep 9, 2019Updated 6 years ago
- ☆31Dec 28, 2018Updated 7 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning☆35Aug 1, 2017Updated 8 years ago
- ☆26Feb 15, 2023Updated 2 years ago
- A website to find online courses☆30Nov 6, 2019Updated 6 years ago
- Modular PyTorch implementation of policy gradient methods☆25Nov 15, 2018Updated 7 years ago
- ☆32Aug 1, 2018Updated 7 years ago
- General implementation of Advantage Actor Critic using Pytorch☆28Dec 7, 2021Updated 4 years ago
- A PyTorch Library for Reinforcement Learning Research☆198Jun 22, 2025Updated 7 months ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,875May 29, 2022Updated 3 years ago
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…☆1,273Feb 9, 2021Updated 5 years ago
- Hey 👋🏻☆11Jan 15, 2026Updated 3 weeks ago
- Awesome papers on Earth Observation (EO), Machine Learning (ML), and Causal Inference (CI) [Edward Elgar Publishing]☆11Jan 18, 2026Updated 3 weeks ago
- The Google Summer of Code (GSoC) page for RADAR-base☆10Feb 2, 2026Updated last week
- SciFin is a python package for Science & Finance.☆11Oct 25, 2020Updated 5 years ago
- Stripped Python images based on alpine variant of library's Python☆10Jan 20, 2022Updated 4 years ago
- PCL grabber for DepthSense devices☆11Sep 26, 2015Updated 10 years ago
- ☆10Dec 16, 2023Updated 2 years ago
- A research notes about how to get benefits from Cython to be asynchronous beyond IO tasks☆11Feb 17, 2020Updated 5 years ago