A well-documented A2C written in PyTorch
☆53Jun 3, 2019Updated 7 years ago
Alternatives and similar repositories for pytorch-a2c
Users that are interested in pytorch-a2c are comparing it to the libraries listed below.
- PyTorch implementation of Advantage Actor-Critic (A2C)☆47Nov 25, 2017Updated 8 years ago
- Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning☆20Aug 12, 2021Updated 4 years ago
- General implementation of Advantage Actor Critic using Pytorch☆28Dec 7, 2021Updated 4 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch☆21May 26, 2021Updated 5 years ago
- An LLM-friendly framework for translating dynamical equations to gymnasium-compatible RL environments.☆33Mar 18, 2026Updated 2 months ago
- Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ…☆18Jun 25, 2019Updated 6 years ago
- Simple change of a3c to a2c☆15Jun 18, 2017Updated 8 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.☆27May 11, 2021Updated 5 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Oct 18, 2018Updated 7 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆58Jan 22, 2021Updated 5 years ago
- Distributed Prioritized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.☆14Sep 12, 2023Updated 2 years ago
- Reinforcement learning with VizDoom platform☆13Apr 18, 2022Updated 4 years ago
- Proximal Policy Optimization in PyTorch☆39Dec 10, 2017Updated 8 years ago
- A3C LSTM Atari with Pytorch plus A3G design☆567Apr 18, 2023Updated 3 years ago
- ☆14Jun 7, 2024Updated 2 years ago
- Specialization of BERT architecture both for the Spanish language and the Twitter domain☆13Nov 6, 2020Updated 5 years ago
- Centralized cooperative reinforcement learning☆13Jan 8, 2023Updated 3 years ago
- ☆136Jul 25, 2024Updated last year
- ☆14Feb 22, 2023Updated 3 years ago
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- Study to test if Volume leak index (VLI) is a marker of severity of illness in sepsis.☆14Sep 29, 2022Updated 3 years ago
- This repository contains the R code used to analyse the eICU and MIMIC-III databases for the Sarkar et al paper "Performance of intensive ca…☆10Nov 27, 2020Updated 5 years ago
- ☆29Mar 8, 2026Updated 3 months ago
- RL Algorithms☆13Mar 19, 2023Updated 3 years ago
- SiDeGame - Simplified Defusal Game☆13Apr 17, 2025Updated last year
- Implementation of Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks☆27Aug 24, 2023Updated 2 years ago
- Self-Supervised Attention-Aware Reinforcement Learning☆18May 20, 2022Updated 4 years ago
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- ☆30Jun 4, 2022Updated 4 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago
- Deep Integrated Perception framework for social service robots☆14Sep 6, 2017Updated 8 years ago
- Compute the China skew index (CBOE skew index)☆15May 23, 2021Updated 5 years ago
- Disentangling Factors of Variation by Mixing Them codes☆16Mar 13, 2019Updated 7 years ago
- Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.☆16Mar 28, 2020Updated 6 years ago
- Tutorial & scripts to run a meta-rl model on DeepMind Lab's Harlow task environment.☆15Mar 28, 2019Updated 7 years ago
- Python bindings for the Rusty Object Notation.☆18May 30, 2026Updated last week
- Generator of Russian vehicle license plates☆16Dec 24, 2020Updated 5 years ago
- ☆22Oct 4, 2019Updated 6 years ago