A well-documented A2C written in PyTorch
☆52Jun 3, 2019Updated 6 years ago
Alternatives and similar repositories for pytorch-a2c
Users that are interested in pytorch-a2c are comparing it to the libraries listed below.
- PyTorch implementation of Advantage Actor-Critic (A2C)☆46Nov 25, 2017Updated 8 years ago
- General implementation of Advantage Actor-Critic using PyTorch☆28Dec 7, 2021Updated 4 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch☆21May 26, 2021Updated 4 years ago
- Simple change of A3C to A2C☆15Jun 18, 2017Updated 8 years ago
- Octax: Accelerated CHIP-8 Arcade Environments for JAX☆40Feb 18, 2026Updated last month
- Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.☆27May 11, 2021Updated 4 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Jul 26, 2019Updated 6 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆58Jan 22, 2021Updated 5 years ago
- Gym environment of simple microgrid simulation for Reinforcement Learning☆10Oct 12, 2022Updated 3 years ago
- Distributed Prioritized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.☆14Sep 12, 2023Updated 2 years ago
- Proximal Policy Optimization in PyTorch☆39Dec 10, 2017Updated 8 years ago
- A PyTorch implementation of spiking neural networks and backpropagation through spikes☆13Oct 3, 2024Updated last year
- A3C LSTM Atari with Pytorch plus A3G design☆568Apr 18, 2023Updated 2 years ago
- ☆14Jun 7, 2024Updated last year
- Centralized cooperative reinforcement learning☆13Jan 8, 2023Updated 3 years ago
- ☆133Jul 25, 2024Updated last year
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- Study to test if Volume leak index (VLI) is a marker of severity of illness in sepsis.☆14Sep 29, 2022Updated 3 years ago
- ☆17Jul 11, 2020Updated 5 years ago
- This repository contains the R code used analyse the eICU and MIMIC-III databases for the Sarkar et al paper "Performance of intensive ca…☆10Nov 27, 2020Updated 5 years ago
- RL Algorithms☆13Mar 19, 2023Updated 3 years ago
- Preprocessing and baseline for N-Omniglot. Details can be found at https://www.nature.com/articles/s41597-022-01851-z ☆19Apr 12, 2023Updated 2 years ago
- Implementation of Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks☆28Aug 24, 2023Updated 2 years ago
- Self-Supervised Attention-Aware Reinforcement Learning☆18May 20, 2022Updated 3 years ago
- ☆30Jun 4, 2022Updated 3 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago
- Computes the Chinese skew index (CBOE skew index)☆15May 23, 2021Updated 4 years ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".☆10Apr 30, 2023Updated 2 years ago
- Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.☆16Mar 28, 2020Updated 6 years ago
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022☆15Dec 15, 2022Updated 3 years ago
- Python bindings for the Rusty Object Notation.☆18May 28, 2023Updated 2 years ago
- Generator of Russian vehicle license plates☆16Dec 24, 2020Updated 5 years ago
- ☆22Oct 4, 2019Updated 6 years ago
- Pointer Networks Implementation in Keras☆11Aug 17, 2017Updated 8 years ago
- Asynchronous Advantage Actor-Critic using Generalized Advantage Estimation (PyTorch)☆10Oct 11, 2019Updated 6 years ago
- A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.☆18Feb 27, 2023Updated 3 years ago
- Predict selection targets of lasso selection using deep learning model.☆12Nov 1, 2019Updated 6 years ago
- Container with Xvfb installed as a Service☆19Feb 8, 2015Updated 11 years ago