A well-documented A2C written in PyTorch
☆53Jun 3, 2019Updated 7 years ago
Alternatives and similar repositories for pytorch-a2c
Users that are interested in pytorch-a2c are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning☆20Aug 12, 2021Updated 4 years ago
- General implementation of Advantage Actor Critic using Pytorch☆28Dec 7, 2021Updated 4 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch☆21May 26, 2021Updated 5 years ago
- ☆16May 4, 2021Updated 5 years ago
- Pytorch implementation of [Feudal Net](https://arxiv.org/abs/1703.01161). ([Tensorflow version](https://github.com/dmakian/feudal_networ…☆18Jun 25, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Simple change of a3c to a2c☆15Jun 18, 2017Updated 9 years ago
- Octax: Accelerated CHIP-8 Arcade Environments for JAX☆56Apr 20, 2026Updated 2 months ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Jul 26, 2019Updated 6 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.☆27May 11, 2021Updated 5 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Oct 18, 2018Updated 7 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆58Jan 22, 2021Updated 5 years ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- Proximal Policy Optimization in PyTorch☆39Dec 10, 2017Updated 8 years ago
- A pytorch implementation of spiking neural networks and backpropagation through spikes☆13Oct 3, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2023] Learning Geometry-aware Representations by Sketching☆15Dec 13, 2024Updated last year
- Centralized cooperative reinforcement learning☆13Jan 8, 2023Updated 3 years ago
- ☆136Jul 25, 2024Updated last year
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- ☆17Jul 11, 2020Updated 5 years ago
- SiDeGame - Simplified Defusal Game☆12Apr 17, 2025Updated last year
- Self-Supervised Attention-Aware Reinforcement Learning☆18May 20, 2022Updated 4 years ago
- ☆30Jun 4, 2022Updated 4 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 计算中国偏度指数(CBOE skew index)☆15May 23, 2021Updated 5 years ago
- Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.☆16Mar 28, 2020Updated 6 years ago
- Official implementation of "Cross-Domain Transfer via Semantic Skill Imitation", Pertsch et al., CoRL 2022☆15Dec 15, 2022Updated 3 years ago
- Tutorial & scripts to run a meta-rl model on DeepMind Lab's Harlow task environment.☆15Mar 28, 2019Updated 7 years ago
- Caclualtes Bitcoin Dominance using Coingecko's API and writes it to a CSV along with the date☆10May 12, 2021Updated 5 years ago
- An AI that plays google chrome's dinosaur game☆11Aug 20, 2020Updated 5 years ago
- Генератор российских автомобильных номеров☆16Dec 24, 2020Updated 5 years ago
- Active Learning in the era of Foundation Models☆13Apr 16, 2025Updated last year
- MultiLabel classification of cow diseases by text and symptoms recognition (NER)☆12Aug 13, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Asynchronous Advantage Actor-Critic using Generalized Advantage Estimation (PyTorch)☆10Oct 11, 2019Updated 6 years ago
- A PyTorch implementation of "Generating Sentences from a Continuous Space"☆13Feb 22, 2018Updated 8 years ago
- A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.☆19Feb 27, 2023Updated 3 years ago
- LuaJIT and luarocks in one location☆16Aug 10, 2015Updated 10 years ago
- Code for the paper "Continual Model-Based Reinforcement Learning with Hypernetworks"☆15Jul 28, 2021Updated 4 years ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆110Jan 23, 2022Updated 4 years ago
- ATS for NeurIPS 2021☆24Nov 4, 2021Updated 4 years ago