Actor Critic model to play Cartpole game
☆53Aug 4, 2018Updated 7 years ago
Alternatives and similar repositories for Actor-Critic-pytorch
Users that are interested in Actor-Critic-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- advantage actor-critic reinforcement learning for openai gym cartpole☆66Jul 13, 2017Updated 8 years ago
- Minimal A2C/A3C example of an LSTM-based meta-learner.☆13Feb 2, 2021Updated 5 years ago
- Gym Environment for AUV docking procedure☆11Sep 20, 2022Updated 3 years ago
- Unscented estimation and adaptive control package☆13Jun 24, 2017Updated 8 years ago
- Actor-Critic and openAI clipped PPO in gym cartpole-v0 and pendulum-v0 environment☆27Aug 2, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Accepted by AROB 2021. A car-agent navigates in complex traffic conditions by Mixed_Input_PPO_CNN_LSTM model.☆14May 22, 2021Updated 4 years ago
- ☆10Nov 27, 2019Updated 6 years ago
- Code that trains cancer soft-robot networks☆17Oct 11, 2016Updated 9 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- ☆10Apr 18, 2017Updated 8 years ago
- Learning Transferable Features with Deep Adaptation Networks☆12Jul 18, 2023Updated 2 years ago
- A toy example of Policy Gradient implemented in Pytorch☆95Jan 24, 2018Updated 8 years ago
- Simple model for sentence compression (a.k.a Baseline in Klerke et al., NAACL 2016)☆10Dec 16, 2018Updated 7 years ago
- PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.☆421Mar 17, 2021Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Codebase - Comparing DRL algorithms' ability to safely navigate challenging waters☆16Aug 18, 2021Updated 4 years ago
- [ICANN 2022] ''An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection'' Official Code☆10Feb 27, 2024Updated 2 years ago
- ☆16Feb 24, 2023Updated 3 years ago
- Sim2Real Transfer for Deep Reinforcement Learning with Stochastic State Transition Delays, CORL-2020.☆26Jun 3, 2021Updated 4 years ago
- Text Summarizer implementation with Tensorflow 2.0 using conditional GAN☆14May 13, 2019Updated 6 years ago
- simple keras implement for 《Memory Fusion Network for Multi-view Sequential Learning》☆14Apr 9, 2021Updated 4 years ago
- [ICLR 2025] "Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning"☆13Nov 30, 2025Updated 3 months ago
- ☆10Jul 20, 2023Updated 2 years ago
- ☆15Feb 8, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- This is the paddle code for SeBoW(Self-Born wiring for neural trees), a kind of neural tree born form a large search space☆11Dec 10, 2021Updated 4 years ago
- ☆10May 13, 2019Updated 6 years ago
- ☆10Dec 21, 2024Updated last year
- ☆21Mar 20, 2019Updated 7 years ago
- path planners for underwater autonomous vehicles (AUVs)☆24Feb 4, 2021Updated 5 years ago
- pix2pix and Cycle GAN architectures for image style transfer☆13May 27, 2021Updated 4 years ago
- Code for Colangelo and Lee (2025)☆15Feb 3, 2025Updated last year
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆12Nov 23, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An implementation of AlphaZero, trained to master Tic-Tac-Toe and Four in a row☆26Dec 8, 2022Updated 3 years ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 9 months ago
- ☆28Feb 8, 2026Updated last month
- ☆15Nov 19, 2021Updated 4 years ago
- A set of algorithms and environments to train SafeRL agents, written in TensorFlow2 and OpenAI Gym.☆12Jul 26, 2022Updated 3 years ago
- code for sentence compression☆20Mar 3, 2018Updated 8 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆47Nov 25, 2017Updated 8 years ago