Actor Critic model to play Cartpole game
☆53Aug 4, 2018Updated 7 years ago
Alternatives and similar repositories for Actor-Critic-pytorch
Users that are interested in Actor-Critic-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- gym-auv repository upgraded to Stable-Baselines 3☆12Aug 24, 2023Updated 2 years ago
- 原稿用紙;原稿紙;稿紙;日式便箋;UPTEX/UPLATEX 縱書☆10Nov 27, 2019Updated 6 years ago
- Actor-Critic and openAI clipped PPO in gym cartpole-v0 and pendulum-v0 environment☆27Aug 2, 2020Updated 5 years ago
- Accepted by AROB 2021. A car-agent navigates in complex traffic conditions by Mixed_Input_PPO_CNN_LSTM model.☆14May 22, 2021Updated 4 years ago
- ☆10Nov 27, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code that trains cancer soft-robot networks☆17Oct 11, 2016Updated 9 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement☆10Feb 19, 2024Updated 2 years ago
- Faithful Python implementation of the paper "Towards Deep Symbolic Reinforcement Learning" by Garnelo et al.☆13Mar 23, 2021Updated 5 years ago
- ☆22Dec 3, 2025Updated 4 months ago
- Accompanying code for our NeurIPS 2019 paper☆11Nov 7, 2019Updated 6 years ago
- ☆16Oct 25, 2023Updated 2 years ago
- A toy example of Policy Gradient implemented in Pytorch☆95Jan 24, 2018Updated 8 years ago
- Simple model for sentence compression (a.k.a Baseline in Klerke et al., NAACL 2016)☆10Dec 16, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.☆422Mar 17, 2021Updated 5 years ago
- Bidirectionally-Coordinated Net Implements with PyTorch 1.0☆15Apr 10, 2019Updated 7 years ago
- [ICANN 2022] ''An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection'' Official Code☆10Feb 27, 2024Updated 2 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- ☆24Feb 22, 2023Updated 3 years ago
- Sim2Real Transfer for Deep Reinforcement Learning with Stochastic State Transition Delays, CORL-2020.☆26Jun 3, 2021Updated 4 years ago
- Feedback Linearization Controller for Autonomous Underwater Vehicle☆18Jun 3, 2021Updated 4 years ago
- simple keras implement for 《Memory Fusion Network for Multi-view Sequential Learning》☆14Apr 9, 2021Updated 5 years ago
- ☆13Sep 8, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Collision Avoidance simulator for USV using Deep RL. A result of TTK4550 Fordypningsoppgave at NTNU☆21Mar 21, 2024Updated 2 years ago
- [ICLR 2025] "Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning"☆13Nov 30, 2025Updated 4 months ago
- ☆10Jul 20, 2023Updated 2 years ago
- ☆15Feb 8, 2023Updated 3 years ago
- ☆10May 13, 2019Updated 6 years ago
- GNN implementations with PyG☆18Dec 5, 2024Updated last year
- pix2pix and Cycle GAN architectures for image style transfer☆13May 27, 2021Updated 4 years ago
- 主要利用QLearning,DQN,ImprovedDQN(Ddouble DQN) 解决gym框架下的三个问题CartPole-v0,MountainCar-v0,Acrobot-v1☆14Jan 14, 2018Updated 8 years ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆12Nov 23, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- (6DOF) Six Degrees Of Freedom simulation of an AUV (Autonomous Underwater Vehicle) with rate feedback PID controllers for pitch and yaw☆22Jul 28, 2025Updated 8 months ago
- Towards causal inference for spatio-temporal data: conflict and forest loss in Colombia☆25Jan 11, 2022Updated 4 years ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 10 months ago
- ☆17Aug 2, 2024Updated last year
- PhoneGap NFC peer to peer demo☆22Jan 6, 2017Updated 9 years ago
- annotated hateful speech☆24Apr 6, 2019Updated 7 years ago
- SaTML'23 paper "Backdoor Attacks on Time Series: A Generative Approach" by Yujing Jiang, Xingjun Ma, Sarah Monazam Erfani, and James Bail…☆21Feb 5, 2023Updated 3 years ago