In this project, I explore some typical value-based and policy-based RL algorithms. I run experiments on DQN, six of its variants, and their combination in the Atari environments Pong and Boxing. I also run experiments on SAC, with DDPG as the baseline, on three MuJoCo environments: Hopper-v2, Ant-v2, and HalfCheetah-v2.
☆12 · Nov 18, 2020 · Updated 5 years ago
Alternatives and similar repositories for Implementation-and-Some-Modification-about-DQN-and-SAC
Users that are interested in Implementation-and-Some-Modification-about-DQN-and-SAC are comparing it to the libraries listed below.
- Implement the PPO algorithm on MuJoCo environments, such as Ant-v2, Humanoid-v2, Hopper-v2, HalfCheetah-v2. ☆59 · Jun 30, 2020 · Updated 5 years ago
- Implement the DQN and DDQN algorithms on Atari games, such as BreakoutNoFrameskip-v4, PongNoFrameskip-v4, BoxingNoFrameskip-v4. ☆15 · Jun 30, 2020 · Updated 5 years ago
- Control the UR5 robot with proportional-derivative and sliding mode control methods. The two control methods modify their control gain… ☆10 · Mar 17, 2022 · Updated 4 years ago
- Adaptive PID neural network controller implementation in Python ☆16 · Dec 26, 2021 · Updated 4 years ago
- Super Mario Bros. (NES) gameplay dataset for machine learning. ☆14 · Jul 22, 2025 · Updated 10 months ago
- Authors' implementation of PEER ☆11 · Jul 13, 2023 · Updated 2 years ago
- PyTorch implementation of large network design in continuous control RL. ☆19 · Jan 5, 2022 · Updated 4 years ago
- Implementation of the attention-sum reader using TensorFlow and Keras. ☆11 · Aug 1, 2017 · Updated 8 years ago
- Compiler and code generator for a dialect of Abstract Syntax Description Language ☆13 · Oct 13, 2018 · Updated 7 years ago
- Implement the inverse kinematics of a UR5 employing the Product of Exponentials approach and control the UR5 in CoppeliaSim using Python. ☆23 · May 30, 2020 · Updated 6 years ago
- Mobile manipulator project demonstrating inverse kinematics and path following capabilities ☆25 · Jan 30, 2022 · Updated 4 years ago
- Code for ICML 2025: SAH-Drive: A Scenario-Aware Hybrid Planner for Closed-Loop Vehicle Trajectory Generation ☆19 · Jun 21, 2025 · Updated 11 months ago
- Algorithms for solving the Children's Book Test (CBT) ☆10 · Jun 8, 2016 · Updated 10 years ago
- This is a code reproduction for the paper titled "N-BaIoT—Network-Based Detection of IoT Botnet Attacks Using Deep Autoencoders" ☆14 · Apr 22, 2021 · Updated 5 years ago
- ☆15 · Oct 21, 2025 · Updated 7 months ago
- TIGRIS: An Informed Sampling-based Informative Path Planner ☆23 · Oct 11, 2022 · Updated 3 years ago
- Code for RRL (https://sites.google.com/view/abstractions4rl) ☆27 · Jan 21, 2022 · Updated 4 years ago
- Size-Invariant Graph Representations for Graph Classification Extrapolations (ICML 2021 Long Talk) ☆22 · Jan 26, 2023 · Updated 3 years ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change. ☆23 · Apr 16, 2022 · Updated 4 years ago
- ☆19 · Sep 17, 2022 · Updated 3 years ago
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me… ☆24 · Aug 19, 2023 · Updated 2 years ago
- Multi-task learning for routing problems ☆24 · Dec 2, 2025 · Updated 6 months ago
- ☆13 · Aug 22, 2024 · Updated last year
- ☆22 · May 4, 2025 · Updated last year
- HGCN2SP: Hierarchical Graph Convolutional Network for Two-Stage Stochastic Programming ☆23 · Mar 10, 2025 · Updated last year
- A simple 2D classical random walk and quantum walk simulation ☆26 · Jun 15, 2016 · Updated 9 years ago
- ☆29 · May 10, 2023 · Updated 3 years ago
- Certifying Geometric Robustness of Neural Networks ☆16 · Mar 24, 2023 · Updated 3 years ago
- PPO, DDPG, and SAC implementations on MuJoCo environments ☆124 · Feb 16, 2022 · Updated 4 years ago
- ☆26 · Nov 16, 2018 · Updated 7 years ago
- Source Code for Paper "DAGAD: Data Augmentation for Graph Anomaly Detection" ICDM 2022 ☆22 · Mar 14, 2023 · Updated 3 years ago
- Algorithms for Uni-Modal Inverse Reinforcement Learning ☆22 · Sep 23, 2022 · Updated 3 years ago
- PyTorch implementation of DQN, DDQN and Dueling DQN to solve Atari games including PongNoFrameskip-v4, BreakoutNoFrameskip-v4 and BoxingN… ☆18 · Jun 1, 2023 · Updated 3 years ago
- Tool-use Robotic Benchmark built with Drake Simulation ☆29 · Jul 9, 2024 · Updated last year
- Reading List ☆35 · Jul 16, 2023 · Updated 2 years ago
- Deep Reinforcement Learning with Double Q-learning ☆14 · Nov 17, 2020 · Updated 5 years ago
- ☆25 · Aug 25, 2021 · Updated 4 years ago
- A notebook walking through how to use Keras RL to solve Atari environments. ☆24 · Dec 30, 2020 · Updated 5 years ago
- PyTorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control" ☆27 · Dec 6, 2020 · Updated 5 years ago