In this project, I explore some typical value-based and policy-based RL algorithms. I do experiments on DQN and its six variants and their combination in Atari environments Pong and Boxing. I also do some experiments on SAC with DDPG as baseline on three MuJoCo environments Hopper-v2, Ant-v2, and HalfCheetah-v2.
☆12Nov 18, 2020Updated 5 years ago
Alternatives and similar repositories for Implementation-and-Some-Modification-about-DQN-and-SAC
Users that are interested in Implementation-and-Some-Modification-about-DQN-and-SAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆57Jun 30, 2020Updated 5 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment☆16Jul 1, 2018Updated 7 years ago
- Adaptive PID neural network controller implementation in Python☆16Dec 26, 2021Updated 4 years ago
- Super Mario Bros. (NES) gameplay dataset for machine learning.☆12Jul 22, 2025Updated 8 months ago
- Some useful Blender scripts☆13Jan 15, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- PyTorch implementation of PtrNet to solve sorting problem.☆12Dec 19, 2017Updated 8 years ago
- Implementation of the attention-sum reader using tensorflow and keras.☆11Aug 1, 2017Updated 8 years ago
- Compiler and code generator for a dialect of Abstract Syntax Description Language☆13Oct 13, 2018Updated 7 years ago
- Implement the inverse kinematics of a UR5 employing the Poduct of Exponentials approach and control the UR5 in coppeliaSim using Python.☆22May 30, 2020Updated 5 years ago
- Mobile manipulator project demonstrating inverse kinematics and path following capabilities☆24Jan 30, 2022Updated 4 years ago
- Code for ICML 2025: SAH-Drive: A Scenario-Aware Hybrid Planner for Closed-Loop Vehicle Trajectory Generation☆17Jun 21, 2025Updated 9 months ago
- ☆12Jul 15, 2020Updated 5 years ago
- Code for RRL (https://sites.google.com/view/abstractions4rl)☆27Jan 21, 2022Updated 4 years ago
- multi-task learning for routing problem☆24Dec 2, 2025Updated 3 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A simple 2D classical random walk and quantum walk simulation☆27Jun 15, 2016Updated 9 years ago
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me…☆24Aug 19, 2023Updated 2 years ago
- ☆12Aug 22, 2024Updated last year
- ☆23May 4, 2025Updated 10 months ago
- ☆30May 10, 2023Updated 2 years ago
- Play Atari(Breakout) Game by DRL - DQN, Noisy DQN and A3C☆15May 30, 2020Updated 5 years ago
- Certifying Geometric Robustness of Neural Networks☆16Mar 24, 2023Updated 3 years ago
- ☆26Nov 16, 2018Updated 7 years ago
- ☆20Oct 29, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Tool-use Robotic Benchmark built with Drake Simulation☆29Jul 9, 2024Updated last year
- Source code and data for ACL 2019 Long Paper ``Semantic Parsing with Dual Learning".☆23Feb 21, 2021Updated 5 years ago
- Reading List☆35Jul 16, 2023Updated 2 years ago
- Deep Reinforcement Learning with Double Q-learning☆14Nov 17, 2020Updated 5 years ago
- Code for the paper entitled "Towards Driving-Oriented Metric for Lane Detection Models" (CVPR 2022)☆25Mar 19, 2022Updated 4 years ago
- ☆24Apr 7, 2021Updated 4 years ago
- ☆32Mar 22, 2026Updated last week
- Notes from the book 'The Elements of Statistical Learning'☆21Jun 20, 2025Updated 9 months ago
- ☆29Oct 2, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- ☆41Jun 19, 2024Updated last year
- ☆33Jun 16, 2023Updated 2 years ago
- PyTorch Extension Library of Optimized Unique Operation☆37Mar 1, 2019Updated 7 years ago
- A Multiplicative Value Function for Safe and Efficient Reinforcement Learning. IROS 2023.☆25Sep 24, 2023Updated 2 years ago
- Web annotation tool for point clouds☆45May 6, 2023Updated 2 years ago
- GPU cluster kubernetes configurations and usages☆34Nov 24, 2021Updated 4 years ago