In this project, I explore some typical value-based and policy-based RL algorithms. I do experiments on DQN and its six variants and their combination in Atari environments Pong and Boxing. I also do some experiments on SAC with DDPG as baseline on three MuJoCo environments Hopper-v2, Ant-v2, and HalfCheetah-v2.
☆12Nov 18, 2020Updated 5 years ago
Alternatives and similar repositories for Implementation-and-Some-Modification-about-DQN-and-SAC
Users that are interested in Implementation-and-Some-Modification-about-DQN-and-SAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆58Jun 30, 2020Updated 5 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment☆15Jul 1, 2018Updated 7 years ago
- Super Mario Bros. (NES) gameplay dataset for machine learning.☆14Jul 22, 2025Updated 11 months ago
- PyTorch implementation of PtrNet to solve sorting problem.☆12Dec 19, 2017Updated 8 years ago
- use DQN(pytorch) to play pong☆12May 30, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Authors' implementation of PEER☆11Jul 13, 2023Updated 2 years ago
- Implementation of the attention-sum reader using tensorflow and keras.☆11Aug 1, 2017Updated 8 years ago
- 清华大学电子系小学期 MATLAB 大作业☆10Sep 18, 2021Updated 4 years ago
- ☆12Jul 15, 2020Updated 5 years ago
- ☆15Oct 21, 2025Updated 8 months ago
- PyTorch implementation of GNN models☆23Jul 11, 2025Updated 11 months ago
- This repository contains a gym environment that can be used for developing solvers for robotic 3D bin packing problems.☆23Dec 5, 2025Updated 6 months ago
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"☆20Oct 5, 2020Updated 5 years ago
- TIGRIS: An Informed Sampling-based Informative Path Planner☆23Oct 11, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Size-Invariant Graph Representations for Graph Classification Extrapolations (ICML 2021 Long Talk)☆22Jan 26, 2023Updated 3 years ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆23Apr 16, 2022Updated 4 years ago
- multi-task learning for routing problem☆24Dec 2, 2025Updated 6 months ago
- ☆13Aug 22, 2024Updated last year
- ☆22May 4, 2025Updated last year
- HGCN2SP: Hierarchical Graph Convolutional Network for Two-Stage Stochastic Programming☆21Mar 10, 2025Updated last year
- Play Atari(Breakout) Game by DRL - DQN, Noisy DQN and A3C☆15May 30, 2020Updated 6 years ago
- Unsupervised Meta Learning for Image Classification (UMTRA) algorithm☆21Dec 15, 2022Updated 3 years ago
- Certifying Geometric Robustness of Neural Networks☆16Mar 24, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- PPO, DDPG, SAC implementation on mujoco environment☆126Feb 16, 2022Updated 4 years ago
- 自然语言理解 基准测试 数据集 | Benchmark datasets for Natural Language Understanding (NLU)☆20Nov 26, 2018Updated 7 years ago
- Source Code for Paper "DAGAD: Data Augmentation for Graph Anomaly Detection" ICDM 2022☆22Mar 14, 2023Updated 3 years ago
- Algorithms for Uni-Modal Inverse Reinforcement Learning☆22Sep 23, 2022Updated 3 years ago
- Reading List☆35Jul 16, 2023Updated 2 years ago
- Deep Reinforcement Learning with Double Q-learning☆14Nov 17, 2020Updated 5 years ago
- ☆25Aug 25, 2021Updated 4 years ago
- Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆27Dec 6, 2020Updated 5 years ago
- Code for the paper entitled "Towards Driving-Oriented Metric for Lane Detection Models" (CVPR 2022)☆25Mar 19, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Apr 7, 2021Updated 5 years ago
- Notes from the book 'The Elements of Statistical Learning'☆21Jun 20, 2025Updated last year
- ☆29Oct 2, 2023Updated 2 years ago
- ☆24Jan 19, 2023Updated 3 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- ☆17Mar 31, 2022Updated 4 years ago
- ☆33Jun 16, 2023Updated 3 years ago