In this project, I explore several typical value-based and policy-based RL algorithms. I run experiments with DQN, six of its variants, and their combination on the Atari environments Pong and Boxing. I also run experiments with SAC, using DDPG as a baseline, on three MuJoCo environments: Hopper-v2, Ant-v2, and HalfCheetah-v2.
☆12Nov 18, 2020Updated 5 years ago
Alternatives and similar repositories for Implementation-and-Some-Modification-about-DQN-and-SAC
Users that are interested in Implementation-and-Some-Modification-about-DQN-and-SAC are comparing it to the libraries listed below
- Implements the DQN and DDQN algorithms on Atari games such as BreakoutNoFrameskip-v4, PongNoFrameskip-v4, and BoxingNoFrameskip-v4.☆15Jun 30, 2020Updated 5 years ago
- Super Mario Bros. (NES) gameplay dataset for machine learning.☆13Jul 22, 2025Updated 7 months ago
- Code for ICML 2025: SAH-Drive: A Scenario-Aware Hybrid Planner for Closed-Loop Vehicle Trajectory Generation☆17Jun 21, 2025Updated 8 months ago
- Implementation of the attention-sum reader using tensorflow and keras.☆11Aug 1, 2017Updated 8 years ago
- Authors' implementation of PEER☆11Jul 13, 2023Updated 2 years ago
- PyTorch implementation of PtrNet to solve sorting problem.☆12Dec 19, 2017Updated 8 years ago
- Compiler and code generator for a dialect of Abstract Syntax Description Language☆13Oct 13, 2018Updated 7 years ago
- Implement common statistical machine learning algorithms with raw Numpy.☆16Jun 30, 2020Updated 5 years ago
- PyTorch implementation of GNN models☆23Jul 11, 2025Updated 7 months ago
- Algorithms for Uni-Modal Inverse Reinforcement Learning☆22Sep 23, 2022Updated 3 years ago
- Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.☆19Oct 28, 2025Updated 4 months ago
- HGCN2SP: Hierarchical Graph Convolutional Network for Two-Stage Stochastic Programming☆22Mar 10, 2025Updated 11 months ago
- Solving the OpenAI Gym (MountainCarContinuous-v0) with DDPG☆21Jan 23, 2023Updated 3 years ago
- Play the Atari game Breakout with DRL: DQN, Noisy DQN, and A3C☆15May 30, 2020Updated 5 years ago
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"☆20Oct 5, 2020Updated 5 years ago
- multi-task learning for routing problem☆23Dec 2, 2025Updated 3 months ago
- Unsupervised Meta Learning for Image Classification (UMTRA) algorithm☆21Dec 15, 2022Updated 3 years ago
- A Multiplicative Value Function for Safe and Efficient Reinforcement Learning. IROS 2023.☆25Sep 24, 2023Updated 2 years ago
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me…☆24Aug 19, 2023Updated 2 years ago
- Interface for reading the Paraphrase Database (PPDB)☆24Mar 14, 2018Updated 7 years ago
- Code for RRL (https://sites.google.com/view/abstractions4rl)☆27Jan 21, 2022Updated 4 years ago
- Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆28Dec 6, 2020Updated 5 years ago
- Tool-use Robotic Benchmark built with Drake Simulation☆29Jul 9, 2024Updated last year
- Reading List☆35Jul 16, 2023Updated 2 years ago
- Iterative Alternating Neural Attention for Machine Reading in Tensorflow☆29May 31, 2017Updated 8 years ago
- Dense-Resolution Network for Point Cloud Classification and Segmentation (WACV 2021)☆35May 5, 2021Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- A super simple Python wrapper for the constrained traveling salesman and vehicle routing problem solver LKH-3.☆40Jan 13, 2026Updated last month