YunjiaXi / Implementation-and-Some-Modification-about-DQN-and-SACView external linksLinks
In this project, I explore some typical value-based and policy-based RL algorithms. I do experiments on DQN and its six variants and their combination in Atari environments Pong and Boxing. I also do some experiments on SAC with DDPG as baseline on three MuJoCo environments Hopper-v2, Ant-v2, and HalfCheetah-v2.
☆12Nov 18, 2020Updated 5 years ago
Alternatives and similar repositories for Implementation-and-Some-Modification-about-DQN-and-SAC
Users that are interested in Implementation-and-Some-Modification-about-DQN-and-SAC are comparing it to the libraries listed below
Sorting:
- Implement DQN and DDQN algorithm on Atari games,such as BreakoutNoFrameskip-v4, PongNoFrameskip-v4,BoxingNoFrameskip-v4.☆15Jun 30, 2020Updated 5 years ago
- Code for ICML 2025: SAH-Drive: A Scenario-Aware Hybrid Planner for Closed-Loop Vehicle Trajectory Generation☆17Jun 21, 2025Updated 7 months ago
- Implementation of the attention-sum reader using tensorflow and keras.☆11Aug 1, 2017Updated 8 years ago
- Authors' implementation of PEER☆11Jul 13, 2023Updated 2 years ago
- PyTorch implementation of PtrNet to solve sorting problem.☆12Dec 19, 2017Updated 8 years ago
- Some useful Blender scripts☆13Jan 15, 2025Updated last year
- ☆12Jul 15, 2020Updated 5 years ago
- Compiler and code generator for a dialect of Abstract Syntax Description Language☆13Oct 13, 2018Updated 7 years ago
- algorithms for solving the Children's Book Test (CBT)☆10Jun 8, 2016Updated 9 years ago
- ☆24May 4, 2025Updated 9 months ago
- use DQN(pytorch) to play pong☆12May 30, 2021Updated 4 years ago
- Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.☆19Oct 28, 2025Updated 3 months ago
- Certifying Geometric Robustness of Neural Networks☆16Mar 24, 2023Updated 2 years ago
- TIGRIS: An Informed Sampling-based Informative Path Planner☆22Oct 11, 2022Updated 3 years ago
- Play Atari(Breakout) Game by DRL - DQN, Noisy DQN and A3C☆15May 30, 2020Updated 5 years ago
- ☆23Apr 7, 2021Updated 4 years ago
- multi-task learning for routing problem☆22Dec 2, 2025Updated 2 months ago
- ☆22Jan 19, 2023Updated 3 years ago
- Device-Free Gesture Tracking Using Acoustic Signals☆22Dec 30, 2017Updated 8 years ago
- A Multiplicative Value Function for Safe and Efficient Reinforcement Learning. IROS 2023.☆25Sep 24, 2023Updated 2 years ago
- ☆20Oct 29, 2018Updated 7 years ago
- 自然语言理解 基准测试 数据集 | Benchmark datasets for Natural Language Understanding (NLU)☆20Nov 26, 2018Updated 7 years ago
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me…☆24Aug 19, 2023Updated 2 years ago
- Interface for reading the Paraphrase Database (PPDB)☆24Mar 14, 2018Updated 7 years ago
- Source code and data for ACL 2019 Long Paper ``Semantic Parsing with Dual Learning".☆23Feb 21, 2021Updated 4 years ago
- Code for RRL (https://sites.google.com/view/abstractions4rl)☆27Jan 21, 2022Updated 4 years ago
- Code for the paper entitled "Towards Driving-Oriented Metric for Lane Detection Models" (CVPR 2022)☆25Mar 19, 2022Updated 3 years ago
- The official repository for NeurIPS 2024 Oral <Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Model…☆31Mar 20, 2025Updated 10 months ago
- Tool-use Robotic Benchmark built with Drake Simulation☆29Jul 9, 2024Updated last year
- ☆26Mar 12, 2015Updated 10 years ago
- Reading List☆35Jul 16, 2023Updated 2 years ago
- Iterative Alternating Neural Attention for Machine Reading in Tensorflow☆29May 31, 2017Updated 8 years ago
- ☆33Jun 16, 2023Updated 2 years ago
- ☆31Sep 4, 2021Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- [AAAI 2025] Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving☆49May 16, 2025Updated 9 months ago
- ☆44Aug 28, 2024Updated last year
- ☆48Apr 24, 2022Updated 3 years ago
- ☆48Jun 13, 2025Updated 8 months ago