Dueling DQN Pytorch
☆14Dec 13, 2021Updated 4 years ago
Alternatives and similar repositories for Dueling_DQN
Users who are interested in Dueling_DQN are comparing it to the libraries listed below.
- Double DQN Pytorch☆20Dec 13, 2021Updated 4 years ago
- TD3 in Pytorch☆36Jan 17, 2022Updated 4 years ago
- This is a repository of implementations of DQN and its variants in PyTorch, based on the original paper.☆13Nov 18, 2019Updated 6 years ago
- Deep Reinforcement Learning with Double Q-learning☆14Nov 17, 2020Updated 5 years ago
- DDPG in Pytorch☆48Jan 16, 2022Updated 4 years ago
- todo: desc☆11Aug 12, 2021Updated 4 years ago
- QMR implementation using DroNet☆14May 24, 2024Updated last year
- ☆13Apr 12, 2022Updated 3 years ago
- A front-end quantitative framework built on financial data, making financial data easier to process☆11Jul 29, 2016Updated 9 years ago
- ☆14Aug 9, 2019Updated 6 years ago
- Predicting stock price changes using reinforcement learning☆22May 25, 2017Updated 8 years ago
- my code for paper Parameterized-DQN☆25Mar 5, 2021Updated 5 years ago
- A framework for the implementation and evaluation of routing algorithms based on the Ant Colony Optimization (ACO) metaheuristic.☆20Jun 26, 2016Updated 9 years ago
- A simple python implementation of stochastic network calculus☆19Oct 28, 2021Updated 4 years ago
- Using N-step dueling DDQN with PER for playing Pacman game☆22Oct 27, 2019Updated 6 years ago
- ☆16May 13, 2024Updated last year
- Reinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout☆96Jun 22, 2018Updated 7 years ago
- Using Resnet architecture in the contextual bandit framework for financial asset trading☆16Nov 22, 2024Updated last year
- [AAAI 2026] TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution☆15Aug 1, 2025Updated 7 months ago
- This is the source code of the paper titled "QMR: Q-learning based Multi-objective optimization Routing protocol for Flying Ad Hoc Networ…☆23Apr 8, 2022Updated 3 years ago
- Learning Based FEC for Non-Terrestrial Networks with Delayed Feedback☆15Mar 1, 2022Updated 4 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆27Oct 23, 2025Updated 5 months ago
- Reinforcement Learning (RL) based navigation implementation for mobile robot navigation. The algorithms of TD3, DDPG, SAC, DQN, Q-Learnin…☆33Oct 6, 2023Updated 2 years ago
- Adaptive Decoding Mechanisms for UAV-enabled Double-Uplink Coordinated NOMA, IEEE Transactions on Vehicular Technology, Mar. 2023☆21Apr 8, 2023Updated 2 years ago
- Code for SIGKDD2025 paper: An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem☆14Jan 28, 2025Updated last year
- [ICML 2023] Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optim…☆10Dec 19, 2023Updated 2 years ago
- the source code of IJCAI 2023 paper "Multi-Scale subgraph contrastive learning"☆11Apr 25, 2023Updated 2 years ago
- ☆18Oct 6, 2025Updated 5 months ago
- Repository for the ns3 implementation of enhanced Q-routing with QoS☆21Aug 15, 2020Updated 5 years ago
- This repo has our initial codes for offline implementation of NOMA with CVX☆17Nov 6, 2019Updated 6 years ago
- ☆10Dec 26, 2023Updated 2 years ago
- ☆15Jan 22, 2025Updated last year
- ☆10Sep 19, 2023Updated 2 years ago
- https://arxiv.org/abs/2006.04992☆19Jun 17, 2021Updated 4 years ago
- "Learning Stable Classifiers by Transferring Unstable Features" ICML 2022☆14Jul 24, 2022Updated 3 years ago
- This is the official PyTorch implementation for the HLGP algorithm used to solve large-scale CVRP.☆10Feb 13, 2025Updated last year
- Greedy Perimeter Stateless Routing (GPSR) implementation on the NS3 platform☆21Aug 17, 2017Updated 8 years ago
- End-to-end reinforcement learning using DDPG and PPO algorithms in a simulated robot environment☆21Mar 13, 2020Updated 6 years ago
- Heterogeneous capacitated vehicle routing problem☆11Aug 4, 2018Updated 7 years ago