强化学习常见算法的实现,Q-Learning/DQN/PG/AC/DDPG/PPO/SAC
☆26Feb 17, 2022Updated 4 years ago
Alternatives and similar repositories for RL-demo
Users that are interested in RL-demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jun 30, 2023Updated 2 years ago
- This is a important file!☆10Feb 14, 2019Updated 7 years ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆152Jan 23, 2026Updated 4 months ago
- Implementation of Pareto Deep Q Networks in a multi-objective Gym Reinforcement Learning Environment☆17Jun 19, 2023Updated 2 years ago
- 硕士毕业论文代码 深度强化学习☆10Apr 4, 2020Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official Pytorch Code for "Rethinking Degradation: Radiograph Super-Resolution via AID-SRGAN" - MICCAI 2022 Workshop☆16Dec 11, 2024Updated last year
- 使用pytorch构建深度强化学习模型DQN☆26Dec 5, 2017Updated 8 years ago
- qmix☆23May 28, 2020Updated 6 years ago
- Python implementation of the img2net algorithm.☆10Jan 7, 2026Updated 4 months ago
- This contains the simulation of a kinova robot and the code for collecting data and training both a grasp classifier and a RL agent☆30Jan 22, 2022Updated 4 years ago
- ☆15Apr 4, 2025Updated last year
- ☆14Mar 26, 2025Updated last year
- ☆11Dec 4, 2025Updated 5 months ago
- lecture32_AI挑战星际争霸II(强化学习)☆18Aug 23, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Data loaders for various deep learning datasets☆17Jun 10, 2023Updated 2 years ago
- Compute the likeliness of an image region to vessels or ridges☆29Jul 20, 2017Updated 8 years ago
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- Using deep reinforcement learning to play Snake game. The used algorithm is PPO for discrete! It has the brilliant performance in the fi…☆34Nov 3, 2025Updated 6 months ago
- Reproduce the paper Distributed Representations of Sentences and Documents in tensorflow☆14Apr 8, 2017Updated 9 years ago
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- A seq2seq with attention dialogue/MT model implemented by TensorFlow.☆11Jul 17, 2018Updated 7 years ago
- 使用 NSGAII 算法求解 FJSP 问题(柔性作业车间调度)☆27Feb 12, 2025Updated last year
- ☆10May 6, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 4 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).☆32Oct 27, 2021Updated 4 years ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Mar 31, 2022Updated 4 years ago
- 在turtlebot3,pytorch上使用DQN,DDPG,PPO,SAC算法,在gazebo上实现仿真。Use DQN, DDPG, PPO, SAC algorithm on turtlebot3, pytorch on turtlebot3, pytorch, an…☆134Oct 8, 2023Updated 2 years ago
- 使用深度强化学习解决视觉跟踪和视觉导航问题☆28Mar 18, 2021Updated 5 years ago
- ppo-lstm-parallel☆49Mar 26, 2019Updated 7 years ago
- SimCSE☆15Oct 1, 2022Updated 3 years ago
- Covert Keras models to Pytorch☆12Dec 22, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Comparaison of adversarial training algorithms (FreeLB, FreeAT and K-PGD) on natural language tasks☆12Feb 14, 2020Updated 6 years ago
- ☆22Apr 22, 2025Updated last year
- My work during the research project "Comfort-oriented adaptive cruise control of an autonomous vehicle" at GIPSA-lab, Jan.-Jun. 2020, lat…☆30Apr 30, 2021Updated 5 years ago
- Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"☆12Mar 31, 2022Updated 4 years ago
- code for Model-Guided Multi-Contrast Deep Unfolding Network for MRI Super-resolution Reconstruction☆17Oct 23, 2023Updated 2 years ago
- 本课程主要介绍强化学习的基础知识,其目标是帮助同学们快速、顺利地进入强化学习及其应用领域的研究工作。课程主要内容包含有限马尔可夫决策过程,动态规划,无模型预测与控制(SASA,Q-Learning),价值函数逼近(DQN),策略梯度方法(REINFORCE),执行者/评论者…☆17Oct 17, 2022Updated 3 years ago
- 强化学习求解迷宫问题,Q-learning和监督学习☆24Sep 20, 2020Updated 5 years ago