强化学习常见算法的实现,Q-Learning/DQN/PG/AC/DDPG/PPO/SAC
☆26Feb 17, 2022Updated 4 years ago
Alternatives and similar repositories for RL-demo
Users that are interested in RL-demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jun 30, 2023Updated 2 years ago
- This is a personal library that strives to implement various MARL algorithms. The environment only integrates MPE, and the algorithm curr…☆15May 22, 2025Updated 10 months ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆151Jan 23, 2026Updated 2 months ago
- Implementation of Pareto Deep Q Networks in a multi-objective Gym Reinforcement Learning Environment☆17Jun 19, 2023Updated 2 years ago
- 硕士毕业论文代码 深度强化学习☆10Apr 4, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Pytorch Code for "Rethinking Degradation: Radiograph Super-Resolution via AID-SRGAN" - MICCAI 2022 Workshop☆16Dec 11, 2024Updated last year
- 使用pytorch构建深度强化学习模型DQN☆26Dec 5, 2017Updated 8 years ago
- 使用PPO算法+OU噪声进行机械臂轨迹规划仿真☆18May 10, 2024Updated last year
- qmix☆23May 28, 2020Updated 5 years ago
- 基于分层强化学习和逆向强化学习的自适应巡航算法☆27Oct 8, 2019Updated 6 years ago
- Python implementation of the img2net algorithm.☆10Jan 7, 2026Updated 3 months ago
- 基于强化学习的游戏空战推演☆13May 8, 2021Updated 4 years ago
- 基于PPO算法的轨迹规划☆19Apr 11, 2024Updated 2 years ago
- Physics-Guided Reinforcement Learning System for Realistic Vehicle Active Suspension Control (IEEE ICMLA 2023)☆28Aug 19, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Dec 4, 2025Updated 4 months ago
- Simulated & controlled semi-active suspension system of car on MATLAB & Simulink.☆23Dec 28, 2021Updated 4 years ago
- lecture32_AI挑战星际争霸II(强化学习)☆17Aug 23, 2022Updated 3 years ago
- Compute the likeliness of an image region to vessels or ridges☆29Jul 20, 2017Updated 8 years ago
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- Using deep reinforcement learning to play Snake game. The used algorithm is PPO for discrete! It has the brilliant performance in the fi…☆34Nov 3, 2025Updated 5 months ago
- Simple PyTorch graph capturing.☆21May 31, 2023Updated 2 years ago
- High-fidelity simulator for off-road driving☆32Jun 6, 2024Updated last year
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 4 years ago
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 4 years ago
- 基于ppo算法的计算卸载策略研究☆29Jan 17, 2023Updated 3 years ago
- Code accompanying https://arxiv.org/abs/1802.02219☆19Oct 5, 2022Updated 3 years ago
- PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).☆31Oct 27, 2021Updated 4 years ago
- multi task learning for multi-classification using keras☆13Feb 10, 2020Updated 6 years ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Mar 31, 2022Updated 4 years ago
- 在turtlebot3,pytorch上使用DQN,DDPG,PPO,SAC算法,在gazebo上实现仿真。Use DQN, DDPG, PPO, SAC algorithm on turtlebot3, pytorch on turtlebot3, pytorch, an…☆131Oct 8, 2023Updated 2 years ago
- ppo-lstm-parallel☆49Mar 26, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SimCSE☆15Oct 1, 2022Updated 3 years ago
- 应用强化学习在复杂的交通环境下自动学习最佳驾驶策略的方案,在测试环境下准确率达到100%。☆22Feb 26, 2017Updated 9 years ago
- Comparaison of adversarial training algorithms (FreeLB, FreeAT and K-PGD) on natural language tasks☆13Feb 14, 2020Updated 6 years ago
- My work during the research project "Comfort-oriented adaptive cruise control of an autonomous vehicle" at GIPSA-lab, Jan.-Jun. 2020, lat…☆30Apr 30, 2021Updated 4 years ago
- Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"☆12Mar 31, 2022Updated 4 years ago
- code for Model-Guided Multi-Contrast Deep Unfolding Network for MRI Super-resolution Reconstruction☆17Oct 23, 2023Updated 2 years ago
- Ryu component-based software defined networking framework☆31Sep 17, 2021Updated 4 years ago