Version 3.0.0 Pytorch implementations of DQN, DDQN, DDPG, SAC, Discrete SAC. With more features :)
☆12Feb 16, 2023Updated 3 years ago
Alternatives and similar repositories for RL-v2
Users that are interested in RL-v2 are comparing it to the libraries listed below
Sorting:
- This is MPE-pytorch, fix some bugs.☆10Apr 26, 2020Updated 5 years ago
- ☆11Sep 17, 2018Updated 7 years ago
- 第二届“泰迪杯”数据分析职业技能大赛A题☆10Sep 15, 2020Updated 5 years ago
- The code for MuddSub's Alfie AUV☆10Mar 1, 2026Updated last week
- 自己学习 Qt QCustomPlot 库的例程☆12Oct 18, 2020Updated 5 years ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆67Jul 9, 2019Updated 6 years ago
- Open Source Firmware for the Pegasus Touch 3D Printer☆17Jan 6, 2022Updated 4 years ago
- 水下物体检测算法赛(光学图像赛道)方案☆10Jul 13, 2020Updated 5 years ago
- RoboMaster power-rune task☆12Jun 19, 2022Updated 3 years ago
- Deep residual networks for dimensionality reduction and surrogate modeling in high-dimensional inverse problems☆10Mar 20, 2021Updated 4 years ago
- 第八届“泰迪杯”数据挖掘挑战赛的一点心得☆10Nov 26, 2020Updated 5 years ago
- Man in the middle attack demo☆11Jan 14, 2018Updated 8 years ago
- Winning solution of the Microsoft Research "First TextWorld Problems: A Reinforcement and Language Learning Challenge"☆12Jun 21, 2022Updated 3 years ago
- Evaluation Pipeline for medical tasks.☆12Feb 13, 2026Updated 3 weeks ago
- Arduino Timer2 library☆10Jun 25, 2023Updated 2 years ago
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas…☆23Updated this week
- CDbw Index For Cluster Validation☆10Mar 26, 2019Updated 6 years ago
- Marlin is an optimized firmware for RepRap 3D printers based on the Arduino platform. | Many commercial 3D printers come with Marlin inst…☆11Jun 5, 2023Updated 2 years ago
- Classification of human emotion using multi-modal models☆12Jun 27, 2020Updated 5 years ago
- Collection of ROS 2 packages and hardware interfaces that support ROS 2 integration with ArduSub☆14Feb 23, 2026Updated last week
- Training and testing pipeline for ransomware classification based on screenshots of the splash screens or ransom notes (https://arxiv.org…☆11Jul 19, 2020Updated 5 years ago
- 1. Simulation of a job shop production system 2. Reinforcement Learning agent to control the production system☆11Sep 8, 2021Updated 4 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- Deep Counterfactual Prediction with Categorical Backward Variables☆12Feb 8, 2023Updated 3 years ago
- ☆10Aug 14, 2020Updated 5 years ago
- moziai强化学习和行为树的代码☆10Mar 18, 2020Updated 5 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- This is pytorch version of maddpg.☆10Jun 23, 2020Updated 5 years ago
- A underwater vehicle simulation test-bed with SAUVC swimming pool environment with 6-vectored thruster configuration vehicle operating in…☆10Mar 12, 2022Updated 3 years ago
- ☆12Feb 19, 2026Updated 2 weeks ago
- python package for Japanese NLP and many other utils☆13Oct 19, 2019Updated 6 years ago
- Official Repository for Heterogeneous Models Dataset Condensation (ECCV 2024, Oral)☆10Dec 15, 2024Updated last year
- The Third Place Winner in Generative Track of the ECCV 2024 DD Challenge☆10Oct 11, 2024Updated last year
- Align depth image to color image for rgbd camera.☆14Sep 2, 2017Updated 8 years ago
- 2020腾讯游戏安全技术竞赛机器学习组优秀奖源码☆10Apr 16, 2020Updated 5 years ago
- Multiple Traveling Salesman Problem (mTSP) for Flight Path Planning using Mixed-Integer Linear Programming (MILP)☆11Mar 20, 2022Updated 3 years ago
- Repo for the walking robot's vision based navigation code☆10Jun 6, 2023Updated 2 years ago
- Program for detection and tracking players on a sports ground and calculation of basic statistical indicators using deep learning☆10Nov 13, 2022Updated 3 years ago