bhatiaabhinav / RL-v2View external linksLinks
Version 3.0.0 Pytorch implementations of DQN, DDQN, DDPG, SAC, Discrete SAC. With more features :)
☆12Feb 16, 2023Updated 2 years ago
Alternatives and similar repositories for RL-v2
Users that are interested in RL-v2 are comparing it to the libraries listed below
Sorting:
- This is MPE-pytorch, fix some bugs.☆10Apr 26, 2020Updated 5 years ago
- ☆11Sep 17, 2018Updated 7 years ago
- 第二届“泰迪杯”数据分析职业技能大赛A题☆10Sep 15, 2020Updated 5 years ago
- 自己学习 Qt QCustomPlot库的例程☆12Oct 18, 2020Updated 5 years ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- The code for MuddSub's Alfie AUV☆10Nov 23, 2025Updated 2 months ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆67Jul 9, 2019Updated 6 years ago
- Arduino Timer2 library☆10Jun 25, 2023Updated 2 years ago
- Classification of human emotion using multi-modal models☆12Jun 27, 2020Updated 5 years ago
- CDbw Index For Cluster Validation☆10Mar 26, 2019Updated 6 years ago
- Marlin is an optimized firmware for RepRap 3D printers based on the Arduino platform. | Many commercial 3D printers come with Marlin inst…☆11Jun 5, 2023Updated 2 years ago
- Man in the middle attack demo☆11Jan 14, 2018Updated 8 years ago
- 第八届“泰迪杯”数据挖掘挑战赛的一点心得☆10Nov 26, 2020Updated 5 years ago
- Collection of ROS 2 packages and hardware interfaces that support ROS 2 integration with ArduSub☆14Nov 24, 2025Updated 2 months ago
- Winning solution of the Microsoft Research "First TextWorld Problems: A Reinforcement and Language Learning Challenge"☆12Jun 21, 2022Updated 3 years ago
- Multiple Traveling Salesman Problem (mTSP) for Flight Path Planning using Mixed-Integer Linear Programming (MILP)☆11Mar 20, 2022Updated 3 years ago
- Training and testing pipeline for ransomware classification based on screenshots of the splash screens or ransom notes (https://arxiv.org…☆11Jul 19, 2020Updated 5 years ago
- Open Source Firmware for the Pegasus Touch 3D Printer☆17Jan 6, 2022Updated 4 years ago
- 水下物体检测算法赛(光学图像赛道)方案☆10Jul 13, 2020Updated 5 years ago
- Evaluation Pipeline for medical tasks.☆12Updated this week
- RoboMaster power-rune task☆12Jun 19, 2022Updated 3 years ago
- Anomaly detection with GAN+RL☆11Jan 12, 2019Updated 7 years ago
- Deep Counterfactual Prediction with Categorical Backward Variables☆12Feb 8, 2023Updated 3 years ago
- This is the repo for "Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition", CVPR2025.☆20Dec 22, 2025Updated last month
- ☆10Aug 14, 2020Updated 5 years ago
- 2020腾讯游戏安全技术竞赛机器学习组优秀奖源码☆10Apr 16, 2020Updated 5 years ago
- Simulation of car parking in different parking lots using Unity ML-Agents☆12Dec 16, 2023Updated 2 years ago
- ☆14Jul 13, 2017Updated 8 years ago
- Align depth image to color image for rgbd camera.☆14Sep 2, 2017Updated 8 years ago
- 蚂蚁金融自然语言处理竞赛。☆10Sep 3, 2018Updated 7 years ago
- Code for "Efficient Relation-aware Scoring Function Search for Knowledge Graph Embedding" ICDE 2021☆11Apr 26, 2021Updated 4 years ago
- My Master Thesis at the ASL supervised by Hermann Blum, Francesco Milano and Dr. Cadena Cesar☆11Dec 20, 2022Updated 3 years ago
- This is pytorch version of maddpg.☆10Jun 23, 2020Updated 5 years ago
- The Third Place Winner in Generative Track of the ECCV 2024 DD Challenge☆10Oct 11, 2024Updated last year
- character recognition, textline recognition☆10Aug 31, 2019Updated 6 years ago
- moziai强化学习和行为树的代码☆10Mar 18, 2020Updated 5 years ago
- A C++ toolbox for computing Discrete and Fast Fourier Transforms (DFT,FFT), Power Spectral Density (PSD) estimates, and the sound pressur…☆15Mar 19, 2023Updated 2 years ago
- ☆10Feb 23, 2021Updated 4 years ago
- python package for Japanese NLP and many other utils☆13Oct 19, 2019Updated 6 years ago