XinJingHao / PPO-Discrete-Pytorch
A clean and robust PyTorch implementation of PPO on a discrete action space
☆72 · Jun 8, 2024 · Updated last year
Alternatives and similar repositories for PPO-Discrete-Pytorch
Users who are interested in PPO-Discrete-Pytorch compare it to the repositories listed below.
- Actor-Sharer-Learner training framework for off-policy DRL algorithms ☆22 · Dec 29, 2024 · Updated last year
- A clean and robust PyTorch implementation of SAC on a discrete action space ☆42 · Oct 23, 2024 · Updated last year
- A clean and robust PyTorch implementation of SAC on a continuous action space ☆90 · Apr 13, 2025 · Updated 10 months ago
- Learning to Incentivize Other Learning Agents ☆35 · Jun 13, 2022 · Updated 3 years ago
- ☆22 · Mar 7, 2021 · Updated 4 years ago
- ☆40 · Nov 17, 2021 · Updated 4 years ago
- Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC. ☆1,441 · Mar 29, 2023 · Updated 2 years ago
- CFR implementation of a poker bot. ☆12 · Feb 17, 2023 · Updated 2 years ago
- Deep Learning Project ☆23 · Jan 18, 2020 · Updated 6 years ago
- Collection of OpenAI parametrized action-space environments. ☆69 · Mar 19, 2025 · Updated 10 months ago
- Implementations of a large collection of reinforcement learning algorithms. ☆28 · Nov 30, 2023 · Updated 2 years ago
- AI for Google Research Football ☆27 · Dec 14, 2020 · Updated 5 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and CUDA GPU accelerat… ☆13 · Nov 27, 2022 · Updated 3 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms ☆13 · Dec 15, 2022 · Updated 3 years ago
- Estimate probability of failure using reframed Bayesian optimization ☆10 · Aug 14, 2025 · Updated 6 months ago
- ☆11 · Jul 15, 2025 · Updated 6 months ago
- ☆10 · Dec 10, 2021 · Updated 4 years ago
- A C++ implementation for calculating the accuracy metrics (Accuracy, Error Rate, Precision(micro/macro), Recall(micro/macro), Fscore(micr… ☆12 · Jul 2, 2019 · Updated 6 years ago
- Some microbenchmarks and design docs before commencement ☆12 · Feb 1, 2021 · Updated 5 years ago
- Datasets and code to accompany Briceno-Mena, Luis A. and Venugopalan, Gokul and Romagnoli, José A. and Arges, Christopher G., Machine Lea… ☆10 · Oct 17, 2022 · Updated 3 years ago
- ☆15 · May 20, 2025 · Updated 8 months ago
- ☆37 · Mar 10, 2022 · Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic. ☆36 · Sep 19, 2021 · Updated 4 years ago
- 🚀 Train your own VLA "large model" end to end: a 26M-parameter visual multimodal VLM trained from scratch in just 1 hour! ☆26 · Oct 16, 2025 · Updated 3 months ago
- ☆14 · Feb 2, 2026 · Updated last week
- A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios ☆17 · Updated this week
- GAN: An example of generating a Gaussian distribution with a simple generative adversarial network. ☆12 · Dec 28, 2020 · Updated 5 years ago
- 🌿 Quickly generate a folder directory structure, with configurable directory depth and output to a markdown file. ☆13 · Oct 19, 2022 · Updated 3 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati… ☆10 · Sep 23, 2023 · Updated 2 years ago
- A QA system based on k8s-specific knowledge, built on ChatGLM2-6B and served by Ray. ☆10 · Sep 14, 2023 · Updated 2 years ago
- This is a PyTorch implementation of our AAAI paper for learned image transmission with HVAE ☆10 · Aug 8, 2025 · Updated 6 months ago
- torch7 wrapper for knn CUDA code ☆10 · Dec 1, 2014 · Updated 11 years ago
- Amaru - Finite element library ☆13 · May 15, 2025 · Updated 8 months ago
- A Texas Holdem poker framework written in C++20. ☆11 · Apr 23, 2023 · Updated 2 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms ☆167 · May 9, 2023 · Updated 2 years ago
- Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation" ☆10 · Mar 27, 2018 · Updated 7 years ago
- ☆10 · May 31, 2020 · Updated 5 years ago
- An NLEIS toolbox for impedance.py that provides RC level nonlinear equivalent circuit modeling (nECM) and analysis ☆12 · Nov 16, 2025 · Updated 2 months ago
- CFR-based Texas Hold'em AI ☆11 · Jan 30, 2021 · Updated 5 years ago