GAIL learning to imitate PPO playing CartPole.
☆13May 27, 2021Updated 4 years ago
Alternatives and similar repositories for PPO-GAIL-cartpole
Users that are interested in PPO-GAIL-cartpole are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- End-to-End Driving via Generative Adversarial Imitation Learning☆29Apr 4, 2023Updated 3 years ago
- ☆10Apr 23, 2021Updated 5 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)☆13Aug 17, 2019Updated 6 years ago
- ☆13Jan 22, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Kuhn poker implemented in accordance to OpenAI gym interface☆14Dec 5, 2019Updated 6 years ago
- ☆12Jun 17, 2022Updated 3 years ago
- ☆16Nov 7, 2020Updated 5 years ago
- Lecture notes for a course on Decision and Game Theory for undergraduates studying AI☆13Dec 14, 2018Updated 7 years ago
- CartPole-v0 via PPO with GAE, PyTorch☆21Feb 10, 2019Updated 7 years ago
- ☆22May 20, 2021Updated 4 years ago
- Soccer Trajectory Prediction Competition☆15Aug 28, 2025Updated 8 months ago
- Collection of OpenAI parametrized action-space environments.☆69Mar 19, 2025Updated last year
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆176Mar 22, 2022Updated 4 years ago
- Detect keypoints at a football pitch☆25Jan 14, 2023Updated 3 years ago
- Code for magnetic mirror descent.☆18Oct 5, 2023Updated 2 years ago
- 算法工程师技术栈学习笔记☆15Aug 22, 2022Updated 3 years ago
- ☆10Jun 4, 2024Updated last year
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- fork of gitlab.com/bpaassen/five_clique. Solution to Matt Parker's 5-clique problem☆10Aug 4, 2022Updated 3 years ago
- ☆84Dec 4, 2018Updated 7 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆56Aug 30, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Aug 19, 2022Updated 3 years ago
- Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation☆14Nov 5, 2025Updated 6 months ago
- IPyHOP is a Re-entrant Iterative GTPyHOP written in Python 3. PyHOP is an acronym for Python Hierarchical Ordered Planner.☆11Aug 12, 2022Updated 3 years ago
- Evidential Calibration☆11Mar 8, 2022Updated 4 years ago
- ☆10Aug 18, 2022Updated 3 years ago
- A dataset for hockey player tracking, following the same format as the MOT challenge dataset.☆24Oct 31, 2023Updated 2 years ago
- ☆17Oct 12, 2023Updated 2 years ago
- Open AI gym poker environment built using the clubs package☆35Jan 14, 2024Updated 2 years ago
- AI for google research football☆28Dec 14, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 论文Reinforcement Learning of Sequential Price Mechanisms的复现☆12Nov 3, 2022Updated 3 years ago
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".☆17Oct 12, 2022Updated 3 years ago
- Learning to draw samples: with application to amortized maximum likelihood estimator for generative adversarial learning☆10Dec 28, 2021Updated 4 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆42Dec 8, 2022Updated 3 years ago
- IRL implementation based on Norvig's AIMA code.☆14May 2, 2014Updated 12 years ago
- ☆11Oct 8, 2022Updated 3 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 3 years ago