GAIL learning to imitate PPO playing CartPole.
☆13 · May 27, 2021 · Updated 5 years ago
Alternatives and similar repositories for PPO-GAIL-cartpole
Users that are interested in PPO-GAIL-cartpole are comparing it to the libraries listed below.
- End-to-End Driving via Generative Adversarial Imitation Learning ☆29 · Apr 4, 2023 · Updated 3 years ago
- ☆10 · Apr 23, 2021 · Updated 5 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works ☆18 · Mar 2, 2021 · Updated 5 years ago
- multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors) ☆13 · Aug 17, 2019 · Updated 6 years ago
- ☆13 · Jan 22, 2025 · Updated last year
- Kuhn poker implemented in accordance with the OpenAI Gym interface ☆14 · Dec 5, 2019 · Updated 6 years ago
- ☆12 · Jun 17, 2022 · Updated 3 years ago
- ☆16 · Nov 7, 2020 · Updated 5 years ago
- CartPole-v0 via PPO with GAE, PyTorch ☆21 · Feb 10, 2019 · Updated 7 years ago
- Lecture notes for a course on Decision and Game Theory for undergraduates studying AI ☆13 · Dec 14, 2018 · Updated 7 years ago
- ☆22 · May 20, 2021 · Updated 5 years ago
- ☆24 · Sep 27, 2024 · Updated last year
- Soccer Trajectory Prediction Competition ☆15 · Aug 28, 2025 · Updated 9 months ago
- Collection of OpenAI parametrized action-space environments. ☆69 · Mar 19, 2025 · Updated last year
- ☆17 · Dec 13, 2019 · Updated 6 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms ☆22 · Dec 29, 2024 · Updated last year
- Code for magnetic mirror descent. ☆18 · Oct 5, 2023 · Updated 2 years ago
- Study notes on the algorithm engineer technology stack ☆15 · Aug 22, 2022 · Updated 3 years ago
- Parses a document (scanned or phone captured) and returns the underlying question - answer layout structured capture by LayoutXLM model ☆10 · Jun 14, 2021 · Updated 4 years ago
- ☆21 · Dec 22, 2020 · Updated 5 years ago
- ☆10 · Jun 4, 2024 · Updated last year
- Generalised UDRL ☆37 · May 12, 2022 · Updated 4 years ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga… ☆11 · Dec 1, 2022 · Updated 3 years ago
- ☆84 · Dec 4, 2018 · Updated 7 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆56 · Aug 30, 2024 · Updated last year
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning> ☆11 · Oct 8, 2021 · Updated 4 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl… ☆14 · Aug 19, 2022 · Updated 3 years ago
- Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation ☆13 · Nov 5, 2025 · Updated 6 months ago
- Inclined Drone landing using deep reinforcement learning ☆24 · Feb 10, 2022 · Updated 4 years ago
- IPyHOP is a Re-entrant Iterative GTPyHOP written in Python 3. PyHOP is an acronym for Python Hierarchical Ordered Planner. ☆12 · Aug 12, 2022 · Updated 3 years ago
- Evidential Calibration ☆11 · Mar 8, 2022 · Updated 4 years ago
- ☆10 · Aug 18, 2022 · Updated 3 years ago
- A dataset for hockey player tracking, following the same format as the MOT challenge dataset. ☆24 · Oct 31, 2023 · Updated 2 years ago
- ☆17 · Oct 12, 2023 · Updated 2 years ago
- ☆10 · Oct 11, 2022 · Updated 3 years ago
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games". ☆17 · Oct 12, 2022 · Updated 3 years ago
- Learning to draw samples: with application to amortized maximum likelihood estimator for generative adversarial learning ☆10 · Dec 28, 2021 · Updated 4 years ago
- Value-Decomposition Multi-Agent Actor-Critics ☆42 · Dec 8, 2022 · Updated 3 years ago
- IRL implementation based on Norvig's AIMA code. ☆14 · May 2, 2014 · Updated 12 years ago