Run OpenAI Gym on a Server
☆18Aug 25, 2017Updated 8 years ago
Alternatives and similar repositories for CartPole
Users who are interested in CartPole are comparing it to the libraries listed below.
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- A first bare-bones parallel implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Monte Carlo Counterfactual Regret Minimization for imperfect information games☆13Mar 29, 2019Updated 7 years ago
- Motion imitation with deep reinforcement learning.☆13Jul 24, 2019Updated 6 years ago
- AlphaGo Zero Reinforcement Learning Sokoban Solver☆11Jun 20, 2018Updated 7 years ago
- ☆15Sep 22, 2023Updated 2 years ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- Various DQN methods with CartPole☆11May 30, 2018Updated 7 years ago
- ☆46Feb 12, 2021Updated 5 years ago
- Personal website☆15Jun 14, 2025Updated 10 months ago
- Works for Applied Deep Learning / Machine Learning and Having It Deep and Structured (2017 FALL) @ NTU☆11Aug 14, 2018Updated 7 years ago
- Code for "Dynamic Discounted Counterfactual Regret Minimization", ICLR 2024 (Spotlight)☆17Apr 22, 2024Updated last year
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- [IJCAI 2021] Solving Continuous Control with Episodic Memory☆15Apr 10, 2022Updated 4 years ago
- Which fellows cited my article?☆24Mar 6, 2022Updated 4 years ago
- Convolutional Neural Network for Click-Through Rate prediction.☆15Sep 28, 2016Updated 9 years ago
- This repository mainly organizes resources related to embodied intelligence, including data, models, hardware, and software infrastructur…☆10Jan 30, 2024Updated 2 years ago
- ☆17Oct 22, 2022Updated 3 years ago
- A Python implementation of PLSA☆25Oct 25, 2014Updated 11 years ago
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- PyTorch implementation of "The Option Keyboard: Combining Skills in Reinforcement Learning" (NeurIPS 2019)☆12Jul 2, 2020Updated 5 years ago
- ☆16Jul 1, 2021Updated 4 years ago
- Sokoban solver☆17Apr 6, 2026Updated last week
- CTR prediction using graph neural networks☆15Nov 22, 2019Updated 6 years ago
- A2C for GVG-AI☆22Nov 7, 2018Updated 7 years ago
- Gym environment for a match-3 game☆11Jun 2, 2019Updated 6 years ago
- Repo for tutorials☆13Jan 22, 2019Updated 7 years ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Apr 22, 2024Updated last year
- A docker container that lets you run AirSim without building it.☆14Sep 20, 2017Updated 8 years ago
- A TensorFlow implementation of DSSM (slightly modified).☆24May 19, 2016Updated 9 years ago
- ☆29Sep 22, 2019Updated 6 years ago
- Episodic Control☆22Sep 20, 2022Updated 3 years ago
- A PyTorch implementation of the Deep Dictionary model☆14Nov 11, 2020Updated 5 years ago
- PyTorch implementation of DeepMind's 'Hybrid computing using a neural network with dynamic external memory' (Differentiable Neural Comput…☆20Dec 9, 2017Updated 8 years ago
- MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…☆23Apr 1, 2024Updated 2 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆18Mar 1, 2021Updated 5 years ago
- Collection of game-theoretic algorithms for Poker☆30Apr 6, 2019Updated 7 years ago
- Kafka learning example demo☆12Aug 23, 2016Updated 9 years ago
- ☆24Jun 5, 2021Updated 4 years ago