Run OpenAI Gym on a Server
☆18Aug 25, 2017Updated 8 years ago
Alternatives and similar repositories for CartPole
Users that are interested in CartPole are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 让所有人都可以让dummy机械臂跑在moveit2上☆28Jul 17, 2025Updated 11 months ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Motion imitation with deep reinforcement learning.☆13Jul 24, 2019Updated 6 years ago
- Implementation of Genetic Algorithm to balance inverted pendulum in OpenAI gym environment☆19Aug 10, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- AlphaGo Zero Reinforcement Learning Sokoban Solver☆11Jun 20, 2018Updated 8 years ago
- ☆15Sep 22, 2023Updated 2 years ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- Various DQN method with cartpole☆11May 30, 2018Updated 8 years ago
- ☆47Feb 12, 2021Updated 5 years ago
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals inCuriosity-Driven Exploration.☆11Mar 5, 2021Updated 5 years ago
- Code for "Dynamic Discounted Counterfactual Regret Minimization", ICLR 2024 (Spotlight)☆18Apr 22, 2024Updated 2 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- A site comparing services of different Cloud Vendors☆10Jan 4, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Mar 13, 2017Updated 9 years ago
- 采样FCRN: Fully-Convolutional Regression Network (全卷积回归网络),出自VGG 实验室这篇 CVPR2016的Paper:Synthetic Data for Text Localisation in Natural Image…☆10Jun 13, 2017Updated 9 years ago
- ☆13Jan 14, 2020Updated 6 years ago
- opencvprojects for android☆13Jan 27, 2013Updated 13 years ago
- [IJCAI 2021] Solving Continuous Control with Episodic Memory☆15Apr 10, 2022Updated 4 years ago
- Which fellows cited my article?☆25Mar 6, 2022Updated 4 years ago
- Convolutional Neural Network for Click-Through Rate prediction.☆15Sep 28, 2016Updated 9 years ago
- This repository mainly organizes resources related to embodied intelligence, including data, models, hardware, and software infrastructur…☆10Jan 30, 2024Updated 2 years ago
- a python implementation of plsa☆25Oct 25, 2014Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- Sokoban solver☆17Jun 11, 2026Updated 3 weeks ago
- 利用图神经网络进行CTR预估☆15Nov 22, 2019Updated 6 years ago
- A2C for GVG-AI☆22Nov 7, 2018Updated 7 years ago
- env for gym, match3 game☆11Jun 2, 2019Updated 7 years ago
- AdBandit: A New Algorithm For Multi-Armed Bandits☆11Mar 22, 2015Updated 11 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Apr 22, 2024Updated 2 years ago
- A docker container that lets you run AirSim without building it.☆14Sep 20, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A simple tool for labeling object bounding boxes in images☆12Oct 7, 2017Updated 8 years ago
- Episodic Control☆22Sep 20, 2022Updated 3 years ago
- Pytorch Implementation of Deepmind's 'Hybrid computing using a neural network with dynamic external memory' (Differentiable Neural Comput…☆20Dec 9, 2017Updated 8 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆19Mar 1, 2021Updated 5 years ago
- Collection of game-theoretic algorithms for Poker☆30Apr 6, 2019Updated 7 years ago
- 微信朋友圈,QQ空间,微博等列表展示的功能实现☆15May 24, 2017Updated 9 years ago
- MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…☆25Apr 1, 2024Updated 2 years ago