Play games in the OpenAI gym using the keyboard
☆16Nov 21, 2017Updated 8 years ago
Alternatives and similar repositories for OpenAIGaming
Users that are interested in OpenAIGaming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [RAL 2023] transformer + reinforcement learning for navigation + POMPD☆15Jul 19, 2023Updated 2 years ago
- eSNN - Learning similarity measure from data☆12Nov 28, 2019Updated 6 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 8 months ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- 统计微信朋友圈送出的赞票与得到的赞票人员比例☆11May 3, 2016Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official repo for vidar and vidarc: video foundation model for robotics.☆41Dec 22, 2025Updated 5 months ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- PyTorch implementation of PtrNet to solve sorting problem.☆12Dec 19, 2017Updated 8 years ago
- Gradient based receptive field estimation for Convolutional Neural Networks☆14Nov 25, 2017Updated 8 years ago
- Neo4j 大规模 三元组 CVS 导入进数据库☆11Jul 31, 2020Updated 5 years ago
- Data Analysis and Visualization on Airbnb Data☆11Aug 17, 2018Updated 7 years ago
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- A repo to design basic Policy Gradient labs☆12Jul 6, 2023Updated 2 years ago
- ☆16May 28, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An PyTorch implementation of "Importance Weighted Actor-Learner Architectures" https://arxiv.org/abs/1802.01561☆12Jan 6, 2021Updated 5 years ago
- ☆22May 23, 2025Updated last year
- Neural Potential Field for Obstacle-Aware Local Motion Planning☆23Jun 2, 2024Updated 2 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- Eigendecomposition-free Training of Deep Networks with Zero Eigenvalue-based Losses (ECCV 2018)☆16Aug 14, 2019Updated 6 years ago
- python solver for tangram puzzles☆14Jan 3, 2019Updated 7 years ago
- Reinforcement Learning with Turtlebot in Gazebo☆21Sep 23, 2021Updated 4 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- Action Value Gradient Algorithm☆28May 18, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents☆24Aug 4, 2025Updated 10 months ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment☆15Jul 1, 2018Updated 7 years ago
- Quickly create a UI for any python file with a CLI☆17May 11, 2026Updated last month
- Weight Agnostic Neural Networks (in Python)☆18Jun 17, 2019Updated 7 years ago
- The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation (NeurIPS 2021) by Alex J. Chan, Ioana Bica, Alihan Huyuk…☆29Jan 5, 2022Updated 4 years ago
- Reproduced Detailed-VideoAvatar project for 3D Human-Body Reconstruction☆23Jan 12, 2019Updated 7 years ago
- VanillaJS-based Web Components for the IndieWeb☆14May 27, 2017Updated 9 years ago
- ☆18Jul 13, 2022Updated 3 years ago
- ☆17Feb 18, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Value & Policy Iteration for the frozenlake environment of OpenAI☆15May 14, 2019Updated 7 years ago
- A Clinic Bubbleprof example☆14Aug 21, 2020Updated 5 years ago
- Series of CLI tools for Hyperswarm☆18Jun 18, 2022Updated 4 years ago
- CLI tool to update a package lock file☆12Jun 18, 2020Updated 6 years ago
- Generates ffi-compatible layer for your rust code☆11Jul 4, 2020Updated 5 years ago
- ☆19Mar 6, 2012Updated 14 years ago
- Utility methods related to public key cryptography to be used with distributed mutable storage☆15May 13, 2020Updated 6 years ago