solution to cartpole problem of openAI gym with different approaches
☆12Jul 26, 2018Updated 7 years ago
Alternatives and similar repositories for CartPole-OpenAI-GYM
Users that are interested in CartPole-OpenAI-GYM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch official implementation for Imitating Unknown Policies via Exploration.☆14Oct 3, 2023Updated 2 years ago
- This is a project for creating and using IL datasets based on HuggingFace weights with multithreads for performance, and benchmarking☆13Mar 10, 2026Updated last month
- Implementation of Behavioral Cloning from Observationmentation☆16Nov 28, 2019Updated 6 years ago
- Code for controlling a Kinova Gen3 Manipulator via Drake.☆33Sep 20, 2023Updated 2 years ago
- ☆13May 25, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆15May 28, 2025Updated 10 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 11 months ago
- ☆14Jul 13, 2025Updated 9 months ago
- Integrates Imbue's Cost Aware pareto-Region Bayesian Search (CARBS) with Weights and Biases (WanDB)☆12Mar 17, 2025Updated last year
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"☆28Dec 12, 2025Updated 4 months ago
- Clone of the classic Snake game as an OpenAI Gym environment☆14Feb 12, 2018Updated 8 years ago
- Silicitect, model neural network architectures in JavaScript for silicon hardware.☆21Oct 10, 2016Updated 9 years ago
- ☆10Jun 27, 2024Updated last year
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Feb 23, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Jan 21, 2020Updated 6 years ago
- This is a watcher to rerun scripts, execute tests and run lint after you change a directory or a file.☆25Dec 28, 2024Updated last year
- ☆15Mar 2, 2025Updated last year
- Use Javascript via the Python language☆14Dec 7, 2022Updated 3 years ago
- Official implementation of ICML paper Imitating Latent Policies from Observation☆75May 13, 2019Updated 6 years ago
- ☆12Nov 5, 2024Updated last year
- ☆16Sep 22, 2024Updated last year
- Python script that will take two manga pages, identify all the regions where the two are different, and let you choose which parts of whi…☆10Mar 28, 2021Updated 5 years ago
- ☆76Mar 12, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆18Nov 24, 2025Updated 4 months ago
- Pointax: PointMaze Environment for JAX☆27Oct 22, 2025Updated 5 months ago
- ☆15Sep 4, 2025Updated 7 months ago
- A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning☆15Oct 22, 2023Updated 2 years ago
- Minimal PyTorch implementation of TP, SP, FSDP and sharded-EMA☆32Nov 27, 2025Updated 4 months ago
- Deep memory and sequence models in JAX☆25Apr 13, 2026Updated last week
- ☆21Apr 2, 2025Updated last year
- Formate converter from one type of qa task datasets to another type☆39Dec 31, 2018Updated 7 years ago
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆34Jul 8, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆11Nov 4, 2024Updated last year
- High quality implementations of imitation and inverse reinforcement learning algorithms☆24Aug 19, 2025Updated 8 months ago
- Learning material for the DP-100 exam☆37Feb 15, 2021Updated 5 years ago
- ☆18Jul 4, 2025Updated 9 months ago
- PoE-World: Compositional World Modeling with Products of Programmatic Experts☆44Feb 5, 2026Updated 2 months ago
- OpenAI Gym interface for Universal Robots with ROS Gazebo☆19Jan 12, 2024Updated 2 years ago
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning☆14Jun 19, 2024Updated last year