AI learning to walk in gym's BipedalWalker environment.
☆67 Jun 29, 2017 Updated 8 years ago
Alternatives and similar repositories for bipedal-es
Users that are interested in bipedal-es are comparing it to the libraries listed below.
- A fast Evolution Strategy implementation in Python ☆272 Apr 27, 2020 Updated 5 years ago
- An AI agent learning to play Flappy Bird using Evolution Strategies and deep learning models. ☆45 Sep 30, 2020 Updated 5 years ago
- random search, hill climbing, policy gradient ☆145 Sep 17, 2018 Updated 7 years ago
- This repository is a ROS implementation of the Exponential Moving Average (EMA) formula for velocity smoothing. ☆19 Feb 24, 2024 Updated 2 years ago
- Model-Free Episodic Control ☆14 Jan 12, 2017 Updated 9 years ago
- A Python implementation of tabular MuZero for educational purposes ☆21 Dec 11, 2019 Updated 6 years ago
- [AAAI 2024 (Oral)] Safety-MuJoCo Environments. ☆11 Jun 4, 2024 Updated last year
- A parallel version of Trust Region Policy Optimization ☆65 Mar 6, 2017 Updated 9 years ago
- Applying reinforcement learning models for stock price predictions ☆25 Nov 2, 2018 Updated 7 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO). ☆15 Feb 21, 2021 Updated 5 years ago
- Safe Multi-Agent Robosuite benchmark for safe multi-agent reinforcement learning research. ☆26 Jun 13, 2024 Updated last year
- Prioritized Experience Replay (PER) implementation in PyTorch ☆359 Feb 3, 2020 Updated 6 years ago
- Large Language Models and Robotics. ☆22 Apr 27, 2024 Updated last year
- Code release for the paper "Calibrating Energy-based Generative Adversarial Networks" ☆24 Oct 31, 2017 Updated 8 years ago
- This package aims to make development with ML-Agents quicker and easier. ☆10 Oct 31, 2019 Updated 6 years ago
- ☆12 Jul 3, 2021 Updated 4 years ago
- Video about NP-completeness, circuit SAT and "reversing time" ☆15 Aug 18, 2024 Updated last year
- ☆57 Dec 2, 2021 Updated 4 years ago
- This is the code for the 'Tensorflow Neural Network' Live session by @Sirajology on Youtube ☆31 Jul 6, 2019 Updated 6 years ago
- ☆16 Feb 19, 2025 Updated last year
- AndroidSlicer is a dynamic slicing tool, useful for a variety of tasks, from testing to debugging to security. ☆14 Jul 28, 2019 Updated 6 years ago
- Code to study the generalisability of benchmark models on non-stationary EHRs. ☆15 Aug 7, 2019 Updated 6 years ago
- Modularized Implementation of Deep RL Algorithms in PyTorch ☆3,420 Apr 16, 2024 Updated last year
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL. ☆23 Jun 16, 2023 Updated 2 years ago
- ☆11 May 24, 2024 Updated last year
- pass game protect ☆12 Apr 26, 2014 Updated 11 years ago
- Notes for references from internet websites. ☆13 Dec 23, 2014 Updated 11 years ago
- Learning to trade under the reinforcement learning framework ☆517 Oct 15, 2016 Updated 9 years ago
- Thompson Sampling for Bandits using UCB policy ☆10 Jul 29, 2017 Updated 8 years ago
- [ICRA 2025] We present a Morphology-Informed Heterogeneous Graph Neural Network (MI-HGNN) for learning-based contact perception. The arch… ☆38 Feb 2, 2026 Updated 2 months ago
- Chess reinforcement learning by AlphaZero methods. ☆40 Jan 6, 2018 Updated 8 years ago
- Repository for the Udacity Deep Reinforcement Learning Nanodegree ☆12 Jul 9, 2019 Updated 6 years ago
- Emotiv SDK Community Edition ☆12 Oct 9, 2015 Updated 10 years ago
- An open-source implementation of Kanerva coding for use in reinforcement learning research ☆11 Mar 28, 2026 Updated last week
- Windows hidden thread suspend POC with code injection ☆12 May 27, 2017 Updated 8 years ago
- 6-DoF wheeled biped robot ☆18 Jan 19, 2022 Updated 4 years ago
- lanmt ebm ☆12 Jun 19, 2020 Updated 5 years ago
- Installing Sensable Phantom devices in Linux. ☆10 Jul 5, 2019 Updated 6 years ago
- A collection of awesome projects using MuJoCo. ☆16 May 27, 2025 Updated 10 months ago