AI learning to walk in gym's BipedalWalker environment.
☆67Jun 29, 2017Updated 8 years ago
Alternatives and similar repositories for bipedal-es
Users that are interested in bipedal-es are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A fast Evolution Strategy implementation in Python☆274Apr 27, 2020Updated 6 years ago
- An AI agent Learning to play Flappy Bird using Evolution Strategies and deep learning models.☆45Sep 30, 2020Updated 5 years ago
- random search, hill climbing, policy gradient☆145Sep 17, 2018Updated 7 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- [AAAI 2024 (Oral)] Safety-MuJoCo Environments.☆11Jun 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Oct 6, 2020Updated 5 years ago
- Applying Reinforcement learning models for stock price predictions☆25Nov 2, 2018Updated 7 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- Safe Multi-Agent Robosuite benchmark for safe multi-agent reinforcement learning research.☆25Jun 13, 2024Updated last year
- Let AI play snake game with evolutionary algorithm☆32Sep 14, 2023Updated 2 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆360Feb 3, 2020Updated 6 years ago
- Analyzes and adjusts the volume of MP3 files☆12Apr 7, 2019Updated 7 years ago
- ☆31Feb 26, 2024Updated 2 years ago
- Large Language Models and Robotics.☆22Apr 27, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code release for the paper "Calibrating Energy-based Generative Adversarial Networks"☆24Oct 31, 2017Updated 8 years ago
- Berkeley DeepRL Homework☆11Aug 13, 2017Updated 8 years ago
- Code for the Deep Learning with PyTorch lesson☆366Jun 28, 2022Updated 3 years ago
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15☆12Apr 17, 2017Updated 9 years ago
- ☆12Jul 3, 2021Updated 4 years ago
- ☆11Jul 29, 2021Updated 4 years ago
- Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"☆1,629Oct 31, 2019Updated 6 years ago
- This is an implimentation of Value Iteration Networks (NIPS2016 best paper) in keras☆17Jan 6, 2018Updated 8 years ago
- ☆57Dec 2, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the code for the 'Tensorflow Neural Network' Live session by @Sirajology on Youtube☆31Jul 6, 2019Updated 6 years ago
- ☆16Feb 19, 2025Updated last year
- AndroidSlicer is a dynamic slicing tool, useful for a variety of tasks, from testing to debugging to security.☆14Jul 28, 2019Updated 6 years ago
- Modularized Implementation of Deep RL Algorithms in PyTorch☆3,425Apr 16, 2024Updated 2 years ago
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.☆23Jun 16, 2023Updated 2 years ago
- automated machine learning toolkit☆15Apr 8, 2018Updated 8 years ago
- PhysioNet 2019 Challenge: Early Prediction of Sepsis from Clinical Data☆12May 19, 2019Updated 7 years ago
- Learning to trade under the reinforcement learning framework☆519Oct 15, 2016Updated 9 years ago
- Using a shared file to exchange data between Unity and Python☆13Oct 30, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Emotiv SDK Community Edition☆13Oct 9, 2015Updated 10 years ago
- An opensource implementation of kanerva coding for use in reinforcement learning research☆11Mar 28, 2026Updated last month
- Windows hidden thread suspend POC with code injection☆12May 27, 2017Updated 8 years ago
- Caffe/Neon prototxt training file for our Neurocomputing2017 work: Fuzzy Quantitative Deep Compression Network☆11May 30, 2018Updated 7 years ago
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms.☆16Nov 21, 2022Updated 3 years ago
- ☆10Nov 23, 2020Updated 5 years ago
- Installing Sensable Phantom devices in Linux.☆10Jul 5, 2019Updated 6 years ago