AI learning to walk in gym's BipedalWalker environment.
☆66Jun 29, 2017Updated 9 years ago
Alternatives and similar repositories for bipedal-es
Users that are interested in bipedal-es are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- random search, hill climbing, policy gradient☆145Sep 17, 2018Updated 7 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- [AAAI 2024 (Oral)] Safety-MuJoCo Environments.☆11Jun 4, 2024Updated 2 years ago
- A parallel version of Trust Region Policy Optimization☆65Mar 6, 2017Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Applying Reinforcement learning models for stock price predictions☆25Nov 2, 2018Updated 7 years ago
- Code for cd0377- Introduction to Natural Language Processing taught by Luis Serrano and Arpan Chakraborty☆19Dec 11, 2025Updated 6 months ago
- Automated deep learning!☆26Oct 6, 2017Updated 8 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- Let AI play snake game with evolutionary algorithm☆32Sep 14, 2023Updated 2 years ago
- Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.☆12Dec 24, 2016Updated 9 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆360Feb 3, 2020Updated 6 years ago
- SplitNet implemented based on ResNet-50 trained on ImageNet-22K☆16Jun 18, 2018Updated 8 years ago
- This package aims to make development with ML-Agents quicker and easier.☆10Oct 31, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Jul 29, 2021Updated 4 years ago
- Github Action to scrape an RSS feed to display on a Github Pages website☆13May 1, 2025Updated last year
- ☆16Feb 19, 2025Updated last year
- Modularized Implementation of Deep RL Algorithms in PyTorch☆3,429Apr 16, 2024Updated 2 years ago
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.☆23Jun 16, 2023Updated 3 years ago
- pass game protect☆11Apr 26, 2014Updated 12 years ago
- A chess adaption of GCP's Leela Zero☆14Jan 9, 2018Updated 8 years ago
- PhysioNet 2019 Challenge: Early Prediction of Sepsis from Clinical Data☆12May 19, 2019Updated 7 years ago
- Learning to trade under the reinforcement learning framework☆518Oct 15, 2016Updated 9 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Using a shared file to exchange data between Unity and Python☆13Oct 30, 2021Updated 4 years ago
- Towards Formalizing RL Theory☆54Jun 26, 2026Updated last week
- An opensource implementation of kanerva coding for use in reinforcement learning research☆11Mar 28, 2026Updated 3 months ago
- Windows hidden thread suspend POC with code injection☆12May 27, 2017Updated 9 years ago
- Caffe/Neon prototxt training file for our Neurocomputing2017 work: Fuzzy Quantitative Deep Compression Network☆11May 30, 2018Updated 8 years ago
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms.☆16Nov 21, 2022Updated 3 years ago
- ☆10Nov 23, 2020Updated 5 years ago
- Installing Sensable Phantom devices in Linux.☆10Jul 5, 2019Updated 7 years ago
- A collection of awesome projects using MuJoCo.☆17May 27, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- (Experimental) ROS packages for Blue + Gazebo☆15Aug 4, 2019Updated 6 years ago
- ☆17Jul 11, 2020Updated 5 years ago
- android got hook under version 5.0☆12Jun 13, 2019Updated 7 years ago
- Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l…☆10May 5, 2022Updated 4 years ago
- 📝 A personal collection of templates for Markdown+LaTeX-based writing.☆16Oct 11, 2018Updated 7 years ago
- Random memory adaptation model inspired by the paper: "Memory-based parameter adaptation (MbPA)"☆24Mar 13, 2018Updated 8 years ago
- trading by Deep Q-Network☆15Oct 20, 2016Updated 9 years ago