A simple stochastic OpenAI environment for training RL agents
☆87Feb 8, 2023Updated 3 years ago
Alternatives and similar repositories for banana-gym
Users that are interested in banana-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Nov 28, 2018Updated 7 years ago
- ☆306Apr 2, 2023Updated 3 years ago
- Qualitative Numeric Planning☆10Dec 10, 2020Updated 5 years ago
- My notes on reinforcement learning papers☆15Jun 14, 2018Updated 7 years ago
- An introduction to CUDA programming by way of a Boids Flocking simulation☆11Jun 30, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 5GTANGO Smart Manufacturing Pilot☆13May 1, 2023Updated 3 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆32Apr 15, 2019Updated 7 years ago
- OpenAI Gym Environment for 2048☆17Dec 4, 2022Updated 3 years ago
- Anomaly detection system for Datadog multiple metrics☆23Nov 11, 2016Updated 9 years ago
- ☆16Mar 10, 2018Updated 8 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆27Jun 20, 2019Updated 6 years ago
- Pref-RL provides ready-to-use PbRL agents that are easily extensible.☆11Aug 31, 2022Updated 3 years ago
- Benchmark environments for reward modelling and imitation learning algorithms.☆46Sep 19, 2023Updated 2 years ago
- Neuroevolution as a direct policy search deep reinforcement learning method, implemented using Keras and DEAP.☆71Mar 17, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MultiLexNorm 2021 competition system from ÚFAL☆16Dec 30, 2021Updated 4 years ago
- A Python framework for building HTTP-based server applications☆16Apr 2, 2026Updated 2 months ago
- Replication of Uber Neuroevolution paper☆46Apr 14, 2018Updated 8 years ago
- we implemented a model to predict the market price of a nonlinear chaotic time series,using reinforcement learning☆17Dec 4, 2018Updated 7 years ago
- Deep Reinforcement Learning for Keras.☆5,553Sep 17, 2023Updated 2 years ago
- Incorporates external dependencies into HTML file using data: URI scheme☆21Nov 17, 2011Updated 14 years ago
- Simple demo for running LocalStack in Gitpod☆11Nov 17, 2023Updated 2 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Gymnasium environment for research of UAVs and risk constraints☆12Oct 29, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Dec 19, 2022Updated 3 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆13Dec 6, 2022Updated 3 years ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- Flux baselines: Implementations of reinforcement learning algorithms using Flux☆29Apr 8, 2020Updated 6 years ago
- ☆16May 4, 2021Updated 5 years ago
- Code to accompany "Conformal Prediction as Bayesian Quadrature" by Jake Snell & Tom Griffiths (ICML 2025 Outstanding Paper)☆24Jul 14, 2025Updated 10 months ago
- A public repo for a docker image to speed up docker tests for Pipenv.☆12Sep 23, 2018Updated 7 years ago
- This repo contains the original implementation of VAuLT, the Vision-and-Augmented-Language Transformer. We provide instructions to downlo…☆18Sep 23, 2025Updated 8 months ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Projeto desenvolvido para explicar os conceitos de SOLID - Palestra TDC☆10Mar 24, 2022Updated 4 years ago
- ☆14Mar 2, 2021Updated 5 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Nov 15, 2019Updated 6 years ago
- An easy-to-use framework to turn any neural network definition in PyTorch into a Bayesian neural network.☆13Nov 24, 2023Updated 2 years ago
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆375Oct 9, 2023Updated 2 years ago
- ☆15Oct 16, 2020Updated 5 years ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆23Jul 6, 2023Updated 2 years ago