A simple stochastic OpenAI environment for training RL agents
☆87Feb 8, 2023Updated 3 years ago
Alternatives and similar repositories for banana-gym
Users that are interested in banana-gym are comparing it to the libraries listed below.
- ☆12Nov 28, 2018Updated 7 years ago
- ☆306Apr 2, 2023Updated 3 years ago
- My notes on reinforcement learning papers☆15Jun 14, 2018Updated 7 years ago
- Any Stream to Reinforcement Learning Environment (Time Series Data, Stock Market)☆11Oct 10, 2018Updated 7 years ago
- An introduction to CUDA programming by way of a Boids Flocking simulation☆11Jun 30, 2020Updated 5 years ago
- OpenAI Gym Environment for 2048☆17Dec 4, 2022Updated 3 years ago
- This repository contains my MSc dissertation project. It is an implementation of a streaming GMM algorithm in Spark.☆11Aug 25, 2018Updated 7 years ago
- Open AI Gym version of Berkeley AI Pacman with images as states☆13May 4, 2018Updated 7 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆27Jun 20, 2019Updated 6 years ago
- Pref-RL provides ready-to-use PbRL agents that are easily extensible.☆11Aug 31, 2022Updated 3 years ago
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- Benchmark environments for reward modelling and imitation learning algorithms.☆46Sep 19, 2023Updated 2 years ago
- Manuscript of the book 《人工智能法规、伦理与社会影响》 (AI Regulations, Ethics, and Social Impact)☆14Aug 28, 2021Updated 4 years ago
- Replication of Uber Neuroevolution paper☆46Apr 14, 2018Updated 8 years ago
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago
- Deep Reinforcement Learning for Keras.☆5,556Sep 17, 2023Updated 2 years ago
- Dorling Cartogram and Map Widget for React☆17Dec 7, 2022Updated 3 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Gymnasium environment for research of UAVs and risk constraints☆12Oct 29, 2024Updated last year
- Home of the PipeGraph extension to Scikit-Learn☆24Mar 16, 2025Updated last year
- ☆12Mar 18, 2024Updated 2 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆13Dec 6, 2022Updated 3 years ago
- ☆11Feb 29, 2024Updated 2 years ago
- code for "Multi-modality contrastive learning for sarcopenia screening from hip X-rays and clinical information" in MICCAI 2023☆17Dec 3, 2025Updated 4 months ago
- ☆10Sep 3, 2021Updated 4 years ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 7 years ago
- MuJoCo model for Blue☆10Mar 13, 2020Updated 6 years ago
- All the tools that allow me to never ever open up Final Cut☆11Feb 16, 2025Updated last year
- Code to accompany "Conformal Prediction as Bayesian Quadrature" by Jake Snell & Tom Griffiths (ICML 2025 Outstanding Paper)☆23Jul 14, 2025Updated 9 months ago
- (DEPRECATED, migrated to main repo - hasktorch/hasktorch) Research code generation / FFI binding using libtorch 1.x for the next Hasktor…☆11Sep 13, 2019Updated 6 years ago
- A collection of multi agent environments based on OpenAI gym.☆630Jul 7, 2024Updated last year
- Hard drive failure prediction using SMART data set☆16Sep 30, 2020Updated 5 years ago
- Code for Learning to Defer to Multiple Experts: Consistent Surrogate Losses, Confidence Calibration, and Conformal Ensembles [AISTATS'23]☆13Jul 28, 2023Updated 2 years ago
- This repo contains the original implementation of VAuLT, the Vision-and-Augmented-Language Transformer. We provide instructions to downlo…☆18Sep 23, 2025Updated 7 months ago
- Reproduction of "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization" for the Reproducibility challenge@NeurIPS…☆11Jan 14, 2020Updated 6 years ago
- A cookie clicker clone, but multiplayer.☆18Dec 29, 2013Updated 12 years ago
- Official repository with the examples from the book "Conectividade LoRaWAN - fundamentos e prática"☆10Jul 23, 2023Updated 2 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Project developed to explain the SOLID concepts - TDC talk☆10Mar 24, 2022Updated 4 years ago